RSC Cluster: Data Integrity, Version Control and Audit Trails

The Data Integrity, Version Control and Audit Trails Cluster provides the governance foundation that makes operational data trustworthy. It explains permissions, approvals, revision control, and immutable audit trails across internal and supplier-facing workflows. The content makes responsibility and evidence preservation explicit. This cluster ensures credibility with auditors, customers, and internal stakeholders.

  • Context dependence

    Context dependence commonly refers to the idea that the meaning, impact, or validity of something changes based on the surrounding situation. In industrial and manufacturing environments, it often describes how data, rules, or decisions cannot be interpreted correctly without understanding the operational context in which they were created or used.

    Context dependence in manufacturing and operations

    In regulated manufacturing, many elements are context dependent, including:

    • Data and metrics: A cycle time, scrap rate, or OEE number may be acceptable in one product family, shift, plant, or configuration, but indicate a problem in another. The same value can signal different things depending on volume, mix, revision level, or customer requirements.
    • Specifications and limits: Tolerances, control limits, or inspection requirements often depend on product revision, customer, export classification, risk level, or process capability. A rule that applies to one program or part number may not apply to another.
    • Work instructions and routings: Steps can vary based on machine availability, operator certification, tooling configuration, or material lot. A routing that is valid for one work center setup might be non-compliant in a different setup.
    • Alarms, events, and exceptions: System alerts that are critical during a qualification run might be informational during routine, stable production, or vice versa.
    • Quality decisions: Disposition of a nonconformance can depend on end use, customer contract, criticality of the feature, or concurrent deviations and waivers.

    Operational systems such as MES, QMS, ERP, and data historians often need to model and store this context (for example, product, revision, machine, date, shift, operator, or customer) so that data and decisions can be interpreted correctly later for analysis, audits, and investigations.

    Why context dependence matters in regulated environments

    In regulated and aerospace or defense environments, context dependence is especially important for:

    • Traceability and genealogy: Being able to reconstruct not only what was done, but under which conditions, with which instructions, versions, and approvals.
    • Audit trails: Showing which rule or specification applied at the time and why a specific decision was reasonable in that context.
    • Change control and document control: Ensuring that the correct version of a procedure or drawing is applied only in the contexts where it is authorized.
    • Analytics and KPIs: Avoiding misleading comparisons by normalizing or segmenting metrics according to product mix, process changes, or configuration differences.

    Common confusion

    • Context dependence vs. variability: Variability is a change in results over time or between units. Context dependence is about how the interpretation of those results depends on surrounding conditions, even if the raw values are the same.
    • Context dependence vs. configuration: Configuration captures the defined setup (for example, machine model, software version). Context dependence is broader and includes situational factors like customer, order, program, or regulatory regime that influence how rules and data should be understood.

    System and workflow considerations

    From a systems perspective, handling context dependence often involves:

    • Capturing contextual attributes alongside transactional data (for example, part number, revision, route, work center, operator, shift, lot, customer, export classification).
    • Defining rules, specifications, or workflows that activate only under specific contextual conditions.
    • Ensuring reports, dashboards, and investigations can filter and segment by relevant context so that comparisons are meaningful.
    • Maintaining versioned records so that the governing context at the time of execution can be reconstructed later.

    Failing to account for context dependence can lead to misinterpretation of performance, incorrect application of requirements, or incomplete evidence during audits and root cause investigations.

  • human-in-the-loop

    Core meaning

    Human-in-the-loop (HITL) commonly refers to a design approach in which human operators remain actively involved in reviewing, approving, or overriding the outputs of an automated system. The system is intentionally built so that critical decisions, or the authority to enact them, pass through a human rather than being executed fully autonomously.

    In industrial and manufacturing contexts, this typically means that automated analytics, optimization engines, or AI models generate recommendations or preliminary decisions, while qualified personnel confirm, adjust, or reject those outputs before they affect production, quality disposition, or regulatory records.

    Use in industrial and regulated environments

    In operations and manufacturing systems, human-in-the-loop is often applied to:

    – **AI and advanced analytics**: Models propose setpoint changes, maintenance actions, or quality classifications, but engineers or supervisors must review and approve before implementation in MES, DCS, or ERP.
    – **Workflow and MES steps**: Systems may route exceptions, deviations, or out-of-trend results to an operator for assessment and decision, even if detection or triage is automated.
    – **Electronic records and release decisions**: Automated checks (e.g., specification limits, rule engines) can flag issues, but batch release, disposition, or corrective actions are confirmed by authorized personnel.
    – **Safety- and compliance-relevant actions**: Any action that could significantly affect product quality, patient safety, or regulatory records is kept under explicit human authority, even when automation supports the decision.

    Human-in-the-loop designs are commonly supported by:

    – Clear roles and responsibilities for who can approve or override automated outputs
    – System features that pause execution until human review is recorded
    – Audit trails documenting human decisions and rationale
    – Interfaces that present model or system outputs in a way that is understandable to the responsible user

    Boundaries and what it is not

    Human-in-the-loop:

    – **Is**: A governance and interaction pattern where humans remain accountable decision-makers while using automation as an input.
    – **Is not**: Purely manual operation without automation; by definition, there is an automated or algorithmic component being supervised.
    – **Is not**: Fully autonomous control, where a system executes actions without any human review or approval within a defined decision loop.

    It can coexist with other patterns, such as:

    – **Human-on-the-loop**: Humans monitor high-level behavior and can intervene but are not required to approve each action.
    – **Human-out-of-the-loop**: Systems operate autonomously in defined domains without real-time human oversight.

    Common confusion and related terms

    – **Automation vs. autonomy**: Human-in-the-loop systems may be highly automated but are not fully autonomous, because critical decisions still involve human review.
    – **Decision support vs. decision automation**: With HITL, automation typically provides decision support (recommendations, scores, classifications), while the final decision authority remains with a human.
    – **Explainability**: HITL does not guarantee that an AI model is explainable, but it is frequently combined with explainability techniques so humans can understand and justify their approvals.

    Application in MES and AI integration (site context)

    When AI models are integrated with MES or other production systems, human-in-the-loop commonly means:

    – AI outputs (e.g., recommended parameters, predicted quality outcomes, suggested holds) are presented as advisory information.
    – MES workflows require a human to confirm, adjust, or reject these AI suggestions before they change master data, execution parameters, or product status.
    – Change control, access control, and audit trails record both the AI recommendation and the human decision as part of the electronic record.

    This pattern is often used to maintain clear decision accountability, support investigations, and align AI-enabled operations with existing procedures and regulatory expectations.

  • control recipe

    A control recipe is the fully specified, executable instance of a recipe used to run a particular batch, lot, or production order in an automated or semi-automated manufacturing system. The term is most commonly associated with ISA‑88 (S88) batch control, but the concept also appears in other recipe-driven production environments.

    What a control recipe includes

    In an S88-style model, a control recipe typically contains:

    • Identification information, such as recipe ID, batch ID, product code, and version
    • The selected master recipe reference (the standard definition of how to make the product)
    • Specific parameter values for this run, such as quantities, target setpoints, and timing
    • Selected equipment and units, consistent with the plant’s equipment model
    • Scheduling and sequencing information for the batch or order
    • Links to required materials, intermediates, and consumables
    • Required checks, holds, or approvals, such as quality checks or signoffs
    • Reporting requirements, such as which data must be logged as batch records

    The control recipe is what the control system (for example a batch engine, DCS, or MES) actually uses to orchestrate and monitor the production run.

    Role in operations

    Operationally, a control recipe connects the product definition to the plant floor:

    • It is created from a master recipe when a specific batch, lot, or order is planned.
    • It may be instantiated or managed by a batch management system, MES, or a dedicated recipe management system.
    • It provides the control system with all necessary details to execute, monitor, and log the process.
    • It often forms part of the electronic batch record or production history for compliance and traceability.

    In regulated or highly controlled environments, version control and approval workflows are commonly applied to control recipes and the parameters they contain.

    What a control recipe is not

    • It is not the generic product definition. That is usually the domain of a master recipe or product specification.
    • It is not the low-level equipment logic (for example PLC code or DCS configuration), although it references and drives that logic.
    • It is not the long-term production plan or finite schedule, but it may be generated as part of scheduling.

    Common confusion

    Control recipe vs. master recipe: In S88, a master recipe is the generalized, approved description of how to manufacture a product. A control recipe is a specific, executable instance of that master recipe for a particular run, with concrete parameter values and equipment selections.

    Control recipe vs. formula or BOM: A formula or bill of materials usually lists materials and quantities. A control recipe includes those, but also embeds the procedural steps, setpoints, and runtime instructions needed by the control system.

    Connection to ISA‑88 (S88)

    Within the ISA‑88 framework, control recipes are one of the defined recipe types. They support the separation of:

    • What is being made (product and master recipe)
    • How the plant equipment is organized (equipment model)
    • How a given batch is actually executed in the control system (control recipe)

    In practice, this separation allows plants to maintain standard master recipes while generating many different control recipes adapted to specific batch sizes, equipment availability, and order requirements.

  • Information Asset

    An information asset is any collection of data, document, system, or technology resource that has value to an organization and therefore needs to be identified, managed, and protected. In industrial and regulated manufacturing environments, information assets include both digital and physical records that support operations, quality, safety, and compliance.

    What an information asset includes

    In manufacturing and industrial operations, information assets commonly include:

    • Production data, such as batch records, machine parameters, and process histories
    • Quality and compliance documentation, including nonconformance reports, CAPA records, and audit trails
    • Engineering and technical information, such as drawings, specifications, PLC logic, and recipes
    • Systems and databases, for example MES, historian databases, ERP master data, and LIMS data sets
    • Standard operating procedures, work instructions, and training records
    • Supplier and customer data, including material certificates and delivery records
    • Cyber-physical information, such as OT network configurations, access control lists, and system logs

    The term can apply to both structured information (databases, configuration files, log records) and unstructured information (PDF procedures, email threads with deviation approvals, scanned paper records).

    What an information asset is not

    An information asset is not just any piece of data. It typically:

    • Has identifiable business, operational, safety, or compliance value
    • Has an owner or custodian responsible for it
    • Requires control over accuracy, access, retention, and, in some cases, confidentiality

    For example, transient sensor noise that is never stored or used would not usually be treated as an information asset, while archived sensor trends used for investigations and optimization normally would be.

    Operational meaning in manufacturing

    In practice, classifying something as an information asset means it is brought under formal governance. Typical activities include:

    • Cataloging the asset in an information or asset register, including owner, location, and criticality
    • Defining access control rules, backup and restore needs, and change control requirements
    • Applying document control or data integrity practices, such as version control and audit trails
    • Assigning retention and disposition rules aligned with regulatory and business needs
    • Including the asset in risk assessments for cybersecurity, data integrity, and business continuity

    For OT and IT systems like MES, SCADA, historians, and ERP, the system itself and the data it stores are often treated as separate, but related, information assets.

    Relationship to risk and compliance

    Many information security and data governance frameworks expect organizations to identify and classify information assets before they can manage risks effectively. In regulated manufacturing, this often includes:

    • Identifying which assets contain regulated records, such as electronic batch records or device history records
    • Classifying criticality for safety, product quality, and regulatory reporting
    • Defining controls for data integrity, including traceability, completeness, and accuracy
    • Flagging assets that contain export-controlled or confidential technical data

    Common confusion

    • Information asset vs. physical asset: A physical asset is a tangible resource such as a machine, robot, or building. An information asset is the data and information associated with these resources, such as maintenance histories, calibration records, or control programs.
    • Information asset vs. record: A record is a specific type of information that provides evidence of an activity or decision. An information asset can be broader and may include records, reference data, configuration data, and working documents.
    • Information asset vs. data: Not all raw data is treated as an asset. An information asset usually represents data that has recognized value, context, and management responsibilities.
  • How do I align equipment and MES timestamps in multi-plant environments?

    You align them by treating time synchronization as a controlled architecture problem, not just an IT setting.

    In practice, that means using a common time standard across plants, defining which system is authoritative for which event, preserving the original source timestamp, and monitoring drift continuously. If you skip any of those steps, timestamps may look aligned in reports while still being unreliable for genealogy, downtime analysis, batch history, or exception investigation.

    What usually works

    • Standardize time synchronization at every plant using a defined enterprise approach, typically NTP and, where higher precision is required, PTP for specific equipment or networks.

    • Use a common reference such as UTC for storage and integration, while handling local plant time zones only at the presentation layer.

    • Keep the original source timestamp, source system identifier, timezone or offset, and timestamp receipt time in the MES or data platform.

    • Define event precedence rules. For example, machine cycle completion may be authoritative from the equipment controller, while operator signoff time may be authoritative from MES.

    • Set drift thresholds and alerts so plants can detect when a PLC, SCADA node, historian, edge gateway, or workstation falls out of tolerance.

    • Document synchronization behavior under loss of network connectivity, failover, daylight saving changes, and system restart conditions.

    What not to assume

    Do not assume all assets can be synchronized to the same accuracy. Older controllers, isolated cells, vendor black boxes, and manually entered records may only support coarse or inconsistent time behavior. Some devices timestamp at event creation, others at scan cycle, poll cycle, message transmission, or MES receipt. Those are not equivalent.

    Do not assume a central MES can correct every problem after the fact. If the equipment clock is wrong, the network buffers messages, or an integration layer rewrites timestamps on ingest, the resulting sequence may be misleading even if the final dashboard looks clean.

    Recommended architecture for multi-plant use

    • Enterprise time policy: define approved time sources, allowed protocols, timezone handling, and acceptable drift by system class.

    • Plant-level synchronization design: account for segmented OT networks, firewalls, DMZs, offline cells, and vendor support boundaries.

    • Authoritative event model: specify whether each key event comes from equipment, MES, historian, QMS transaction, or operator input.

    • Dual-timestamp pattern where needed: retain both event-occurrence time and system-ingest time.

    • Data quality monitoring: track drift, missing offsets, duplicate timestamps, out-of-order events, and daylight saving anomalies.

    • Change control and validation: test timestamp behavior after patching, controller replacement, interface changes, or historian reconfiguration.

    Important tradeoffs

    Higher precision usually means more design effort and more constraints. PTP can improve precision, but it may require compatible switches, network design changes, and vendor support that are not realistic in every brownfield plant. NTP is easier to deploy broadly, but it may not be sufficient for high-speed sequence-of-events use cases.

    Storing only normalized enterprise time simplifies analytics, but it can make investigations harder if you discard local context or original source values. Keeping both normalized and source values improves traceability, but it adds integration and storage complexity.

    Strict central governance improves consistency across plants, but local exceptions are common because asset age, network segmentation, and qualification constraints vary widely. Over-standardizing without accounting for plant reality often creates workarounds outside controlled systems.

    Brownfield reality

    Most multi-plant environments cannot just replace equipment, historians, or MES interfaces to solve timestamp issues. Full replacement strategies often fail because the qualification burden is high, downtime windows are limited, integrations are deeply entangled, and long asset lifecycles make staged coexistence unavoidable. A phased approach is usually more realistic: standardize time services first, then remediate the highest-risk assets and interfaces, then tighten event rules and monitoring.

    How to validate that alignment is good enough

    Use representative production scenarios, not just lab checks. Test normal operation, network interruption, store-and-forward recovery, shift change, daylight saving transition if applicable, batch completion, manual entry, and interface restart. Compare event order and time deltas across equipment, MES, historian, and downstream reporting. The acceptance threshold should be tied to the business use case. For some KPI reporting, a few seconds may be acceptable. For electronic records, exception reconstruction, or high-speed process correlation, that may not be acceptable.

    If the question is whether timestamps can be made perfectly identical across all plants and systems, the answer is usually no. The goal is controlled, explainable, and fit-for-purpose alignment with documented limits.