RSC Topic: Data Integration & Interoperability

ERP, MES, PLM, QMS, and supplier system mapping and synchronization.

  • Integration boundary

    An integration boundary is the defined limit between two systems, applications, process areas, or data domains where information, events, or control signals are exchanged. It marks where one system’s responsibility ends and another begins.

    In manufacturing and regulated operations, the term commonly refers to the interface between platforms such as MES, ERP, PLM, QMS, LIMS, SCADA, or shop-floor equipment. The boundary may include data structures, message formats, timing rules, ownership of records, validation checks, security controls, and error-handling rules.

    An integration boundary is not the same as the integration itself. The integration is the mechanism or workflow used to connect systems. The boundary is the line that defines what crosses between them, under what conditions, and which system is authoritative for each type of data.

    What it typically includes

    • The systems or process areas on each side of the interface

    • The specific data, documents, events, or transactions that cross the boundary

    • Source-of-record ownership, such as which system owns part masters, work orders, production status, or quality results

    • Trigger points, frequency, and timing, such as real-time, near-real-time, or batch exchange

    • Validation, exception handling, and reconciliation rules

    • Access, security, and audit-related constraints where applicable

    How it appears in operations

    Operationally, an integration boundary often appears when a business process moves from planning to execution, from execution to quality review, or from plant systems to enterprise reporting. For example, ERP may release a production order to MES across an integration boundary, while MES returns completion quantities, labor, material consumption, and traceability data back to ERP.

    Clear integration boundaries help teams describe which records are created in one system, which are only referenced, and which are updated across systems. This is especially important where data integrity, version control, traceability, and controlled changes matter.

    Common confusion

    Integration boundary is commonly confused with system boundary. A system boundary describes what is inside or outside a single system’s scope. An integration boundary focuses on the handoff or exchange point between systems.

    It can also be confused with an API or interface. An API or interface is a technical means of connection. The integration boundary is broader and includes business responsibility, data ownership, and process rules, not just the transport method.

    It is also different from a network boundary in cybersecurity. A network boundary concerns segmentation and communications control between networks or zones. An integration boundary concerns business and application-level exchange, though the two may overlap in OT and IT environments.

  • What data quality checks should we run before publishing KPIs?

    Run data quality checks in four areas before publishing KPIs: metric definition, source data integrity, transformation logic, and operational governance. The right checklist depends on how many systems feed the KPI, how stable your master data is, and whether the metric will be used for management action, customer reporting, or quality evidence.

    At a minimum, do not publish a KPI until you can answer three questions clearly: what exactly is being measured, which system or systems are authoritative, and how exceptions are handled. If those answers are not controlled, the KPI may still be useful for internal exploration, but it is not ready to be treated as a reliable operating signal.

    Minimum checks to run

    • Definition and scope check. Confirm the KPI formula, inclusion and exclusion rules, time window, aggregation level, and population being measured. Many disputes are definition problems, not data problems.

    • Source system authority check. Identify the system of record for each input field. If ERP, MES, PLM, QMS, historian, and manual logs all contribute, document which source wins when values conflict.

    • Completeness check. Measure missing records, null fields, missing shifts, missing work orders, missing machine states, and delayed transactions. A KPI can look stable while silently excluding part of the operation.

    • Timeliness and latency check. Verify data arrival times, refresh frequency, and cutoff logic. Publishing a daily KPI from sources that close at different times can create false variance.

    • Uniqueness and duplicate check. Detect duplicate events, repeated uploads, replayed interface messages, and duplicate production confirmations. This is common in retry-heavy integrations.

    • Validity and range check. Look for impossible or out-of-bounds values such as negative cycle times, future timestamps, scrap quantities above production quantities, or utilization above 100 percent unless the definition explicitly allows overlapping capacity logic.

    • Consistency check. Confirm that units of measure, asset names, shift calendars, reason codes, product identifiers, and status values are normalized across sources. Mixed coding schemes are a common brownfield failure mode.

    • Referential integrity check. Ensure records link correctly to work orders, operations, part numbers, resources, lots, serials, and personnel where applicable. Orphan records can distort both numerator and denominator.

    • Reconciliation check. Compare KPI inputs and outputs against trusted reports from source systems. For example, production counts should reconcile within an agreed tolerance to MES or ERP postings, and quality counts should reconcile to QMS or NCR records.

    • Transformation logic check. Test joins, filters, rollups, timezone handling, calendar boundaries, and business rules in the KPI pipeline. Most KPI defects are introduced during mapping and transformation, not at the dashboard layer.

    • Master data alignment check. Validate work center hierarchies, part master, routing versions, BOM relationships, customer or program mappings, and reason code dictionaries. If master data is unstable, trend analysis may be misleading even when transactions are accurate.

    • Exception handling check. Define how rework, split lots, partial completions, late entries, backflushing, reversals, and corrected quality events affect the KPI. If exception logic is undocumented, the metric will not hold up under scrutiny.

    • Historical stability check. Recalculate prior periods and see whether the KPI changes materially after late transactions arrive. If history is unstable, label the KPI as provisional until the close window is complete.

    • Auditability check. Confirm that calculation version, source extract time, lineage, and approval status are recorded. In regulated operations, traceability of the metric logic matters as much as the displayed number.

    Checks that matter most in brownfield environments

    If your KPIs depend on multiple legacy systems, focus heavily on mapping quality, transaction timing, and code harmonization. Mixed MES, ERP, PLM, QMS, spreadsheets, and manual logs often produce KPI disagreements because the systems were not designed around a shared canonical model.

    Typical failure modes include:

    • the same event recorded in two systems with different timestamps

    • different definitions of completed production, scrap, downtime, or release status

    • manual corrections made in one system but not propagated to others

    • routing or resource changes that break historical comparisons

    • interface retries that create duplicate records

    • work performed offline and entered later in batch form

    That is why KPI publication should usually sit behind controlled data validation and change control, not only dashboard development. Full replacement of legacy stacks is often proposed as the fix, but in regulated, long-lifecycle environments that frequently fails due to qualification burden, validation cost, downtime risk, integration complexity, and the need to preserve traceability across existing processes. In practice, most plants need governed coexistence, not wholesale replacement.

    Set release criteria before the KPI goes live

    A practical approach is to assign release criteria to each KPI. For example:

    • documented formula and owner

    • named systems of record

    • data completeness above a defined threshold

    • reconciliation variance within an agreed tolerance

    • known exceptions documented

    • calculation logic version-controlled and approved

    • user-facing label for provisional versus final values

    The thresholds are site-specific. A near-real-time operational dashboard may tolerate more latency or correction than a KPI used for formal quality review or customer-facing performance commitments.

    What not to assume

    Do not assume a KPI is reliable because the source system is validated, because the dashboard looks consistent, or because the number matches expectations. A validated source application does not automatically validate the extraction, mapping, transformation, aggregation, and presentation layers around it.

    Also do not assume one-time data cleansing is enough. KPI quality degrades when master data changes, new equipment is added, reason codes evolve, integrations are modified, or operators adopt workarounds under schedule pressure. Ongoing monitoring is part of KPI governance.

    Bottom line

    Before publishing KPIs, run checks for definition control, completeness, timeliness, duplicates, validity, consistency, referential integrity, reconciliation, exception logic, and auditability. If you cannot trace the number back to governed logic and trusted source data, publish it as provisional at most, or do not publish it.

  • data integrity

    Core meaning

    Data integrity commonly refers to the degree to which data is complete, accurate, consistent, and reliable across its entire lifecycle. In industrial and regulated manufacturing environments, it describes the trustworthiness of data used to plan, execute, monitor, and release production and quality activities.

    It focuses on whether data correctly reflects what actually happened and whether it can be relied on for decisions, records, and investigations.

    Key characteristics in manufacturing systems

    Data integrity in operations and manufacturing IT/OT systems typically involves:

    – **Accuracy**: Data correctly represents the underlying event or measurement (for example, a temperature reading matches the actual process temperature within defined limits).
    – **Completeness**: All required data is captured (for example, every batch step has a timestamp, operator, and result recorded).
    – **Consistency**: Data values and formats align across systems and over time (for example, the same batch ID and result in MES, historians, and quality systems).
    – **Attributability**: It is clear who or what created, modified, or approved data (for example, user accounts tied to individuals, system IDs tied to specific equipment).
    – **Traceability**: The history of data creation, changes, and usage can be reconstructed (for example, audit trails, version history, and event logs).
    – **Timeliness**: Data is recorded at the time of the activity or within a defined, controlled delay.
    – **Protection from loss or corruption**: Technical and procedural controls prevent unauthorized change, deletion, or distortion (for example, backups, access controls, and integrity checks).

    How the term is used in real workflows

    In regulated and industrial operations, data integrity is discussed when:

    – Designing or validating **MES, LIMS, historians, and ERP integrations**, to ensure that data is not altered, dropped, or misaligned between systems.
    – Configuring **equipment interfaces** (PLCs, SCADA, edge gateways) so that sensor data, setpoints, and results are reliably captured and reconciled.
    – Establishing **electronic batch records** and quality records to ensure that the documented history of a batch or lot is trustworthy.
    – Setting up **user management, access control, and audit trails** to link actions to specific users or systems.
    – Performing **deviations, investigations, and root cause analysis**, where the reliability of logs and records directly affects conclusions.

    In these contexts, data integrity is treated as a property of both the technical design (architecture, interfaces, storage) and the operational controls (procedures, governance, and training).

    Boundaries and exclusions

    Data integrity:

    – **Includes**: How data is captured, transformed, stored, transmitted, retrieved, and retired across OT and IT systems and manual processes.
    – **Includes**: Controls that prevent or detect unauthorized or unintended data creation, modification, or deletion.
    – **Does not inherently include**: Overall system security or cybersecurity strategy, although security controls strongly influence data integrity.
    – **Does not equal**: Data quality projects focused only on analytics normalization or reporting aesthetics; data integrity is about trustworthy, faithful records of operations.

    Common confusion and related terms

    – **Data integrity vs data quality**: Data quality often addresses usefulness for reporting or analytics (for example, standardized codes, completeness for KPIs). Data integrity focuses on whether records truthfully and reliably represent actual events and can be trusted in audits, releases, and investigations. In regulated environments, data integrity is usually treated as more fundamental.
    – **Data integrity vs system validation**: System validation assesses and documents that a computerized system does what it is intended to do. Data integrity is one of the properties expected from a properly validated system but is not the same as validation itself.
    – **Data integrity vs cybersecurity**: Cybersecurity protects systems and data from malicious or unauthorized access. Data integrity overlaps with this domain but also covers non-malicious issues such as configuration errors, interface mapping mistakes, and uncontrolled manual edits.

    Site context: data integrity in MES and equipment integration

    When integrating MES with special process equipment and other OT systems, data integrity considerations typically include:

    – Ensuring **unambiguous mapping** between equipment signals, tags, or data points and MES data structures (for example, operations, parameters, and materials).
    – Designing **interfaces and edge gateways** so that data is not lost during communication outages, buffering, or protocol translation.
    – Preserving **original source data** (for example, raw values and timestamps from equipment) alongside any calculated or summarized values in MES or historians.
    – Implementing **audit trails and event logs** that show when data was collected, transferred, or modified, and by which system or user.
    – Coordinating **time synchronization** across PLCs, data historians, MES, and ERP so that sequences of events can be reconstructed accurately.

    In this setting, discussions of data integrity often drive choices between direct equipment integration, use of gateways, or relying on procedural/manual data capture where automated capture is not feasible or would impose excessive validation burden.

  • Data imputation

    Data imputation is the process of replacing missing data values with estimated, inferred, or rule-based values so a dataset can still be analyzed, reported, or processed by software. It is commonly used in analytics, machine learning, quality reporting, and operational data pipelines when sensor readings, inspection results, timestamps, or transaction fields are incomplete.

    In manufacturing and regulated operations, data imputation usually refers to a data handling method, not to creating original evidence. It can help maintain continuity in calculations, dashboards, and models, but the imputed value is still a substitute for an observed value. For that reason, imputation should be distinguishable from actual recorded shop floor, lab, maintenance, or quality data.

    What it includes

    Data imputation can include simple or advanced approaches, such as:

    • Replacing blanks with a fixed value such as zero, a default code, or a known status

    • Using the mean, median, or most frequent value from similar records

    • Carrying forward the last known reading in time-series data

    • Estimating a value from related variables, historical patterns, or statistical models

    Example: if a production dataset is missing a temperature reading for one interval, an analytics workflow might estimate it from nearby timestamps so trend analysis can continue.

    What it does not mean

    Data imputation does not mean the missing value was actually measured, observed, or verified. It also does not mean source records have been corrected. In quality, traceability, or compliance-sensitive contexts, the original missingness often still matters even if an imputed value is used downstream for analysis.

    Common confusion

    Data imputation is often confused with data cleansing, data correction, and interpolation.

    • Data cleansing is the broader process of improving data quality, which may include standardization, deduplication, and error handling.

    • Data correction usually means fixing a known wrong value based on evidence, rather than estimating a missing one.

    • Interpolation is a specific form of estimation between known points, commonly used in time-series or process data.

    In some disciplines, imputation is discussed mainly as a statistical technique, while in operational systems it may appear as part of ETL, reporting logic, or analytics preprocessing.

    Why it matters in operations systems

    In MES, ERP, historians, and quality systems, missing data can affect KPI calculations, exception reporting, model outputs, and cross-system reconciliation. Data imputation is one way to keep those processes functioning, but it should be handled transparently so users can tell which values were observed and which were estimated.

  • shop-floor data collection

    Shop-floor data collection commonly refers to the capture of operational data at or near the point where manufacturing work is performed. This can include information entered by operators, recorded by machines, scanned from materials or travelers, or generated by connected equipment and sensors.

    In manufacturing environments, the term usually covers data such as production counts, work order status, lot or serial information, material usage, process parameters, downtime events, inspection results, nonconformances, and labor activity. The purpose is to create a current record of what happened on the shop floor, when it happened, where it happened, and often who or what performed the activity.

    It applies across manual, semi-automated, and automated operations. Data may be collected on paper forms, terminals, HMIs, tablets, barcode scanners, PLC-connected systems, MES applications, or other plant systems. The term describes the collection activity itself, not any one software product or device.

    What it includes and excludes

    • Includes: production reporting, machine status capture, operator entries, material traceability scans, in-process quality results, reason codes, and timestamped execution records.

    • Does not necessarily include: analysis, KPI calculation, scheduling, or enterprise planning, although collected data is often used by those functions later.

    • Is not limited to automation: manual entry is still shop-floor data collection if it records manufacturing events at the point of work.

    How it appears in operations and systems

    Operationally, shop-floor data collection is the mechanism that feeds execution, quality, traceability, and performance systems with factual production records. In a connected environment, it often links shop-floor events to MES, ERP, QMS, historians, CMMS, or analytics platforms. For example, a completed operation may trigger quantity reporting to ERP, update a traveler in MES, record a lot genealogy event, and attach inspection evidence for quality review.

    In regulated or traceability-focused environments, the captured record may also support reconstruction of as-built or as-performed history. The term itself does not imply that records are complete, approved, or compliant. It only refers to the gathering of data from manufacturing activity.

    Common confusion

    Shop-floor data collection is often confused with MES, SCADA, or machine monitoring. They are related but not identical.

    • MES manages and records manufacturing execution more broadly. Shop-floor data collection is one capability commonly used within MES.

    • SCADA focuses on supervisory monitoring and control of industrial processes. It may provide data used in shop-floor data collection, but it is not the same concept.

    • Machine monitoring centers on equipment state and performance, while shop-floor data collection also includes labor, materials, quality, and transaction-level production events.

    • Data entry is narrower. Shop-floor data collection may involve manual entry, but also automated capture, scanning, and device integration.

  • data backbone

    A data backbone is the core data and integration structure that connects systems, moves information between them, and keeps shared operational data available across an organization. In manufacturing, it commonly refers to the combination of interfaces, data models, message flows, and governance used to connect systems such as MES, ERP, PLM, QMS, historians, and shop-floor equipment.

    The term usually describes an enterprise or plant-wide foundation rather than a single application. A data backbone may support work orders, material status, specifications, quality records, traceability, equipment data, and production events as they move across OT and IT boundaries. It can be built with middleware, APIs, event streams, service buses, data hubs, or similar integration patterns.

    It should not be confused with a database, a network backbone, or a full digital thread. A database stores data, and a network backbone carries traffic at the infrastructure level. A digital thread is a broader concept about connected lifecycle information and context. A data backbone is the practical data exchange layer that helps make those connections possible.

    In regulated or quality-sensitive environments, the term often implies consistent identifiers, controlled data handoffs, and reliable links between source systems, but it does not by itself indicate compliance or validation status.

  • systems engineering

    Systems engineering commonly refers to an interdisciplinary approach for defining, designing, integrating, verifying, and managing complex systems throughout their lifecycle. It focuses on the whole system, including how people, processes, software, hardware, data, equipment, and external interfaces work together to meet requirements.

    In manufacturing and regulated operations, systems engineering often appears where multiple subsystems must operate as one controlled environment. Examples include production equipment connected to OT networks, MES and ERP integrations, quality data flows, device and software interfaces, and traceability across design, execution, inspection, and maintenance records.

    It includes activities such as requirements management, interface definition, architecture development, integration planning, test and verification planning, change control, and lifecycle coordination. It does not refer only to software development, only to machine design, or only to day-to-day system administration.

    Operational meaning

    Operationally, systems engineering helps structure how a complex manufacturing or industrial system is specified and governed. For example, a plant may treat a line as a system made up of PLCs, SCADA or HMI layers, historians, MES transactions, quality checks, user roles, and ERP handoffs. Systems engineering provides methods for managing dependencies, interfaces, and validation logic across those elements.

    This discipline is commonly used at both project and lifecycle levels. During implementation, it helps define what the system must do and how components connect. After deployment, it supports controlled changes, impact assessment, and coordination between engineering, operations, IT, quality, and suppliers.

    What it includes and excludes

    • Includes: system requirements, architecture, interfaces, integration, verification planning, lifecycle considerations, and cross-functional coordination.

    • Excludes: purely isolated component design, routine equipment maintenance by itself, and general IT support activities unless they are part of the system-level design and management effort.

    Common confusion

    Systems engineering is often confused with industrial engineering, software engineering, or automation engineering. Industrial engineering typically focuses more on process efficiency, flow, labor, and optimization. Software engineering focuses on building software. Automation engineering focuses on control systems and equipment behavior. Systems engineering is broader and is concerned with how all relevant parts fit together and continue to function as an integrated whole.

    It can also be confused with a system integrator. A system integrator is usually a person or company that connects components in practice. Systems engineering is the discipline used to define, coordinate, and manage the integrated system.

  • hybrid architecture

    Hybrid architecture commonly refers to a system design that combines two or more computing environments within one overall solution. In manufacturing, this usually means some applications, data, or control functions remain on-premises or close to the shop floor, while other functions run in the cloud or in enterprise IT platforms.

    This model is common where plants need low-latency execution, equipment connectivity, or local resiliency, but also want centralized analytics, planning, reporting, or multi-site coordination. For example, an MES or edge layer may manage real-time production events locally, while ERP, data warehousing, or advanced analytics operate in a cloud environment.

    Hybrid architecture should not be confused with a single system that is merely accessible remotely. The term describes how the solution is deployed and integrated across environments, not just where users log in. It also differs from a pure cloud or pure on-premises architecture, where most components run in only one environment.

    In industrial systems, the term often matters in discussions about integration, security boundaries, data flows, validation scope, and system ownership between OT and IT teams. The exact split varies by use case, but the core idea is a deliberate mix of local and centralized platforms working together.