Construction Data Strategy Glossary - Common Data, Financial, and Operational Concepts
May 19, 2026

This is a helpful glossary of common terms you will encounter when building data infrastructure and data-driven deliverables for general contractors and specialty contractors/subcontractors. This covers both construction-specific financial terms, operational concepts, and data engineering technicalities.
Data Engineering Terms to Know in Construction
API (Application Programming Interface) A defined interface that allows one software system to communicate with another. Construction platforms like Procore, Autodesk, CMiC, and Sage Intacct provide APIs that allow data pipelines to pull data programmatically.
Bronze Layer The first layer in a medallion architecture. Contains raw data exactly as it arrived from the source system, with no transformation. The source of truth for data recovery if something goes wrong downstream.
Change Data Feed A mechanism that tracks only the records that have been added or modified since the last sync, rather than pulling all records every time. Faster and more efficient than full resyncs, but can miss deletions or fail under certain conditions.
Data Catalog A centralized metadata repository that documents the tables, columns, and definitions in a data warehouse. Improves discoverability and helps AI tools understand the context of the data they are querying.
Data Lake A storage repository that holds large volumes of raw data in its native format. More flexible than a data warehouse but less structured. Often used as an intermediate landing zone before transformation.
Data Lakehouse An architecture that combines the flexibility of a data lake with the structure and query performance of a data warehouse. Used in platforms like Databricks and Microsoft Fabric.
Data Pipeline The series of steps that moves data from a source system to a destination, including extraction, transformation, and loading. A well-designed pipeline runs automatically, handles failures gracefully, and maintains data freshness. Often referred to as a data connector.
Data Warehouse A structured repository for analytical data, organized to support queries and reporting. Separate from operational systems (ERP, PM, and other software) and optimized for read-heavy analytical workloads.
dbt (Data Build Tool) An open-source transformation framework that allows data teams to write, test, and document SQL-based data transformations as version-controlled code. The standard tool for building the silver and gold layers in a modern data warehouse.
ELT (Extract, Load, Transform) A data movement pattern where raw data is extracted from the source, loaded into the destination, and then transformed in place. More common in modern cloud architectures than the older ETL pattern.
ETL (Extract, Transform, Load) A data movement pattern where data is extracted from the source, transformed before loading, and then loaded into the destination. Common in older data warehouse architectures.
Freshness Monitoring A monitoring setup that alerts when a table has not received new data within an expected window. Essential for detecting pipeline failures before they cause data quality problems downstream.
Gold Layer The final layer in a medallion architecture. Contains business-ready data organized for reporting, dashboards, and AI queries. Optimized for read performance and named in business terms.
MCP (Model Context Protocol) A protocol that allows AI tools to connect to external systems and query real data in real time. Enables AI assistants to answer questions about your actual projects, costs, and field operations rather than relying solely on training data.
Medallion Architecture A data design pattern that organizes a data warehouse into three layers: bronze (raw), silver (cleaned), and gold (business-ready). Each layer progressively improves data quality and analytical usefulness.
Rate Limit A restriction on how frequently an API can be called within a given time period. Construction software APIs all have rate limits that affect how quickly data can be extracted and how frequently syncs can run.
Schema The structure of a database, including the tables, columns, data types, and relationships. A consistent schema across source systems and warehouse layers makes analytical queries reliable.
Silver Layer The middle layer in a medallion architecture. Contains cleaned, standardized, and deduplicated data. Business logic has not yet been applied, but the data is trustworthy and consistent.
Slowly Changing Dimension (SCD) A technique for tracking historical changes to dimension data over time. A Type 2 SCD adds a new row for each change, preserving the full history. Useful for tracking how project details, cost codes, or subcontractor information change over time.
Upsert A database operation that updates an existing row if the key exists, or inserts a new row if it does not. More robust than drop-and-recreate for maintaining data warehouse tables.
Common Financial Terms in Construction Analytics
Accounts Payable (AP) Money a company owes to vendors and subcontractors for work completed or materials received. AP aging breaks this into time buckets: current, 1-30 days, 31-60 days, and so on.
Accounts Receivable (AR) Money owed to a company for work it has completed and billed. AR aging shows how long invoices have been outstanding. High concentration in the 60+ day buckets indicates collection problems.
Billing Percent Complete The percentage of the total contract value that has been billed to the owner to date. Calculated as: JTD billed divided by contract value.
Buyout The process of executing subcontracts and purchase orders to commit the costs for a project scope. Buyout percent complete measures how much of the estimated scope has been committed.
Cash Conversion Cycle The time in days between paying for work and collecting payment for it. Calculated as Days Sales Outstanding minus Days Payable Outstanding. A shorter cash conversion cycle means less working capital is tied up in the business.
Committed Cost The total cost locked in through executed subcontracts and purchase orders. Represents money you have committed to spend but may not have spent yet.
Contingency A budget reserve held for unforeseen costs. Usually tracked as original contingency versus revised contingency, with the difference representing how much has been consumed.
Cost at Completion (EAC) The total projected cost of a project when finished, based on current actuals and the estimated cost to complete the remaining work. The accuracy of this number depends on honest forecasting by project managers.
Cost Percent Complete The percentage of the total projected cost that has been spent to date. Calculated as: JTD cost divided by cost at completion.
Days Payable Outstanding (DPO) The average number of days a company takes to pay its vendors and subcontractors after receiving an invoice. A higher DPO means holding cash longer before paying it out.
Days Sales Outstanding (DSO) The average number of days it takes to collect payment after issuing a billing. A lower DSO means faster collection. Calculated as: AR balance divided by (total revenue divided by days in period).
Earned Revenue The revenue a company is entitled to recognize based on actual work completed, regardless of what has been billed. Used in percentage of completion accounting to determine the appropriate revenue recognition.
Estimated Cost to Complete (ETC) The remaining cost required to finish a project from its current state. Added to JTD cost to produce the cost at completion.
Fee The GC's profit margin, usually expressed as a percentage of the construction cost or as a fixed amount. On cost-plus contracts, the fee is explicitly stated. On lump sum contracts, it is embedded in the contract price.
General Conditions (Div 1) The overhead costs of running a project: superintendent, project manager, project engineer, temporary facilities, safety, quality, and similar costs that are not tied to a specific scope of work. Tracking GC burn rate against the GC budget is a common early warning indicator.
Gross Profit Revenue minus cost of construction. Does not include G&A overhead or below-the-line items.
JTD (Job to Date) From the start of the project to the current date. JTD cost, JTD hours, JTD billed are all common metrics.
Liquidated Damages (LDs) A pre-agreed daily penalty for late completion written into the contract. Critical to track because they represent a quantifiable financial risk tied to schedule performance.
Net Cash Position The cash flow status of a project: what has been billed minus what has been spent, adjusted for retainage and uncommitted costs. A positive net cash position means the project is cash flow positive. A negative position means the company is funding the project.
Over/Under Billing Overbilling: you have billed more than you have earned based on work completed. Creates a liability. Underbilling: you have earned more than you have billed. Creates an asset. Both are tracked on the WIP schedule.
Pay Application (Pay App) A formal billing request submitted to the owner, typically monthly, based on the schedule of values. Includes the current period billing, the cumulative billing, and the retainage calculation.
Profit Fade The decline in projected gross profit on a project over its life, usually resulting from unforeseen costs or optimistic initial forecasting. Tracking profit fade over time helps identify whether estimating is consistently optimistic.
Retainage A percentage of each pay application withheld by the owner until project completion. Typically five to ten percent. Retainage represents money earned but not yet collectible and is a significant source of construction cash flow risk.
Schedule of Values (SOV) The breakdown of a contract into line items with associated values, used as the basis for monthly pay applications. Each line item has an original value, approved change orders, and a current value.
WIP Schedule Work in Progress schedule. The primary financial document for construction companies using percentage of completion accounting. Shows every active project with revenue earned, over and under billing, and forecasted gross profit.
Working Capital Current assets minus current liabilities. The financial cushion available to absorb short-term obligations. Growing working capital is a sign of financial health. Declining working capital, even on a profitable company, can indicate cash flow management problems.
Common Operational Terms in Construction Analytics
Activity Code A tag applied to schedule activities in P6 or other scheduling tools to categorize work by phase, responsibility, or work type. Consistent activity coding enables cross-project schedule analytics.
CPM (Critical Path Method) A scheduling methodology that identifies the longest sequence of dependent activities determining the minimum project duration. Activities on the critical path have zero float.
DART Rate Days Away, Restricted, or Transferred rate. A safety metric measuring the frequency of injuries serious enough to result in missed work days or job restrictions. Lower is better.
Earned Value Management (EVM) A project performance measurement methodology that integrates scope, schedule, and cost to provide an objective view of project performance. Key metrics include Schedule Performance Index (SPI) and Cost Performance Index (CPI).
Finish No Later Than (FNLT) A constraint type in P6 that sets a hard deadline for an activity. When the project is behind schedule, activities with Finish No Later Than constraints drive negative float, which signals the schedule is at risk.
Float The amount of time an activity can be delayed without delaying the project completion date. Zero float means the activity is on the critical path. Negative float means the current schedule already predicts a late completion.
JTD Hours Job to date labor hours. Total hours worked from project start to the current date.
Level of Effort (LOE) An activity type in P6 used for overhead and supervision activities whose duration is determined by the activities they support, not by their own logic. Should be excluded from duration and productivity calculations.
Milestone A zero-duration schedule activity marking a significant event. Substantial completion, certificate of occupancy, and permit received are common construction milestones.
PCO (Potential Change Order) A potential change to the contract that has been identified but not yet formally approved. Tracking open PCOs is an indicator of unresolved scope risk.
RFI (Request for Information) A formal question submitted by the contractor to the design team or owner seeking clarification on the contract documents. RFI response time is one of the most predictive schedule risk indicators.
SPI (Schedule Performance Index) An earned value metric. SPI below 1.0 means you are behind schedule. SPI above 1.0 means you are ahead. Calculated as: earned value divided by planned value.
Submittal A document, sample, or product data submitted to the architect or engineer for review and approval before the work is performed. Slow submittal review cycles are a common source of schedule delays.
TRIR (Total Recordable Incident Rate) A safety metric measuring the frequency of recordable injuries per 100 full-time workers. Industry standard formula: (number of recordable incidents x 200,000) divided by total hours worked.
WBS (Work Breakdown Structure) A hierarchical breakdown of a project into manageable sections. In scheduling tools, the WBS organizes activities into a logical hierarchy. In cost management, it defines the budget structure.
XER File The native export file format for Oracle Primavera P6 schedules. Contains the full schedule including activities, relationships, resources, calendars, baselines, and codes.
Interested in operationalizing your construction data?
Request a Demo