This repository contains the full workflow, data, and code used to develop a machine-learning framework for accurate construction cost prediction by integrating micro-level project estimates with macro-level U.S. construction spending indicators. The project includes two source datasets—(1) construction_estimates.csv, containing project-specific material cost, labor cost, profit rate, discount/markup, and total cost, and (2) construction_spending.csv, containing national monthly spending across public, private, residential, and non-residential sectors. A third file, merged_construction_dataset.csv, combines both sources to create a unified feature set used for feature selection (RFE, SelectKBest, correlation filtering) and supervised modeling (Linear Regression and Random Forest). All scripts for preprocessing, ARFF conversion, model training, evaluation, and figure generation are included, providing a reproducible pipeline for researchers and practitioners interested in improving construction cost forecasting.
| Name | Value | Last Modified |
|---|
No metadata available for this resource
No extraction events recorded.
| Views: | 177 |
| Last viewed: | Apr 03, 2026 10:39:09 |
| Downloads: | 0 |
| Last downloaded: | Never |
| Last Modified: | Dec 11, 2025 00:20:46 |