Create datasets to upload and publish data. Further organize your data using folders and assign metadata at both the file and dataset level.
This shapefile is a difference layer of the 2023 and 2018 data. This shapefile shows how infrastructure has changed from 2018 to 2023 and hos infrastrcture desert hexagons have transitioned.
This shapefile shows which infrastructure is deficient and number of infrastructure deficiencies in each hexagon.
This shapefile shows which infrastructure is deficient and number of infrastructure deficiencies in each hexagon.
This is our data for lithium ion battery, all experimental datasets are created under our lab ASPEN lab at SMU.
Data used for the West Nile Virus Risk Prediction Project
The memorandum and the appendix of my AI project
This repository contains the full workflow, data, and code used to develop a machine-learning framework for accurate construction cost prediction by integrating micro-level project estimates with macro-level U.S. construction spending indicators. The project includes two source datasets—(1) construction_estimates.csv, containing project-specific material cost, labor cost, profit rate, discount/markup, and total cost, and (2) construction_spending.csv, containing national monthly spending across public, private, residential, and non-residential sectors. A third file, merged_construction_dataset.csv, combines both sources to create a unified feature set used for feature selection (RFE, SelectKBest, correlation filtering) and supervised modeling (Linear Regression and Random Forest). All scripts for preprocessing, ARFF conversion, model training, evaluation, and figure generation are included, providing a reproducible pipeline for researchers and practitioners interested in improving construction cost forecasting.
This dataset contains more factors that lead to change in air quality
Link to data set: https://www.kaggle.com/datasets/nishantbhadauria/datasetucimlairquality?resource=download
This data set lists factors that lead to variance in air quality.
Link to dataset: https://archive.ics.uci.edu/dataset/360/air+quality
These are the datasets used for the project. Here is the original link: https://www.kaggle.com/datasets/taweilo/capital-bikeshare-dataset-202005202408
There is a Supervised and Clustered Dataset along with my combined datasets
Attributes:
- date (observation date)
- pickup_counts (number of bikes rented per day)
- dropoff_counts (number of bikes dropped off per day)
-tempmax (max daily temp F)
-tempmin (min daily temp F)
-humidity (average daily humidity %)
-precip (daily total precipitation inches)
-windspeed (average daily windspeed mph)
-weekday (numeric representation of day of week 1=monday 7=sunday)
-month (numeric representation 1-12)
-holiday (1 = federal or major holiday and 0 = normal day)
-total_usage (total usage of a station)
Powered by Clowder (1.22.1#1085 branch:master sha1:f28c203c56b2d4690d32ea0bce5364458de1ec79).