/data
directoryHandling Missing & Inconsistent Data
Identified and dropped rows with null or irrelevant values in key columns (e.g., price, commodity, date). Unified column naming (e.g., province, district, market) for consistency. Removed duplicates to ensure data quality.
Standardizing Formats
Converted date fields to datetime objects for time-series operations. Normalized price to numeric format, fixing any non-numeric entries. Standardized text columns (e.g., title-cased commodity, province names).
Filtering for Relevance
Kept only necessary columns for analysis and prediction. Filtered for valid price ranges to exclude outliers. Created a cleaned dataset ready for modeling and Power BI integration.
all_commodity_predictions.csv
..csv
fileMuhetoHodal_PowerBI_Exam_Report.pbix
/data/cleaned
directory