Data Wrangling (Data Preprocessing)
Data Wrangling (Data Preprocessing)
Mid-term assessment
Siddharth Dinkar Raul (s4015125)
18-09-2023
Setup
Hide
Data generation
Hide
# Introducing outliers
# Exporting to CSV
write.csv(sales_data, "sales_data.csv", row.names = FALSE)
set.seed(286)
# Export to CSV
write.csv(customer_data, "customer_data.csv", row.names = FALSE)
# Introduce outliers
inventory_data[sample(1:200, 5), "cost_price"] <- inventory_data[sample(1:200, 5), "cost_pric
e"] * 0.5
inventory_data[sample(1:200, 5), "selling_price"] <- inventory_data[sample(1:200, 5), "sellin
g_price"] * 2
# Export to CSV
write.csv(inventory_data, "inventory_data.csv", row.names = FALSE)
# Check structure of combined data and perform all necessary data type conversions, provide R
codes here.
Scanning data
Hide