-
Notifications
You must be signed in to change notification settings - Fork 1
Home
This document outlines the process of analyzing perfume buyers and product data, encompassing steps like data preprocessing, exploratory data analysis (EDA), machine learning modeling, and result evaluation.
1.1 noon_perfumes_buyer_dataset.csv
data:image/s3,"s3://crabby-images/91601/9160147d5b0b42b462ea9de842c2c49552edb806" alt="스크린샷 2023-12-04 오후 9 29 13"
data:image/s3,"s3://crabby-images/8a6ae/8a6ae1e58e9bec02abc201dccd476e2d36425364" alt="스크린샷 2023-12-04 오후 9 29 23"
data:image/s3,"s3://crabby-images/92650/926504433f04635a7f5b5957369239d28e7d4f58" alt="스크린샷 2023-12-04 오후 9 29 31"
1.2 noon_perfumes_dataset
data:image/s3,"s3://crabby-images/58f59/58f59728088dfa3e6799a099d472036e546d460c" alt="스크린샷 2023-12-04 오후 9 34 39"
data:image/s3,"s3://crabby-images/0407b/0407b352f32e06837a0640061c4cd46311ec92cf" alt="스크린샷 2023-12-04 오후 9 36 52"
data:image/s3,"s3://crabby-images/ddbc6/ddbc6604cacb66371bbe7dad5a84b4bf1ca7a470" alt="스크린샷 2023-12-04 오후 9 37 04"
data:image/s3,"s3://crabby-images/b5c3e/b5c3ef2d73eb9cfcd53a256128bfef90c1dc6064" alt="스크린샷 2023-12-04 오후 9 37 12"
Removing Missing Values: Rows with missing values are removed using the .dropna() method. Deleting Empty String Data: Rows with empty strings in specific columns are removed. Feature Integration: brand and name are combined into a single feature. One-Hot Encoding: Base notes and middle notes are one-hot encoded.
2.1 Buyer Dataset
data:image/s3,"s3://crabby-images/2a462/2a4620c58b2d6467732aa9c80d80e3317f6932e0" alt="스크린샷 2023-12-04 오후 9 44 17"
2.2 Perfume Dataset
data:image/s3,"s3://crabby-images/64fb4/64fb47d8e8c190c19a95db610798e4d5a26a3189" alt="스크린샷 2023-12-04 오후 9 49 42"
data:image/s3,"s3://crabby-images/1bcbb/1bcbbe389d1ee36ad75618720cba0b9540d744ed" alt="스크린샷 2023-12-04 오후 9 50 18"
KNN Model: A perfume recommendation model is trained using NearestNeighbors. Recommendation Function: Provides perfume recommendations based on the input perfume name.
Training and Evaluating KNN Classifier: The model is trained using KNeighborsClassifier, and evaluated with accuracy_score and classification_report.
Elbow Graph: An elbow graph is plotted to find the optimal value of K.
data:image/s3,"s3://crabby-images/b0019/b001982aae277a61e48da8e4f696275b8ec96006" alt="스크린샷 2023-12-05 오후 3 22 44"
data:image/s3,"s3://crabby-images/9026b/9026b7bea5323ba1f56d58aa13a3ab8457738ed7" alt="스크린샷 2023-12-05 오후 3 39 18"
Classification Report and Accuracy: The model's performance is summarized in a classification report, and its accuracy is calculated. This structured documentation provides a clear overview of the analysis process, making it easier to understand and follow the steps involved in the data analysis project.