PinnedVijay PatilinAnalytics VidhyaPredictive models using Rolling Window Features (I)Rolling Window6 min read·Dec 28, 2021----
Vijay PatilinAnalytics VidhyaXGBoost with PySpark on AWS EMRCompatible versions to train and deploy XGBoost using PySpark4 min read·Sep 3, 2022----
Vijay PatilInstalling and using PySpark on Linux machineInstallation steps simplified5 min read·Mar 20, 2022--1--1
Vijay PatilProduct Recommendation StrategiesA non-exhaustive, but fairly large list of product recommendation strategies in Fashion Retail (ecommerce)8 min read·Mar 13, 2022----
Vijay PatilinAnalytics VidhyaPredictive models using Rolling Window Features (II)Part 2 of the Rolling Window approach series.6 min read·Jan 9, 2022--1--1
Vijay PatilinAnalytics VidhyaGetting started with AirflowHow-to guide on setting up Airflow on Linux machine and creating a basic workflow using BashOperator, PythonOperator and MySqlOperator8 min read·Sep 19, 2021--2--2
Vijay PatilinAnalytics VidhyaClustering and profiling customers using k-MeansFollowing article walks through the flow of a clustering exercise using customer sales data.9 min read·Jul 31, 2021--1--1
Vijay PatilinAnalytics VidhyaInstalling and using PySpark on Windows machineInstallation steps simplified (and automated to certain extent…)6 min read·Dec 22, 2020--3--3