Data Scientist
Right after completing my Bachelor’s degree in Computer Engineering, I joined Mu Sigma as a Data Scientist in July 2019. I underwent 3 months of rigorous training where I gained some preliminary real-world experience and developed proficiency in Python, SQL, ML and Storyboarding.
World’s Largest Home Improvement Retailer
- My first client engagement started in November 2019 where I was assisting the Financial Planning and Analysis (FP&A) team of a Fortune 25 US-based Home Improvement Retailer, in undergoing Digital Transformation.
- We helped our clients develop an advanced tool to plan and forecast 7 Retail Metrics viz. Sales, Online Sales, Markups, Markdowns, Inventory, etc.
- As my first assignment, I developed the entire codebase for planning the “Initial Markups” (IMU Metric) and wrote ~13,000 lines of Python code.
- I was also responsible for maintaining the pipelines developed by us.
- I got a chance to design and create 7 Tableau Dashboard from scratch - which were used to visualize the financial plans created by our tool and flag anomalies if any.
- I was awarded my 1st SPOT Award due to this effort.
- By December 2020, I was leading this team of 5 data scientists and managing the pipelines and dashboards for 4 of the 7 Retail Metrics.
- Due to my consistent efforts and subject-matter, I recieved my 2nd SPOT Award.
Fortune 100 Telecom Clientele
- Post ~2 years of gaining experience in the Retail domain, I moved to a team of 7 data scientists, as a lead, and worked with the Data Science and Data Engineering team of a US-based Telecom Giant.
- We were partnering with the clients to develop and end-to-end framework to identify and tackle network impairments thus improving the “Network Service Experience”.
- Initially, we started off with performing exhaustive EDA on previously un-explored datasets. Lack of Subject Matter Expertise really made this a challenging task.
- From Cable Modem Registration Dataset, we identified over 14 network “events” which indicated a degraded network service on the consumer’s end for ~50M customers in near real-time.
- We explored more datasets viz. Speed Tests, Modem Utilizations, Proactive-Network_maintenance, Noise ratios among others.
- By the end of March 2022, we established an entire framework to identify any event indicating network failures with ~98% accuracy.
- I was awarded my 3rd SPOT Award for my efforts here.