My Data journey is

An Adventure

Hi, I’m Vinay Desai and welcome to my website. I’m a dedicated and enthusiastic Data Scientist with a passion for storytelling and making a difference. I completed the Microsoft Professional Programme in Data Science and have printed two documentation books covering everything from statistics and probability to querying with SQL, all the way through to machine learning and predictive solutions.

You’ll find links to my LinkedIn, Kaggle and GitHub pages, as well as a list of projects I’ve worked on. These include data cleansing, storytelling and machine learning projects. To download the notebook code for each project, simply click on the GitHub link on any project page. There’s also my blog, Microsoft certificates and a bunch of other cool stuff. Enjoy my site and feel free to get in touch!

My projects

Here's a few of the projects I've recently worked on. Click on the tabs below to see some of my examples for analysing, visualising and predicting with data.

img01

In this lab, I inserted, updated, and deleted data in the AdventureWorksLT database. This is a T-SQL lab for modifying data.

img02

I used some basic Transact-SQL programming logic to work with data in the AdventureWorksLT database.

img03

In this lab, I used Transact-SQL to implement error handling and transactions in the AdventureWorksLT database.

img04

Making use of Seaborn’s visualisation capabilities, as well as creating new features, converting data types, creating a dictionary and using .groupby.

img05

Making use of SparkML to classify whether a flight would be late or not, as well as improving the model with Parameter Tuning.

img06

Building a simple recommender system using SparkML and collaborative filtering, ready for validation and tuning tasks.

img02

This Lending Club data makes use of Decision Trees and Random Forest to find the borrowers with the highest probability of paying them back.

img06

This Natural Language Programming project attempts to classify Yelp Reviews into 1 star or 5 star categories based off the text content in the reviews.

img01

Using a Support Vector Machine algorithm to predict the correct flower type based on the flower’s characteristics, as well as trying GridSearch.

How-to guides and articles

The blog section focuses on how-to guides and articles related to big data.

img01

CSV files are a widespread way to store datasets, so it is important to know how to import this data into a Python workspace.

img02

Coming soon

img03

Coming soon

img04

Here's my list of the top five things you need to know to break into Data Science.

img05

Coming soon

img06

Coming soon