About Me
An Engineering graduate in Computer Science and Engineering. Currently a Data Engineer who’s previously interned/worked at Hitachi as a Data Analyst Intern and Aerogram (IIT-Delhi Startup) as a Data Engineer Intern. I love coding and solving problems. My primary focus is on Data Engineering, Big Data Analytics, Cloud, and building solution with and related to data. Previously I finished up my B.Tech/B.E in Computer Science and Engineering in Spring 2020, at Chandigarh University. My goals are to learn as much as possible from my experiences and interactions with others, and leave a positive impact on the world. I also love to write articles, blogs and to interact with people through my Podcast called Life, Tech and Beyond. Also, I like to help to build professional Resumes and SOPs for others . When I’m not working/programming I love to play chess and am also a keen camp fire level guitarist and an avid movie lover.
Core Skills: C++ Python PostgreSQL SQL Microsoft Azure JAVA Data Visualization Big Data Data Warehousing Airflow Machine Learning ETL Cloud Data Analytics Excel HTML CSS GIT Flask MongoDB Power BI
If you want to get your Resume, Cover Letter, or CV build buy me a coffee.
Projects
Data Integration and ETL (Microsoft Azure, ETL, MS Server SQL, Databases)
https://github.com/adityakaushal/Data-Integration-and-ETLDeveloped an end-to-end Data Warehousing and ETL Solution for Adventure Works 2019 Database.
Accumulated and collected desired data from Data warehouses containing 50+ tables. Designed an ETL Pipeline for data transformation and modeling using Azure Data Factory. Utilized the Data lake Storage for staging and processing data. Also, handled many task such as joining, filtering, selecting, deriving columns and tables for extracting desired data from Adventure Works2019 Data warehouse using components like Azure Data Factory pipelines, data flows, copy data activities, and control flows. Tech: Azure SQL DB, Azure SQL Server, Azure Data Lake Gen 2 Storage, Azure Data Factory.(Aug ’20)
AirSol (Web Development, Time Series, Forecasting)
https://github.com/adityakaushal/PM-2.5-PredictorDeveloped an end-to-end solution for Time Series Forecasting the PM 2.5 values in the vcinity of IIT Delhi Campus using IoT, Python, Google Cloud, ML Alogs, and Python Web Framework (Flask).
Built a Web Dashboard using time Series modeling to predict local mapped Particulate Matter 2.5 in the vicinity of IIT Delhi Campus using various forecasting algorithms like S-ARI-MA and Prophet. Tech: Python, Google Cloud, Google Firestore, Flask, HTML, CSS, Pygal, Pandas, NumPy, JS. (Mar’20)
Face2Gene (Desktop Application, Computer Vision, Python)
https://github.com/adityakaushal/Face-Detection-and-Recognition-Built a facial recognition system using Python and various Machine Learning Algorithms like Scalar Vector Machines, Principal Component Analysis, and Cross Validation.
Built a facial recognition app to recognize the user through facial features and displayed the user name on the identified Image. Utilised Support Vectors Machine, Principal Component Analysis, and K-Fold Cross Validation. Tech: Python, Open CV (May‘18)
Reporting Solution (Power BI, Databases)
https://github.com/adityakaushal/Reporting-Solution/blob/master/Reporting-Solution.pdfDesigned an end-to-end Analytics solution for analysing the Point of Sales Data of Adventure Works Data.
Analysed and mapped various tables from the Adventure Works 2019 Data warehouse to carry out visualization of Fact-Tables. Utilized Dimension Table and Fact Table to collect and model data to extract the desired result. (Aug ’20)
Analysed and did some EDA (Exploratory Data Analysis for the Loan Prediction Dataset from Kaggle). Utilised the Logistic Regression, XGBoost, Trees, and Random Forest for predicting Loan approval.
Processed Loan Dataset to automate loan eligibility process. Analysed various loan granting factors like ‘Credit Score’, ‘Dependencies’, ‘Education’, ‘Gender’, ‘Income’. Utilised various Python libraries like Sea born, pandas, Numpy and matplotlib to analyse various factors using numerous visualization to predict the loan eligibility. Tech: Python, SciPy (Mar’20)
Experience
Brillio is a Digital Consulting Company with a focus on buidling solution with Cloud, Big Data Analytics, and Product Engineering and Digital Infrastructure.
Data Engineering and Warehousing , ETL and PowerBI Reporting.
Aerograms builds cityscale air pollution monitors to track your personal exposure to pollution.
Built a web dashboard to predict PM2.5 values using algorithms like Prophet, S-ARI-MA, and EMA. Developed ETL Pipelines on Google Cloud to migrate telemetry feed from IoT-Devices to Google Cloud SQL. Integrated the pipelines with MQTT protocols using Google Pub/Sub & IoT core. Analysed PM 2.5 of E-BAM and IIT-D during and before the lock down to determine the seasonality and trends.Technical Skills: Python, SQL, Pandas, NumPy, SciPy, Google Cloud, HTML, CSS, JS, Flask, ETL, Tableau
Worked under Hitachi's Railway Systems Business Division.
Built a solution for extracting the arrival & departure of Trains to compare NTES data with actual timings. Designed a solution to automate the processes of ETL using Python. Extracted raw Data with ScraPy. Converted the unstructured formats to Excel readable formats for data visualization through Pandas. Summarized the Data into visualization to compare the actual arrival and departure with NTES timings. Technical Skills: Python, ScraPy, Pandas, NumPy, Excel, Mat- plotlib, Sea-born. *NTES: National Train Enquiry System
Education
Chandigarh University
BTech/B.E Computer Science and Engineering (Hons.)
2016 - 2020
Overall GPA: 7.44/10. Top ‘8%’ among all performers worldwide in the Google Hash code 2019 1st round. Selected for Elite batch of Top 40 students of Computer Science for scoring more than ‘650+’ in AMCAT. Awarded ‘IBM Mastery’ for ‘Cloud Application Developer’ and ‘AI Analyst’ for scoring more than ‘70%’.During my time at Chandigarh University I learnt most of my key skills that have I have taken through my career such as teamwork and working to tight deadlines. I thouroughly enjoyed my time at university and learnt a lot about a healthy work life balance.
A Little More About Me
Alongside my interests in Data Engineering and Machine Learning, some of my other interests and hobbies are:
- Playing Chess Send me an invite to play chess.
- Keen interest in Music and a Camp Fire Level Guitarist (Favourite Band: Nirvana)
- Blogging and writing articles See my Articles on Medium
- Podcaster Have a look at my podcasts with people
- Track and Field sports (Running and Athletics)