Hi There,

I'm Kanan Pandit

Welcome to My Portfolio

My Photo

About Me

I'm Kanan Pandit
Master's Student in Data Science

I am currently pursuing a Master’s degree in Data Science, with a strong focus on machine learning, data analytics, and AI technologies. With a solid foundation in statistical modeling and data manipulation, I specialize in transforming complex data into actionable insights. I am passionate about solving real-world problems through innovative, data-driven approaches and committed to continuous learning.

Contact

Download Resume

Education

Masters of Science in Big Data Analytics

Ramakrishna Mission Vivekanada Educational and Research Institute, Belur

2024-2026 | Pursuing

Bachelor of Education With Pedagogy Of Mathematics

WBUTTEPA,Kolkata

2020-2022 | Completed

Bachelor of Science in Mathematics

Vidyasagar University,Medinipur

2017-2020 | Completed

Higher Secondary

Golar Sushila Vidyapith High School,Golar,Medinipur

2015-2017 | Completed

Secondary Education

Golar Sushila Vidyapith High School,Golar,Medinipur

2015 | Completed

Academic Projects

A Comparative Study of Classification Algorithms on the EMNIST dataset

This project focuses on preprocessing the EMNIST dataset and applying various classification models—Logistic Regression, Softmax Regression, KNN, Decision Tree, Random Forest, and SVM—for handwritten character recognition. We compare model performance to identify the most effective approach and address challenges related to handwriting variation, offering recommendations for improved recognition systems..

GitHub View Code

Comprehensive Regression Analysis to predict sales based on advertising data

This project involves applying and comparing various regression techniques—including linear, polynomial, gradient descent methods, and regularization (Ridge, Lasso, ElasticNet)—to predict sales based on advertising data. The goal is to evaluate model performance using metrics like MAE, MSE, R², and computational time, with Polynomial Regression (Degree 3) identified as the most accurate and efficient model.

GitHub View Code

My Data Science Journey – Interactive Portfolio

A dynamic and interactive portfolio designed to highlight my journey in data science. Showcases core projects involving machine learning, data visualization, and statistical analysis, with an emphasis on real-world problem-solving and continuous learning.

GitHub View Code

Distributed ML with H2O Cluster

This project demonstrates the setup of a distributed H2O cluster across two machines and the execution of a machine learning task, showcasing parallel processing and scalable model training in a multi-node environment.

GitHub View Code

Distributed ML with SPARK Cluster

This project involves configuring a multi-node Apache Spark cluster and executing distributed machine learning tasks, highlighting Spark's capability for large-scale data processing and model training.

GitHub View Code

Distributed Machine Learning for Wildfire Prediction Using H2O Automl

Implemented distributed machine learning using H2O AutoML to predict wildfires, leveraging multiple nodes for scalable and efficient model training.

GitHub View Code

Smart Control Hub:Multi-Functional Virtual Controller using Hand Gestures

Developed a Smart Control Hub using Mediapipe for real-time hand tracking to enable gesture-controlled volume, brightness, virtual mouse, and presentation control with visual feedback for intuitive interaction.

GitHub View Code

Artistic Image Transformation in Ghibli Aesthetic

Implemented a CycleGAN model to transform real-world photos into Studio Ghibli-style images, preserving key features like facial structure, and evaluated the results qualitatively during training.

GitHub View Code

A Comparative Study on Image Filtering and Hybrid Image Generation Algorithm

Conducted a comparative analysis of various image filtering techniques and developed a hybrid image generation algorithm to enhance image processing outcomes.

GitHub View Code

Detecting Harris Corners and Matching Images

Implemented Harris corner detection to identify key points in images and performed feature matching to align and compare images effectively.

GitHub View Code

Image Stitching(Panorama Creation),Image Alignment (Cam-Scanner Style),Depth Estimation from Stereo Images using OpenCV and ustom Harris detector without using OpenCV

Developed image stitching for panorama creation, image alignment like CamScanner, and depth estimation from stereo images using OpenCV and a custom Harris corner detector without relying on OpenCV’s built-in functions.

GitHub View Code

Defending Sentiment Analysis Against Adversarial Attacks: A Step Toward More Reliable NLP Models

This project fine-tunes BERT for binary sentiment classification on the IMDB dataset and investigates its vulnerability to adversarial attacks. Developed multiple attack methods using semantically valid word substitutions to fool the model, evaluated attack success, and improved robustness through adversarial training with filtered adversarial examples.

GitHub View Code

Technical Skills

Languages:

Python, C, R

Tools:

MySQL, Hadoop, Spark, Power BI

Libraries:

NumPy, Pandas, Scikit-learn, PyTorch

Coursework

Probability
Time Series
Survival Analysis
Machine Learning
Deep Learning
NLP
Data Structure and Algorithms
Hadoop
Spark

Areas of Interest

Deep Learning
Machine Learning
Computer Vision
Optimization
Distributed Systems

Additional Skills & Hobbies

Languages: English, Bengali, Hindi
Hobbies: Cricket, Listening Music
Customer Support Illustration

Get in Touch