Aspiring PhD Researcher | AI & Large Language Models for Healthcare
Research interests: LLMs, Explainable AI, Causal ML, Health Informatics
View My WorkI am a Data Science graduate with a strong research orientation, seeking to pursue doctoral studies in Artificial Intelligence with a focus on Large Language Models and healthcare applications.
My academic work spans machine learning, deep learning, explainable AI, and generative models, with multiple research-oriented projects involving healthcare datasets, clinical prediction tasks, and model interpretability.
Through my MSc research and independent projects, I have developed experience in formulating research questions, designing experiments, evaluating models rigorously, and communicating findings clearly. I am particularly interested in responsible and privacy-aware AI systems for real-world healthcare use.
University of Greenwich
SRM Arts and Science College
NSN Memorial School
Expert in Python programming for data science and automation.
Proficient in R for statistical analysis, data manipulation, and visualization.
Skilled in SQL for querying and managing relational databases efficiently.
Proficient in JavaScript for web development and interactive interfaces.
Experienced in creating dynamic visualizations and dashboards.
Proficient in Power BI for business intelligence and data storytelling.
Crafting compelling narratives using data to drive business decisions.
Creating dynamic and interactive charts for impactful presentations.
Proficient in creating advanced visualizations such as heatmaps, scatter matrix plots, and multi-dimensional charts for better insights.
Developing classification and regression models using supervised techniques.
Expert in clustering techniques and dimensionality reduction methods.
Skilled in clustering algorithms like K-Means, DBSCAN, and hierarchical clustering for grouping datasets.
Proficient in deep learning using TensorFlow and PyTorch for building advanced neural networks.
Experienced in processing and analyzing text data using NLP techniques like tokenization, stemming, and sentiment analysis.
Expert in optimizing machine learning models using hyperparameter tuning and regularization techniques.
Deploying machine learning models using , Heroku, Flask, Docker, and cloud services.
A selection of my research-oriented work focusing on healthcare AI, large language models, explainable machine learning, and generative modeling.
This dissertation investigated the application of deep learning models for automated severity classification of knee osteoarthritis from medical imaging data. The study emphasized explainable AI to support clinical interpretability and responsible deployment.
Transfer learning was applied using convolutional neural networks, and model decisions were visualized through Grad-CAM to highlight clinically relevant regions. The final system was deployed as an interactive web application.
Figure: Web-based deep learning system for knee osteoarthritis severity detection with confidence score.
This research project explored generative modelling techniques across text and image domains, comparing large language models with deep generative architectures.
GPT-2 was fine-tuned for text generation tasks, while GANs, diffusion models, and CTGAN were evaluated for synthetic data generation on benchmark datasets. The project examined model performance, diversity, and scalability across modalities.
Figure: Synthetic CIFAR-10-like images generated using a DCGAN trained on image data.
This project applied machine learning techniques to analyze chronic kidney disease datasets, focusing on interpretability and clinical relevance.
Classification models were developed and analyzed using SHAP to explain feature importance and decision behavior, supporting transparent and trustworthy AI systems for healthcare.
Figure: SHAP waterfall plot illustrating feature-level contributions in chronic kidney disease prediction.
Publications are currently in preparation. Research outputs from my MSc thesis and ongoing projects are being developed for submission to peer-reviewed venues.
Applied machine learning and exploratory projects demonstrating practical implementation of data science and NLP techniques.
This project leverages natural language processing (NLP) to classify sentiments in movie reviews as positive, neutral, or negative using machine learning techniques.
Analyze sentiments of movie review data in real-time using advanced NLP techniques. This project showcases powerful data visualizations and easy-to-deploy pipelines.
This project applies machine learning techniques to classify products into various categories based on their attributes. By analyzing product data, the system improves inventory management and customer experience.
This project demonstrates the application of machine learning in product classification to enhance e-commerce functionalities. It includes preprocessing, model training, and deployment-ready pipelines.
This project aims to predict customer churn by analyzing historical data patterns. By leveraging machine learning algorithms, businesses can identify at-risk customers and implement strategies to improve retention.
This project focuses on exploring and analyzing Chronic Kidney Disease (CKD) data using unsupervised learning techniques. Clustering algorithms like K-Means and Hierarchical Clustering are applied to identify patterns and group similar cases, providing valuable insights for early diagnosis and targeted interventions.
Performing detailed exploratory data analysis to uncover trends and insights. Cleaning and preprocessing data to ensure its accuracy and reliability for analysis. Conducting statistical analysis to evaluate relationships between variables. Creating insightful visualizations like histograms, scatter plots, and heatmaps. Using feature engineering techniques to prepare datasets for advanced modeling. Helping clients make informed decisions through in-depth data understanding.
Designing and developing predictive models to extract actionable insights from complex datasets. Specializing in supervised and unsupervised learning techniques to solve classification, regression, and clustering problems. Expertise in libraries like TensorFlow, PyTorch, and Scikit-learn. Leveraging hyperparameter tuning and model optimization to improve performance. Implementing scalable ML pipelines for real-world applications in diverse domains. Delivering solutions tailored to meet client requirements and business goals..
Crafting visually compelling dashboards and charts to represent complex data effectively. Utilizing tools like Matplotlib, Seaborn, and Tableau to uncover insights. Developing interactive visuals for better data storytelling and decision-making. Simplifying trends and relationships in data through meaningful visual summaries. Focusing on creating user-friendly and intuitive designs tailored to audience needs. Delivering actionable insights for stakeholders by making data accessible and comprehensible.
Building recommendation engines to enhance user engagement and retention. Leveraging collaborative and content-based filtering techniques for accurate predictions. Applying matrix factorization methods for handling sparse datasets efficiently. Tailoring algorithms to personalize product, content, or service recommendations. Measuring success through precision, recall, and other evaluation metrics. Delivering scalable solutions to meet user preferences across industries.
Have a question or want to work together? Reach out to me!
Email: harshinihachu6@gmail.com