research
thesis · publications
total 3
thesis.mddata valuation for label error detection in ML pipelines · ongoingccce23.mdinterpretable hybrid recommender · CCCE'23, Stockholmisda22.mdRePI: research-paper impact analysis · ISDA'22
Data Valuation for Label Error Detection in ML PipelinesResponsible Data Science (RDS) Lab, Purdue University · Aug 2024 – PresentAdvised by Dr. Romila Pradhan
Developing Shapley-value-based data-valuation methods to detect and repair mislabeled training data in ML pipelines, with a focus on improving model fairness, reliability, and explainability.
An Interpretable Hybrid Recommender Based on Graph Convolution to Address SerendipityPublished at CCCE'23 · Stockholm, March 2023 — Springer link
Two novel contributions built on top of a 4-model hybrid graph-convolutional recommender:
- A new distance-based metric for quantifying recommendation serendipity, going beyond standard diversity/novelty proxies
- KNN feature-importance analysis layered on the hybrid to make its recommendations interpretable to end users
RePI: Research Paper Impact AnalysisPublished at ISDA'22 · December 2022 — Springer link
A web application for analyzing research-paper impact, built around a novel impact-factor ratio — a new metric for publication influence that goes beyond raw citation counts.
- Implemented the metric and pipeline in Python on top of the Semantic Scholar API
- Built an interactive interface in Streamlit for exploring per-paper and per-author impact