Jay Gupta - Resume

Work Experience

Data Director

Stanford Daily

2024

The Stanford Daily is the independent, student-run newspaper of Stanford University. The Daily strives to serve the Stanford community with relevant, unbiased journalism and provides its editorial, tech and business staffs with unparalleled educational opportunities. As the Data Director, I oversee data section recruitment, training, project management, and associated institutional knowledge.

Technologies:

GitHub
Flourish
D3

Data Analyst

Veteran Affairs

2023 - 2024

The Department of Veteran Affair's Program Evaluation and Resource Center (PERC) provides program evaluation and technical assistance for mental health quality improvement efforts across the Veteran's Health Administration. Our objective is to develop an approach for categorizing narrative content about mental health concepts into operationally relevant categories. Given the burgeoning threat of xylazine-laced fentanyl, we used xylazine as a case example.

Extracted snippets from Text Integrated Utility (TIU) Notes
Developed a custom tokenization procedure with built-in text repair processes
FineTuned BioClinicalBERT weights
Tested and validated model predictions

Technologies:

Azure Data Studio
SQL
PyTorch
HuggingFace

CTL Tutor

Stanford CTL

2023

CTL provides free subject tutoring with peers who are specially trained to help students develop the skills they need to be successful in their classes. As a Mathematics Appointment Tutor, I work with student peers taking MATH 19, 20, 21, 51, 52, and 53 to help them master the skills needed to succeed on their own by coaching rather than providing answers.

SUMO Tutor

Stanford SUMO

2023

SUMO works with the Stanford math department to provide peer tutoring services for a number of introductory math classes, specifically MATH 19, 20, 21, 51, 52, 53, and 104. As a peer tutor, I answered any questions students had about the material and homework in these classes.

Projects

Machine Unlearning

Report

2024

Machine unlearning, the ability for a model to "forget" a subset of its training data, may help eliminate biases and safeguard user privacy. We propose a student-teacher unlearning framework that rewards similarity on a retain set and penalizes similarity on a forget set.

Geometry

QuickHull, PolyLabel

2024

Computational Geometry is a branch of computer science dedicated to the study of algorithms which can be stated in geometric terms. I wrote efficient ports of QuickHull and PolyLabel in NumPy. The QuickHull Algorithm extends to arbitary dimensions and has 2D/3D visualization support. The PolyLabel Algorithm extends to any complex polygon. Contributed as ConvexHull and LabeledPolygram for ManimCE, an open source graphics software with over 20K GitHub Stars.

Stanford CPN

Course Explorer

2024

Understanding the relationship between courses is of great importance to a University's educational mission. We employ a range of supervised and unsupervised methods to understand and ultimately and inform the Stanford educational structure. Contributed as Prerequisite Tree feature for OnCourse, a course-planning startup with over 5,000 users.

Education

B.S. Data Science

Stanford University

Expected 2026

Coursework

CS 106B: Programming Abstractions
CS 107: Computer Organization
CS 111: Operating Systems
CS 103: Discrete Math
CS 109: Probability
CS 161: Algorithms
CS 229: Machine Learning
CS 224N: NLP with Deep Learning
CS 224W: ML with Graphs
DATASCI 112: Data Science
ENGR 40M: Introduction to EE
MATH 51: Linear Algebra
MATH 52: Multivariable Calculus
MATH 53: Differential Equations
MATH 110: Number Theory
MATH 113: Matrix Theory
MATH 151: Probability Theory
STATS 200: Theoretical Statistics

Awards

CS 109 Challenge Winner

The challenge involves creating a probability driven project of ones choosing that highlights concepts from the class and does something interesting. My project discusses the Hilbert Space of Random Variables.

PVSA Gold

This award honors individuals whose service positively impacts communities and inspires those around them to take action, too.

Skills

Python
C
C++
R
SQL
HTML
CSS
JS
NumPy
Pandas
SciPy
MatPlotLib
PyTorch
OpenCV
D3
Quarto
LaTex

Communication

At heart, I am an explainer and entertainer. I have posted hundreds of solutions on r/LearnMath under a burner account. I have composed essays on a variety of topics ranging from AI powered disinformation to religious architecture. I have blogged about my projects and interests over the years. I run Lyte Lectures, a 1.5K strong YouTube channel that seeks to communicate ideas with an accessible approach and artful style.

Work Experience

Data Director

Technologies:

Data Analyst

Technologies:

CTL Tutor

SUMO Tutor

Projects

Machine Unlearning

Geometry

Stanford CPN

Education

B.S. Data Science

Coursework

Awards

CS 109 Challenge Winner

PVSA Gold

Skills

Communication