Sumit Asthana
PhD candidate, University of Michigan. Go Blue!

GitHub • CV • LinkedIn • Google Scholar
Thank you dear visitor for stopping by! I am a final year PhD candidate at University of Michigan, Ann Arbor. My research focus and interests are at the intersection of Machine Learning and Human-Computer Interaction (HCI).
I develop adaptive AI systems that enable people to reason under risk and uncertainty in complex decision-making scenarios by modeling their underlying decision processes and not just their observable behaviors. For example, in education, inferring students’ conceptual gaps requires reconstructing their mental models from their learning trajectories, not just identifying surface-level mistakes. I borrow from cognitive science and probabilistic machine learning to design AI with experts’ mental model to improve Human-AI interaction. By modeling people’s latent cognitive states, my methods improve reasoning of AI systems beyond observed behaviors, improving overall learning efficiency and accuracy. I bring in strong computational and model building skills from my prior industry experience to build systems for Human-AI interaction and my training in HCI allows me to conduct large scale evaluations in people’s work context for improving these systems. For example, I recently built a bayesian network from a massive dataset of 3M Census records to model personal preferences and used it to study the design of personalization agents that respect users’ privacy. I also have strong Reinforcement Learning (RL) foundations that I have applied to model human behavior, which positions me well to explore RL-based fine-tuning of LLMs. For instance, I developed a deep RL system from scratch to simulate indoor human behavior and COVID-19 transmission dynamics, demonstrating how RL can capture and reason about complex behavioral patterns. The following three broad directions describe my research focus and future vision.
-
Desiging computational models that can understand and improve expert decision-making (AI to critique not obey): Furthering the design of computational models that can understand and reason about experts’ decision processes, and how they reason about and balance principles in their decisions. For example, understanding how instructors balance providing the answer versus guiding students in tutoring scenarios.
-
Designing Human-AI interaction to support and continuously improve through interaction with experts: Designing user interfaces and interactions that naturally support and augment experts’ work practices, and enable AI to unaimbiguously understand user goals, and mental processes during interactions to improve AI systems during use.
-
Advancing the responsible use of AI in helping people become experts: Designing methods to advance the responsible use of AI help people become experts by studying decision process of educators and designing technologies that leverage established pedagogical principles to augment educators’s expertise.
I have strong connections with the Wikipedia open-source community. In 2014-18, I extensively contributed to the codebase that powers the mobile version of Wikipedia and was a prominent code reviewer for mobile Wikipedia. I had also administered Wikimedia Foundation’s Google Summer of Code internship program in 2016,17. For my services I was nominated by the Wikimedia Foundation to attend the Google summer of code mentor summit in 2017, and invited to present my research at the monthly Wikimedia research showcase in 2021. Currently, I contribute to the Wikipedia research community, such as serving on the program committee of Wiki Workshop, Wikipedia’s annual research workshop.