I am an ELLIS PhD student supervised by Prof. Iryna Gurevych at the UKP Lab, TU Darmstadt, and co-supervised by Prof. Amartya Sanyal at the University of Copenhagen. Previously, I spent two great years at IIIT Hyderabad, India, with Prof. Ponnurangam Kumaraguru.

I am broadly interested in the privacy and safety of large language models. My recent work focuses on developing methods for protecting sensitive information when working with billion-parameter models. Outside of research, I enjoy cricket and travelling.


Selected Papers

Auditing Language Model Unlearning via Information Decomposition
Differentially Private Steering for Large Language Model Alignment
Socratic Reasoning Improves Positive Text Rewriting
From Human Judgements to Predictive Models: Unravelling Acceptability in Code-Mixed Sentences
An Unsupervised, Geometric and Syntax-aware Quantification of Polysemy
SyMCoM - Syntactic Measure of Code Mixing A Study Of English-Hindi Code-Mixing
HLDC: Hindi Legal Documents Corpus

Reviewing: ACL Rolling Review (ACL, EMNLP, NAACL, EACL), AAAI 2025, NeurIPS 2025, ICML MUGen workshop, WiNLP workshop, LLMSec workshop, CLPsych workshop

Student Advising:

  • Are You Sure?: Uncertainty Estimation in LLM Judges
    - Patrick Gantner (MSc)
  • Investigating Privacy Leakage and its Mitigation in Activation Editing of LLMs
    - Michail Moroz (MSc)
  • Enhancing LLM Reasoning Capabilities on Therapeutic Interventions
    - Alicia Gleichmann (BSc)