Research

I have conducted research in machine learning optimization, feature selection, and image segmentation, resulting in 9 peer-reviewed publications across IEEE, Springer, and Elsevier journals, amongst others. My current work at Columbia University is at the intersection of Causal Inference and Privacy Laws in the contemporary world to ensure adherence of LLMs to legal frameworks, data management, and building user trust in these technologies.

Current Research

I am currently a Research Assistant at the ARiSE (Advanced Research in Software Engineering) Lab at Columbia University. I am working on an NSF-funded project focused on compliance auditing of LLMs using Causal Inference and Explainable AI techniques. The primary goal is to check adherence of modern and upcoming LLMs to data privacy laws across the globe, such as GDPR, EU AI Act, Colorado AI Act, amongst others. These laws build upon FIPPs and OECD's purpose limitation and data minimization guidelines. The current task focuses on building a benchmark to audit the current open-source LLMs against both real-world and synthetically generated datasets in the Finance, Employment and Healthcare domains, providing counter-factuals and eventually evaluating the models. Apart from Columbia University, there are academic collaborators from Wesleyan University and University of South Carolina, while industry participation is being spearheaded by Google and IBM whose focus is to provide their own LLMs and datasets.


Apart from the work regarding LLM privacy, I am working as an ML Research Assistant at the Department of Earth & Environmental Engineering at Columbia University. I am assisting in the development of a translator for Fortran code of Community Land, Ocean and Atmospheric models to Python JAX for usage of autodiff and other ML techniques in the future. This pipeline will aid development of weather prediction in a better way by understanding the interaction of the biosphere with the atmosphere. The current task focuses on developing a modular package to translate the code, run test suites using pytest, and automatically repairing any failed test.

Previous Research

I have previously worked as a Research Assistant at the Centre for Microprocessor Applications Training and Research (CMATER) lab at Jadavpur University, India during my undergraduate studies. To continue my passion for ML optimization problems, upon request from various journals, I currently review research works to help advancement in this domain. Some salient observations during my time at CMATER are as follows:

Tech: Python, NumPy, SciPy, scikit-learn, PyPI, MATLAB, Statistics

View all research papers