Shubhashis Roy Dipta

PhD Researcher · UMBC

sroydip1@umbc.edu

Resume Google Scholar

Amazon Science (Alexa)

Seattle, WA

Applied Scientist Intern

Summer 2026

Mentors: Dr. Xiaohu Xie, Dr. Daniel Bis

Amazon Science (Alexa)

Seattle, WA

Applied Scientist Intern

Summer 2025

Mentors: Dr. Daniel Bis, Dr. Kun Zhou

Paper: PA3: Policy-Aware Agent Alignment

Scale AI

San Francisco, CA

Machine Learning Research Intern

Summer 2024

University of Maryland, Baltimore County

Ph.D. in Computer Science

Fall 2023 - Present

Grade: 4.00/4.00

Publications: See Here (From 2022)

University of Maryland, Baltimore County

M.Sc. in Computer Science

Spring 2021 - Spring 2023

Grade: 4.00/4.00

Morgan State University

Research Assistant

2017 - 2019

Publications: 4 Journal

UniShopr.com

Bangladesh

Founder

2017 - 2021

Upcoming Travel

ACL 2026 in San Diego, CA (Jul 3-7)

✅ NeurIPS 2025 in San Diego, CA (Dec 2-7)
❌ AACL 2025 in Mumbai, India (Dec 20-24) (canceled)

👋 I'm open to meeting! Email me to schedule a chat.

Peer Review

Reviewed 28+ papers across top venues (2023–2025).

Conferences

ACL · NeurIPS · NAACL · COLING · *SEM

Workshops

SemEval · TrustNLP · SRW · W-NUT · ELVM

Journals

Scientific Reports · BMC Bioinformatics · Plant Methods · Computational and Structural Biotechnology

I’m a final-year CS Ph.D. researcher at the University of Maryland, Baltimore County (UMBC), advised by Dr. Frank Ferraro, with research internships at Amazon Science (Alexa AI; Summers 2025 and 2026) and Scale AI (Summer 2024). I make LLMs more reliable, largely through decomposition and reinforcement learning, across reasoning, agentic, and multimodal settings. My work on agentic LLMs earned a $20K Google Cloud Gemini Academic Program Award (2026).

Reasoning & Decomposition
- Semi-supervised RL for traceable decomposition-based claim verification [DecomposeRL]
- Atomic, presupposition-free decomposition for robust claim verification [De-Presuppose]
- Token-efficient math reasoning via distractor-aware computational graphs [DAGGER]
- Curriculum-driven GRPO for math reasoning in under-resourced languages [GanitLLM]
- Hierarchical event abstraction for compositional sequence modeling [SHEM]

Agentic LLMs & Reinforcement Learning
- Tool-calling alignment via policy-grounded deliberation [PA3]
- Multi-agent benchmarks for diagnosing collaboration failures [AgentCollabBench]
- Mechanistic analysis of token saliency in on-policy distillation [Rock Tokens]
- Metacognitive control in LLMs under resource constraints [TRIAGE]

Multimodal Learning & Evaluation
- Reference-free factuality metric for video captions [VC-Inspector]
- Calibrated abstention under modality conflict in omni-modal models [OMD]
- Zero-shot multilingual text-to-video retrieval via temporal event decomposition [Q2E]

Graduating Spring 2027 · No visa sponsorship needed · Actively seeking Research Scientist roles in NLP / Multimodal AI. Please reach out if you have an opening.

Recent News (See All)

Jun 16, 2026	🎉 Awarded a $20,000 Google Cloud research grant through the Gemini Academic Program to support my agentic LLM research at UMBC.
Jun 1, 2026	🎉 Joined Amazon Science (Alexa AI) for my second summer - researching self-distillation with RL to push LLM reasoning on agentic tasks.
May 27, 2026	🚀 New preprint - DecomposeRL: a 7B claim-verifier that matches GPT-4.1-mini across 11 benchmarks - with fully inspectable reasoning traces.
May 26, 2026	🥳 AgentCollabBench accepted at the FAGEN workshop @ ICML 2026 - 900 tasks that catch when a multi-agent LLM team’s final answer is right but the reasoning quietly broke.
May 20, 2026	🥳 5 papers accepted at MeLLM workshop @ ACL 2026 (Multilinguality in the Era of LLMs) - spanning text-to-gloss, math reasoning, sentiment auditing, VLM dialect benchmarks, and pretraining corpora.
May 14, 2026	🥳 OMD-Bench accepted at 3 CVPR 2026 workshops (Any2Any MLLM, CVinWild, KnowledgeMR).

Beyond Research

I’ve competed internationally in algorithms and robotics. In competitive programming, I ranked 8th out of 300+ teams at the 2018 ACM ICPC Asia Dhaka Regional, alongside multiple regional and national placements. On Kaggle, I reached the top 70 🥉 in the Birdcall Identification competition. In robotics, I placed 9th at the University Rover Challenge 2015 (Utah, USA) and 22nd at the European Rover Challenge 2016 (Poland). Full list of awards →

Before my Ph.D., I also founded UniShopr (2017-2021), a cross-border e-commerce platform serving consumers in Bangladesh.

Featured Publications

Check out Google Scholar for a full list of my publications.

Submitted

DecomposeRL: Learning to Ask Useful, Informative, and Diverse Questions for Semi-Supervised, Traceable Claim Verification

Shubhashis Roy Dipta, Ankur Padia, and Francis Ferraro

Preprint 2026

arXiv Code 🤗 Hugging Face Website

Submitted

Cornerstones or Stumbling Blocks? Deciphering the Rock Tokens in On-Policy Distillation

Yuxuan Jiang*, Runchao Li*, Shubhashis Roy Dipta*, and 2 more authors

Preprint 2026

* Equal contribution

arXiv

Submitted

AgentCollabBench: Diagnosing When Good Agents Make Bad Collaborators

Aritra Mazumder, Shubhashis Roy Dipta, Nusrat Jahan Lia, and 10 more authors

Preprint 2026

arXiv Code Website

Preprint

TRIAGE: Evaluating Prospective Metacognitive Control in LLMs under Resource Constraints

Zabir Al Nazi and Shubhashis Roy Dipta

Preprint 2026

arXiv

Submitted

PA3: Policy-Aware Agent Alignment through Chain-of-Thought

Shubhashis Roy Dipta, Daniel Bis, Kun Zhou, and 4 more authors

Preprint 2026

Work done during internship at Amazon Alexa AI

arXiv Video

Submitted

†DAGGER: Distractor-Aware Graph Generation for Executable Reasoning in Math Problems

Zabir Al Nazi, Shubhashis Roy Dipta, and Sudipta Kar

Preprint 2026

arXiv Code Website

Submitted

Omni-Modal Dissonance Benchmark: Systematically Breaking Modality Consensus to Probe Robustness and Calibrated Abstention

Zabir Al Nazi*, Shubhashis Roy Dipta*, and Md Rizwan Parvez

Preprint 2026

* Equal contribution

arXiv

ACL

GanitLLM: Difficulty-Aware Bengali Mathematical Reasoning through Curriculum-GRPO

Shubhashis Roy Dipta, Khairul Mahbub, and Nadia Najjar

ACL 2026

arXiv Code 🤗 Hugging Face Website Video Poster

ACL

VC-Inspector: Advancing Reference-free Evaluation of Video Captions with Factual Analysis

Shubhashis Roy Dipta, Tz-Ying Wu, and Subarna Tripathi

ACL 2026

arXiv Code 🤗 Hugging Face Website Video Slides Poster

ACL

Multimodal Unlearning Across Vision, Language, Video, and Audio: Survey of Methods, Datasets, and Benchmarks

Nobin Sarwar, Shubhashis Roy Dipta, Zheyuan Liu, and 1 more author

ACL 2026

PDF Code Website

AACL

Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval

Shubhashis Roy Dipta and Francis Ferraro

AACL 2025

arXiv Code 🤗 Hugging Face Website Video Slides Poster

*SEM

If We May De-Presuppose: Robustly Verifying Claims through Presupposition-Free Question Decomposition

Shubhashis Roy Dipta and Francis Ferraro

*SEM 2025

arXiv Code Website Poster

MathAI @NeurIPS

Learning How to Use Tools, Not Just When: Pattern-Aware Tool-Integrated Reasoning

Ningning Xu, Yuxuan Jiang, and Shubhashis Roy Dipta

MathAI @NeurIPS 2025

arXiv

*SEM

Semantically-informed Hierarchical Event Modeling

Shubhashis Roy Dipta, Mehdi Rezaee, and Francis Ferraro

*SEM 2023

arXiv Code Slides