Bio
I am currently an Applied AI Scientist at Amazon Web Services (AWS), where I work on Amazon Q for the AWS Console, with a particular focus on analogical reasoning, retrieval-augmented reasoning, and generative user interfaces. My work lies at the intersection of information retrieval, reasoning, and large language models, with applications in AI-driven personalization and adaptive UI generation.
Previously, I was a Ph.D. student in the College of Information and Computer Sciences at UMass Amherst, working with
Prof. Andrew McCallum at IESL. My research focused on set-based representation learning and compositional reasoning, with applications in natural language processing and information retrieval. I earned my Master’s degree from the Indian Institute of Science (IISc) in 2018, where I was part of the Machine and Language Learning (MALL) Lab, working with Prof. Partha Talukdar.
In addition to academia, I have gained industry research experience working with Dr. Tijmen Tieleman at Minds.ai, Dr. Steffen Rendle at Google Research, Dr. Achille Fokoue at IBM Research, and Dr. Tobias Schnabel at Microsoft Research.
Research Areas
Representation Learning: My PhD thesis focuses on set-based representation learning using Box Embeddings – trainable Venn diagrams that model logical reasoning and set-based similarity. Box embeddings provide a structured alternative to vector-based similarity measures like dot product by encoding inclusion, intersection, and exclusion relationships. I improved the training stability and local identifiability of box embeddings (Improving Local Identifiability), enabling more accurate modeling of complex set-theoretic relationships. In Word2Box, I showed that box embeddings outperform traditional word embeddings in representing hierarchical and compositional semantics. My work on Box-to-Box Transformations demonstrated their effectiveness for joint hierarchical modeling in knowledge graph completion and reasoning.
Information Retrieval: Box embeddings improve real-world IR tasks like faceted search, recommendation systems, and web browsing by better handling compositional queries. In Answering Compositional Queries with Set-Theoretic Embeddings, I proposed a framework for handling queries like "Jazz but not Smooth Jazz" or "Animation ∩ Monsters," where traditional vector models often fail. My work on Personalized Recommendation with Set-Theoretic Constraints explored attribute-aware recommendations, enhancing performance when user preferences follow set-based patterns. I also integrated box embeddings with transformer-based dual encoders, improving search and personalized recommendation performance (results coming soon). At Microsoft, I developed a hybrid approach combining traditional collaborative filtering with LLM-based similarity estimation to enhance recommendation accuracy.
Knowledge Base Representation: My research explores improving knowledge base representation by developing methods that capture both structural and temporal dependencies in large-scale knowledge graphs. In HyTE, I proposed a hyperplane-based embedding method to model the temporal evolution of facts in knowledge graphs. HyTE introduced a novel temporal axis in embedding space, allowing for more accurate reasoning over time-dependent relationships. My work on Neural Dater applied graph convolutional networks (GCNs) to predict the dates of historical documents, showing that structured representations improve temporal inference.
Building on my Ph.D. research, I have extended box embeddings to probabilistic knowledge bases, where uncertainty and hierarchical relationships are key challenges. Box embeddings provide a natural geometric representation for encoding uncertainty and capturing the complex structure of multi-relational data. In Probabilistic Box Embeddings for Knowledge Graph Reasoning, I showed that box-based models outperform traditional embeddings in capturing hierarchical and compositional properties of knowledge bases, improving both link prediction and query answering performance. My work demonstrates that box embeddings can effectively handle partial and uncertain information, making them suitable for real-world knowledge graphs.
Experience
-
Amazon Web Services (AWS) 2025 – Present
- Applied Scientist II, Amazon Q for AWS Console
- Working on analogical reasoning where retrieval meets reasoning.
- Building Generative UI systems that translate user intent into adaptive, personalized console experiences.
-
Microsoft Research Summer 2024
- Research Intern, Manager – Tobias Schnabel
- Integrated LLMs into recommendation systems using an efficient and editable RAG-based framework.
- Achieved 50% improvement in human preference alignment over state-of-the-art methods.
-
IBM Research Summer 2023
- Research Intern, Manager – Achille Fokoue
- Developed datasets and evaluation protocols for LLM agents focused on tool usage and multi-step planning.
-
Google Research Summer 2022, Fall 2022
- Research Intern, Manager – Steffen Rendle
- Introduced a benchmark for compositional queries (e.g., “Jazz but not Smooth Jazz”) in recommender systems.
- Proposed a set-based embedding method outperforming vector baselines by 25%.
-
IBM Research Spring 2022
- Research Externship, Mentors – Ken Clarkson (IBM), Cameron Musco (UMass Amherst)
- Designed a hashing-based, single-pass method for scalable word embedding learning.
-
Adobe Research Summer 2021
- Document Intelligence Research Intern, Manager – Tong Sun
- Proposed geometric embeddings to impose hierarchical structure on vector representations.
-
Minds.ai 2018 – 2019
- Neural Network Engineer, Manager – Tijmen Tieleman
- Developed GCN-based molecular property predictors for drug discovery.
- Built deep RL controllers to improve battery life and fuel efficiency in hybrid vehicles.
Education
P.h.D. in Computer Science
University of Massachusetts Amherst.
Started PhD in Fall 2019.
Thesis on Set Theoretic Representation Learning. Supervised by
Prof. Andrew McCallum. Published and presented in multiple top-tier ML/NLP conferences.
Master of Technology in System Science and Automation
Indian Institute of Science, Bangalore. M.Tech. 2018.
Thesis on Temporal Information Processing. Published fundamental research work on temporal
representation of knowledge bases.
Bachelor of Engineering in Electrical Engineering
Jadavpur University B.E. 2015.
Publications
Please visit my Google Scholar page for an updated list of publications.
-
Shib Dasgupta, Michael Boratko, Andrew McCallum.
A Geometric Approach to Personalized Recommendation with Set-Theoretic Constraints Using Box Embeddings.
In-Submission: ICML 2025 – Recommendation Systems
-
B. Rozonoyer, M. Boratko, D. Patel, W. Zhao, Shib Dasgupta, H. Le, A. McCallum.
Learning Representations for Hierarchies with Minimal Support.
NeurIPS 2024 – Machine Learning
-
Shib Dasgupta, Andrew McCallum, Steffen Rendle, Li Zhang.
Answering Compositional Queries with Set-Theoretic Embeddings.
ArXiv 2023 – Information Retrieval
-
Shib Dasgupta, Michael Boratko, S. Atmakuri, Xiang Lorraine Li, D. Patel, Andrew McCallum.
Word2Box: Capturing Set-Theoretic Semantics of Words using Box Embeddings.
ACL 2022 – NLP
👉 [Talk Link]
-
EunJeong Hwang, Veronika Thost, Shib Sankar Dasgupta, Tengfei Ma.
An Analysis of Virtual Nodes in Graph Neural Networks for Link Prediction.
LoG 2022 – Graph Neural Networks
-
Michael Boratko, D. Patel, Shib Dasgupta, Andrew McCallum.
Measure-Theoretic Set Representation Learning.
ArXiv 2022 – ML Theory
-
Michael Boratko, Javier Burroni, Shib Sankar Dasgupta, Andrew McCallum.
Min/Max Stability and Box Distributions.
UAI 2021 – ML Theory
-
Shib Dasgupta, Xiang Lorraine Li, Michael Boratko, Dongxu Zhang, Andrew McCallum.
Box-To-Box Transformations for Modeling Joint Hierarchies.
ACL (Rep4NLP) 2021 – Graph Learning
-
Xuelu Chen, Michael Boratko, Muhao Chen, Shib Dasgupta, Xiang Li, Andrew McCallum.
Probabilistic Box Embeddings for Uncertain Knowledge Graph Reasoning.
NAACL 2021 – Graph Learning
-
T. Chheda, P. Goyal, T. Trang, D. Patel, M. Boratko, Shib Dasgupta, Andrew McCallum.
Box Embeddings: An Open-source Library for Representation Learning using Geometric Structures.
EMNLP (Demo Track) 2021 – Open Source Contribution
-
Shib Dasgupta*, Michael Boratko*, Dongxu Zhang, Luke Vilnis, Xiang Li, Andrew McCallum.
Improving Local Identifiability for Probabilistic Box Embeddings.
NeurIPS 2020 – Machine Learning
👉 [Talk Link]
-
Dhruvesh Patel*, Shib Sankar Dasgupta*, Michael Boratko, Xiang Li, Luke Vilnis, Andrew McCallum.
Representing Joint Hierarchies with Box Embeddings.
AKBC 2020 – Graph Learning
-
Shib Dasgupta, Swayambhu Nath Ray, Partha Talukdar.
HyTE: Hyperplane-based Temporally aware Knowledge Graph Embedding.
EMNLP 2018 – Graph Learning
-
Shikhar Vashishth, Shib Sankar Dasgupta, Swayambhu Nath Ray, Partha Talukdar.
Dating Documents using Graph Convolution Networks.
ACL 2018 – Graph Learning
-
Swayambhu Nath Ray, Shib Sankar Dasgupta, Partha Talukdar.
AD3: Attentive Deep Document Dater.
EMNLP 2018 – Graph Learning
Skills
- Programming Languages: Python, C++, Matlab
- Libraries & Frameworks: PyTorch, TensorFlow, Transformers
- Machine Learning: Deep Learning, Information Retrieval, Natural Language Processing, Recommendation Systems
- Tools & Platforms: Git, Docker, Linux, Jupyter
- Data Handling: Pandas, NumPy
Published Software and Datasets
Mentorship Experience
-
Industry Mentorship Program: Mentored master’s students every year
(15+ students over 5 years) on projects in collaboration with industries, including
IBM, Adobe, Spotify, and Uber.
UMass Amherst, 2021–2025
-
Master's Research Project: Mentored 10+ Master’s students over 5 years,
offering research projects and guidance towards their degree completion.
UMass Amherst, 2021–2025
Awards & Achievements
-
Scholarship: Awarded the W. Bruce Croft Graduate Scholarship in Computer Science.
UMass Amherst, 2020
-
Gold Medal: Awarded the N R Khambhati Memorial Medal for Topper of the class in M.Tech.
IISc, Bangalore, 2019
-
Scholarship: Awarded Non-Student Travel Scholarship by EMNLP, 2018.
EMNLP, 2018
-
All India Rank 47: In GATE–2016 (Graduate Engineering Entrance Test) among ~130,000 applicants.
IISc, Bangalore, 2016
-
All State Rank 166: In WBJEE–2011 (Engineering Entrance Test) among ~130,000 applicants.
Jadavpur University, 2011
-
Scholarship: University Students scholarship by the Government of India for Higher Secondary merit.
Jadavpur University, 2011