Author name cluster

Amitava Das

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

8 papers

1 author row

AAAI Conference 2026 Conference Paper

DETONATE – A Benchmark for Text-to-Image Alignment and Kernelized Direct Preference Optimization

Renjith Prasad Kaippilly Mana
Abhilekh Borah
Hasnat Md Abdullah
Chathurangi Shyalika
Gurpreet Singh
Ritvik Garimella
Rajarshi Roy
Harshul Raj Surana

Alignment is crucial for text-to-image (T2I) models to ensure that the generated images faithfully capture user intent while maintaining safety and fairness. Direct Preference Optimization (DPO) has emerged as a key alignment technique for large language models (LLMs), and its influence is now extending to T2I systems. This paper introduces DPO-Kernels for T2I models, a novel extension of DPO that enhances alignment across three key dimensions: (i) Hybrid Loss, which integrates embedding-based objectives with the traditional probability-based loss to improve optimization; (ii) Kernelized Representations, leveraging Radial Basis Function (RBF), Polynomial, and Wavelet kernels to enable richer feature transformations, ensuring better separation between safe and unsafe inputs; and (iii) Divergence Selection, expanding beyond DPO’s default Kullback–Leibler (KL) regularizer by incorporating alternative divergence measures such as Wasserstein and Rényi divergences to enhance stability and robustness in alignment training. We introduce DETONATE, the first large-scale benchmark of its kind, comprising approximately 100K curated image pairs, categorized as chosen and rejected. This benchmark encapsulates three critical axes of social bias and discrimination: Race, Gender, and Disability. The prompts are sourced from the hate speech datasets, while the images are generated using state-of-the-art T2I models, including Stable Diffusion 3.5 Large (SD-3.5), Stable Diffusion XL (SD-XL), and Midjourney. Furthermore, to evaluate alignment beyond surface metrics, we introduce the Alignment Quality Index (AQI) for T2I systems: a novel geometric measure that quantifies latent space separability of safe/unsafe image activations, revealing hidden model vulnerabilities. While alignment techniques often risk overfitting, we empirically demonstrate that DPO-Kernels preserve strong generalization bounds using the theory of Heavy-Tailed Self-Regularization (HT-SR).

PDF Details DOI

EAAI Journal 2025 Journal Article

A deep dive into automated sexism detection using fine-tuned deep learning and large language models

Advaitha Vetagiri
Partha Pakray
Amitava Das

Details DOI

IS Journal 2023 Journal Article

Why Do We Need Neurosymbolic AI to Model Pragmatic Analogies?

Thilini Wijesiriwardene
Amit Sheth
Valerie L. Shalin
Amitava Das

A hallmark of intelligence is the ability to use a familiar domain to make inferences about a less familiar domain, known as analogical reasoning. In this article, we delve into the performance of large language models (LLMs) in dealing with progressively complex analogies expressed in unstructured text. We discuss analogies at four distinct levels of complexity: lexical, syntactic, semantic, and pragmatic. As the analogies become more complex, they require increasingly extensive, diverse knowledge beyond the textual content, unlikely to be found in the lexical co-occurrence statistics that power LLMs. We discuss neurosymbolic AI techniques that combine statistical and symbolic AI, informing the representation of unstructured text to highlight and augment relevant content, provide abstraction, and guide the mapping process. This maintains the efficiency of LLMs while preserving the ability to explain analogies for pedagogical applications.

Details DOI

AAAI Conference 2022 Short Paper

Memotion Analysis through the Lens of Joint Embedding (Student Abstract)

Nethra Gunti
Sathyanarayanan Ramamoorthy
Parth Patwa
Amitava Das

Joint embedding (JE) is a way to encode multi-modal data into a vector space where text remains as the grounding key and other modalities like image are to be anchored with such keys. Meme is typically an image with embedded text onto it. Although, memes are commonly used for fun, they could also be used to spread hate and fake information. That along with its growing ubiquity over several social platforms has caused automatic analysis of memes to become a widespread topic of research. In this paper, we report our initial experiments on Memotion Analysis problem through joint embeddings. Results are marginally yielding SOTA.

PDF Details

AAAI Conference 2022 Short Paper

PESTO: Switching Point Based Dynamic and Relative Positional Encoding for Code-Mixed Languages (Student Abstract)

Mohsin Ali
Sai Teja Kandukuri
Sumanth Manduru
Parth Patwa
Amitava Das

NLP applications for code-mixed (CM) or mix-lingual text have gained a significant momentum recently, the main reason being the prevalence of language mixing in social media communications in multi-lingual societies like India, Mexico, Europe, parts of USA etc. Word embeddings are basic building blocks of any NLP system today, yet, word embedding for CM languages is an unexplored territory. The major bottleneck for CM word embeddings is switching points, where the language switches. These locations lack in contextually and statistical systems fail to model this phenomena due to high variance in the seen examples. In this paper we present our initial observations on applying switching point based positional encoding techniques for CM language, specifically Hinglish (Hindi - English). Results are only marginally better than SOTA, but it is evident that positional encoding could be an effective way to train position sensitive language models for CM text.

PDF Details

AAAI Conference 2018 Short Paper

Consonant-Vowel Sequences as Subword Units for Code-Mixed Languages

Upendra Kumar
Vishal Singh
Chris Andrew
Santhoshini Reddy
Amitava Das

In this research work, we develop a state-of-art model for identifying sentiment in Hindi-English code-mixed language. We introduce new phonemic sub-word units for Hindi- English code-mixed text along with a hierarchical deep learning model which uses these sub-word units for predicting sentiment. The results indicate that the model yields a signiﬁcant increase in accuracy as compared to other models.

PDF Details

IS Journal 2018 Journal Article

Revealing Psycholinguistic Dimensions of Communities in Social Networks

Tushar Maheshwari
Aishwarya N. Reganti
Upendra Kumar
Tanmoy Chakraborty
Amitava Das

In this paper, the authors seek to answer one fundamental question - what brings people together to form a community? In this article, they explore the personalities (psychological) and values (sociological) of individuals in social network communities in order to understand such natural selection.

Details DOI

AAAI Conference 2017 Short Paper

Semantic Interpretation of Social Network Communities

Tushar Maheshwari
Aishwarya Reganti
Upendra Kumar
Tanmoy Chakraborty
Amitava Das

A community in a social network is considered to be a group of nodes densely connected internally and sparsely connected externally.Although previous work intensely studied network topology within a community, its semantic interpretation is hardly understood. In this paper, we attempt to understand whether individuals in a community possess similar Personalities, Values and Ethical background. Finally, we show that Personality and Values models could be used as features to discover more accurate community structure compared to the one obtained from only network information.

PDF Details