Author: "P. P. Divakaran" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"P. P. Divakaran"' showing total 441 results

Start Over Author "P. P. Divakaran"

441 results on '"P. P. Divakaran"'

1. EagleEye: Attention to Unveil Malicious Event Sequences from Provenance Graphs

Author: Gysel, Philipp, Wüest, Candid, Nwafor, Kenneth, Jašek, Otakar, Ustyuzhanin, Andrey, and Divakaran, Dinil Mon
Subjects: Computer Science - Cryptography and Security
Abstract: Securing endpoints is challenging due to the evolving nature of threats and attacks. With endpoint logging systems becoming mature, provenance-graph representations enable the creation of sophisticated behavior rules. However, adapting to the pace of emerging attacks is not scalable with rules. This led to the development of ML models capable of learning from endpoint logs. However, there are still open challenges: i) malicious patterns of malware are spread across long sequences of events, and ii) ML classification results are not interpretable. To address these issues, we develop and present EagleEye, a novel system that i) uses rich features from provenance graphs for behavior event representation, including command-line embeddings, ii) extracts long sequences of events and learns event embeddings, and iii) trains a lightweight Transformer model to classify behavior sequences as malicious or not. We evaluate and compare EagleEye against state-of-the-art baselines on two datasets, namely a new real-world dataset from a corporate environment, and the public DARPA dataset. On the DARPA dataset, at a false-positive rate of 1%, EagleEye detects $\approx$89% of all malicious behavior, outperforming two state-of-the-art solutions by an absolute margin of 38.5%. Furthermore, we show that the Transformer's attention mechanism can be leveraged to highlight the most suspicious events in a long sequence, thereby providing interpretation of malware alerts., Comment: Accepted as research paper at eCrime 2024; update affiliations
Published: 2024

2. Multimodal Large Language Models for Phishing Webpage Detection and Identification

Author: Lee, Jehyun, Lim, Peiyuan, Hooi, Bryan, and Divakaran, Dinil Mon
Subjects: Computer Science - Cryptography and Security, Computer Science - Artificial Intelligence
Abstract: To address the challenging problem of detecting phishing webpages, researchers have developed numerous solutions, in particular those based on machine learning (ML) algorithms. Among these, brand-based phishing detection that uses models from Computer Vision to detect if a given webpage is imitating a well-known brand has received widespread attention. However, such models are costly and difficult to maintain, as they need to be retrained with labeled dataset that has to be regularly and continuously collected. Besides, they also need to maintain a good reference list of well-known websites and related meta-data for effective performance. In this work, we take steps to study the efficacy of large language models (LLMs), in particular the multimodal LLMs, in detecting phishing webpages. Given that the LLMs are pretrained on a large corpus of data, we aim to make use of their understanding of different aspects of a webpage (logo, theme, favicon, etc.) to identify the brand of a given webpage and compare the identified brand with the domain name in the URL to detect a phishing attack. We propose a two-phase system employing LLMs in both phases: the first phase focuses on brand identification, while the second verifies the domain. We carry out comprehensive evaluations on a newly collected dataset. Our experiments show that the LLM-based system achieves a high detection rate at high precision; importantly, it also provides interpretable evidence for the decisions. Our system also performs significantly better than a state-of-the-art brand-based phishing detection system while demonstrating robustness against two known adversarial attacks., Comment: To appear in eCrime 2024
Published: 2024

3. From ML to LLM: Evaluating the Robustness of Phishing Webpage Detection Models against Adversarial Attacks

Author: Kulkarni, Aditya, Balachandran, Vivek, Divakaran, Dinil Mon, and Das, Tamal
Subjects: Computer Science - Cryptography and Security
Abstract: Phishing attacks attempt to deceive users into stealing sensitive information, posing a significant cybersecurity threat. Advances in machine learning (ML) and deep learning (DL) have led to the development of numerous phishing webpage detection solutions, but these models remain vulnerable to adversarial attacks. Evaluating their robustness against adversarial phishing webpages is essential. Existing tools contain datasets of pre-designed phishing webpages for a limited number of brands, and lack diversity in phishing features. To address these challenges, we develop PhishOracle, a tool that generates adversarial phishing webpages by embedding diverse phishing features into legitimate webpages. We evaluate the robustness of two existing models, Stack model and Phishpedia, in classifying PhishOracle-generated adversarial phishing webpages. Additionally, we study a commercial large language model, Gemini Pro Vision, in the context of adversarial attacks. We conduct a user study to determine whether PhishOracle-generated adversarial phishing webpages deceive users. Our findings reveal that many PhishOracle-generated phishing webpages evade current phishing webpage detection models and deceive users, but Gemini Pro Vision is robust to the attack. We also develop the PhishOracle web app, allowing users to input a legitimate URL, select relevant phishing features and generate a corresponding phishing webpage. All resources are publicly available on GitHub.
Published: 2024

4. Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification

Author: Sahu, Pritish, Sikka, Karan, and Divakaran, Ajay
Subjects: Computer Science - Computation and Language
Abstract: Large Visual Language Models (LVLMs) struggle with hallucinations in visual instruction following task(s), limiting their trustworthiness and real-world applicability. We propose Pelican -- a novel framework designed to detect and mitigate hallucinations through claim verification. Pelican first decomposes the visual claim into a chain of sub-claims based on first-order predicates. These sub-claims consist of (predicate, question) pairs and can be conceptualized as nodes of a computational graph. We then use Program-of-Thought prompting to generate Python code for answering these questions through flexible composition of external tools. Pelican improves over prior work by introducing (1) intermediate variables for precise grounding of object instances, and (2) shared computation for answering the sub-question to enable adaptive corrections and inconsistency identification. We finally use reasoning abilities of LLMs to verify the correctness of the claim by considering the consistency and confidence of the (question, answer) pairs from each sub-claim. Our experiments reveal a drop in hallucination rate by ~ 8% - 32% across various baseline LVLMs and a 27% drop compared to approaches proposed for hallucination mitigation on MMHal-Bench. Results on two other benchmarks further corroborate our results.
Published: 2024

5. Empowering Interdisciplinary Insights with Dynamic Graph Embedding Trajectories

Author: Jin, Yiqiao, Zhao, Andrew, Lee, Yeon-Chang, Ye, Meng, Divakaran, Ajay, and Kumar, Srijan
Subjects: Computer Science - Machine Learning, Computer Science - Human-Computer Interaction, Computer Science - Social and Information Networks
Abstract: We developed DyGETViz, a novel framework for effectively visualizing dynamic graphs (DGs) that are ubiquitous across diverse real-world systems. This framework leverages recent advancements in discrete-time dynamic graph (DTDG) models to adeptly handle the temporal dynamics inherent in dynamic graphs. DyGETViz effectively captures both micro- and macro-level structural shifts within these graphs, offering a robust method for representing complex and massive dynamic graphs. The application of DyGETViz extends to a diverse array of domains, including ethology, epidemiology, finance, genetics, linguistics, communication studies, social studies, and international relations. Through its implementation, DyGETViz has revealed or confirmed various critical insights. These include the diversity of content sharing patterns and the degree of specialization within online communities, the chronological evolution of lexicons across decades, and the distinct trajectories exhibited by aging-related and non-related genes. Importantly, DyGETViz enhances the accessibility of scientific findings to non-domain experts by simplifying the complexities of dynamic graphs. Our framework is released as an open-source Python package for use across diverse disciplines. Our work not only addresses the ongoing challenges in visualizing and analyzing DTDG models but also establishes a foundational framework for future investigations into dynamic graph representation and analysis across various disciplines., Comment: 27 pages, 11 figures
Published: 2024

6. Chaos controlled and disorder driven phase transitions induced by breaking permutation symmetry

Author: C, Manju, Lakshminarayan, Arul, and Divakaran, Uma
Subjects: Quantum Physics, Condensed Matter - Statistical Mechanics, Nonlinear Sciences - Chaotic Dynamics
Abstract: The effects of disorder and chaos on quantum many-body systems can be superficially similar, yet their interplay has not been sufficiently explored. This work finds a continuous phase transition when disorder breaks permutation symmetry, with details of the transition being controlled by the degree of chaos in the clean limit. The system changes from an area law entangled phase in the permutation symmetric subspace where collective variables exist to volume law entanglement in the full Hilbert space, beyond a critical strength of the disorder. The critical strength tends to zero when the original disorder free system is fully chaotic. We study this mainly via the scaling of the collective spin of non-equilibrium states which transit to have properties of what has been dubbed "deep Hilbert space". This has potential implications for general many body physics, as well as technologies such as transmon qubits.
Published: 2024

7. The promises and challenges of many-body quantum technologies: a focus on quantum engines

Author: Mukherjee, Victor and Divakaran, Uma
Subjects: Quantum Physics, Condensed Matter - Statistical Mechanics
Abstract: Can many-body systems be beneficial to designing quantum technologies? We address this question by examining quantum engines, where recent studies indicate potential benefits through the harnessing of many-body effects, such as divergences close to phase transitions. However, open questions remain regarding their real-world applications., Comment: 3 pages, 2 figures, Comment article
Published: 2024
Full Text: View/download PDF

8. Quantum critical engine at finite temperatures

Author: S, Revathy B, Mukherjee, Victor, and Divakaran, Uma
Subjects: Quantum Physics
Abstract: We construct a quantum critical Otto engine that is powered by finite temperature baths. We show that the work output of the engine shows universal power law behavior that depends on the critical exponents of the working medium, as well as on the temperature of the cold bath. Furthermore, higher temperatures of the cold bath allows the engine to approach the limit of adiabatic operation for smaller values of the time period, while the corresponding power shows a maximum at an intermediate value of the cold bath temperature. These counterintuitive results stems from thermal excitations dominating the dynamics at higher temperatures., Comment: 9 pages, 5 figures
Published: 2024
Full Text: View/download PDF

9. Athletic signature: predicting the next game lineup in collegiate basketball

Author: Sharma, Srishti, Divakaran, Srikrishnan, Kaya, Tolga, and Raval, Mehul
Published: 2024
Full Text: View/download PDF

10. Lupus pregnancy outcomes in women with previous adverse outcomes: a prospective cohort study

Author: Ravindran, Vinod, Bhadran, S., Divakaran, Mini, and Reshma, V. M.
Published: 2024
Full Text: View/download PDF

11. Facile exfoliation and physicochemical characterization of biomass-based cellulose derived from Pandanus tectorius leaves for sustainable environment

Author: Kavimani, V, Divakaran, Divya, Sriariyanun, Malinee, Suganya Priyadharshini, G, Gopal, PM, Suyambulingam, Indran, Sanjay, MR, and Siengchin, Suchart
Published: 2024
Full Text: View/download PDF

12. Exfoliation and physicochemical characterization of novel biomass-based microcrystalline cellulose derived from Millettia pinnata leaf

Author: Gopal P. M., Suganya Priyadharshini G, Suyambulingam, Indran, Divakaran, Divya, Kavimani V, Sanjay M. R., and Siengchin, Suchart
Published: 2024
Full Text: View/download PDF

13. BloomVQA: Assessing Hierarchical Multi-modal Comprehension

Author: Gong, Yunye, Shrestha, Robik, Claypoole, Jared, Cogswell, Michael, Ray, Arijit, Kanan, Christopher, and Divakaran, Ajay
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: We propose a novel VQA dataset, BloomVQA, to facilitate comprehensive evaluation of large vision-language models on comprehension tasks. Unlike current benchmarks that often focus on fact-based memorization and simple reasoning tasks without theoretical grounding, we collect multiple-choice samples based on picture stories that reflect different levels of comprehension, as laid out in Bloom's Taxonomy, a classic framework for learning assessment widely adopted in education research. Our data maps to a novel hierarchical graph representation which enables automatic data augmentation and novel measures characterizing model consistency. We perform graded evaluation and reliability analysis on recent multi-modal models. In comparison to low-level tasks, we observe decreased performance on tasks requiring advanced comprehension and cognitive skills with up to 38.0\% drop in VQA accuracy. In comparison to earlier models, GPT-4V demonstrates improved accuracy over all comprehension levels and shows a tendency of bypassing visual inputs especially for higher-level tasks. Current models also show consistency patterns misaligned with human comprehension in various scenarios, demonstrating the need for improvement based on theoretically-grounded criteria., Comment: Accepted by ACL Findings (2024). Dataset available at https://huggingface.co/datasets/ygong/BloomVQA
Published: 2023

14. The Evolution of DNS Security and Privacy

Author: Csikor, Levente and Divakaran, Dinil Mon
Subjects: Computer Science - Cryptography and Security, Computer Science - Networking and Internet Architecture
Abstract: DNS, one of the fundamental protocols of the TCP/IP stack, has evolved over the years to protect against threats and attacks. This study examines the risks associated with DNS and explores recent advancements that contribute towards making the DNS ecosystem resilient against various attacks while safeguarding user privacy., Comment: 9 pages, 4 figures - original manuscript submitted to IEEE Security & Privacy Magazine
Published: 2023

15. Suitability study of novel Bio-plasticizer from Agave sisalana leaf for biofilm applications: a biomass to biomaterial approach

Author: Edayadulla, Naushad, Divakaran, Divya, Chandraraj, Shanmuga Sundari, Sriariyanun, Malinee, Suyambulingam, Indran, Sanjay, M. R., and Siengchin, Suchart
Published: 2024
Full Text: View/download PDF

16. Effect of alkalization on physical, chemical, thermal, tensile, and surface morphological properties of Musa acuminata peduncles fiber

Author: Durai, Praveen Nagarajan, Viswalingam, Kathir, Senthilkumar, B., Divakaran, Divya, and Siengchin, Suchart
Published: 2024
Full Text: View/download PDF

17. Comprehensive characterization of novel Borassus flabellifer flower biomass based microcrystalline cellulose reinforced with polylactic acid (PLA) biofilm for futuristic applications

Author: N, Sunesh, Suyambulingam, Indran, Divakaran, Divya, and Siengchin, Suchart
Published: 2024
Full Text: View/download PDF

18. A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval

Author: Gwilliam, Matthew, Cogswell, Michael, Ye, Meng, Sikka, Karan, Shrivastava, Abhinav, and Divakaran, Ajay
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: Existing long video retrieval systems are trained and tested in the paragraph-to-video retrieval regime, where every long video is described by a single long paragraph. This neglects the richness and variety of possible valid descriptions of a video, which could be described in moment-by-moment detail, or in a single phrase summary, or anything in between. To provide a more thorough evaluation of the capabilities of long video retrieval systems, we propose a pipeline that leverages state-of-the-art large language models to carefully generate a diverse set of synthetic captions for long videos. We validate this pipeline's fidelity via rigorous human inspection. We then benchmark a representative set of video language models on these synthetic captions using a few long video datasets, showing that they struggle with the transformed data, especially the shortest captions. We also propose a lightweight fine-tuning method, where we use a contrastive loss to learn a hierarchical embedding loss based on the differing levels of information among the various captions. Our method improves performance both on the downstream paragraph-to-video retrieval task (+1.1% R@1 on ActivityNet), as well as for the various long video retrieval metrics we compute using our synthetic data (+3.6% R@1 for short descriptions on ActivityNet). For data access and other details, please refer to our project website at https://mgwillia.github.io/10k-words., Comment: 13 pages, 15 tables, 5 figures
Published: 2023

19. DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback

Author: Chen, Yangyi, Sikka, Karan, Cogswell, Michael, Ji, Heng, and Divakaran, Ajay
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: We present DRESS, a large vision language model (LVLM) that innovatively exploits Natural Language feedback (NLF) from Large Language Models to enhance its alignment and interactions by addressing two key limitations in the state-of-the-art LVLMs. First, prior LVLMs generally rely only on the instruction finetuning stage to enhance alignment with human preferences. Without incorporating extra feedback, they are still prone to generate unhelpful, hallucinated, or harmful responses. Second, while the visual instruction tuning data is generally structured in a multi-turn dialogue format, the connections and dependencies among consecutive conversational turns are weak. This reduces the capacity for effective multi-turn interactions. To tackle these, we propose a novel categorization of the NLF into two key types: critique and refinement. The critique NLF identifies the strengths and weaknesses of the responses and is used to align the LVLMs with human preferences. The refinement NLF offers concrete suggestions for improvement and is adopted to improve the interaction ability of the LVLMs-- which focuses on LVLMs' ability to refine responses by incorporating feedback in multi-turn interactions. To address the non-differentiable nature of NLF, we generalize conditional reinforcement learning for training. Our experimental results demonstrate that DRESS can generate more helpful (9.76%), honest (11.52%), and harmless (21.03%) responses, and more effectively learn from feedback during multi-turn interactions compared to SOTA LVMLs., Comment: CVPR 2024. The feedback datasets are released at: https://huggingface.co/datasets/YangyiYY/LVLM_NLF
Published: 2023

20. SPRING-INX: A Multilingual Indian Language Speech Corpus by SPRING Lab, IIT Madras

Author: R, Nithya, S, Malavika, F, Jordan, Gangwar, Arjun, J, Metilda N, Umesh, S, Sarab, Rithik, Dubey, Akhilesh Kumar, Divakaran, Govind, K, Samudra Vijaya, and Gangashetty, Suryakanth V
Subjects: Computer Science - Computation and Language, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: India is home to a multitude of languages of which 22 languages are recognised by the Indian Constitution as official. Building speech based applications for the Indian population is a difficult problem owing to limited data and the number of languages and accents to accommodate. To encourage the language technology community to build speech based applications in Indian languages, we are open sourcing SPRING-INX data which has about 2000 hours of legally sourced and manually transcribed speech data for ASR system building in Assamese, Bengali, Gujarati, Hindi, Kannada, Malayalam, Marathi, Odia, Punjabi and Tamil. This endeavor is by SPRING Lab , Indian Institute of Technology Madras and is a part of National Language Translation Mission (NLTM), funded by the Indian Ministry of Electronics and Information Technology (MeitY), Government of India. We describe the data collection and data cleaning process along with the data statistics in this paper., Comment: 3 pages, About SPRING-INX Data
Published: 2023

21. Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning

Author: Som, Anirudh, Sikka, Karan, Gent, Helen, Divakaran, Ajay, Kathol, Andreas, and Vergyri, Dimitra
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Paraphrasing of offensive content is a better alternative to content removal and helps improve civility in a communication environment. Supervised paraphrasers; however, rely heavily on large quantities of labelled data to help preserve meaning and intent. They also often retain a large portion of the offensiveness of the original content, which raises questions on their overall usability. In this paper we aim to assist practitioners in developing usable paraphrasers by exploring In-Context Learning (ICL) with large language models (LLMs), i.e., using a limited number of input-label demonstration pairs to guide the model in generating desired outputs for specific queries. Our study focuses on key factors such as - number and order of demonstrations, exclusion of prompt instruction, and reduction in measured toxicity. We perform principled evaluation on three datasets, including our proposed Context-Aware Polite Paraphrase (CAPP) dataset, comprising of dialogue-style rude utterances, polite paraphrases, and additional dialogue context. We evaluate our approach using four closed source and one open source LLM. Our results reveal that ICL is comparable to supervised methods in generation quality, while being qualitatively better by 25% on human evaluation and attaining lower toxicity by 76%. Also, ICL-based paraphrasers only show a slight reduction in performance even with just 10% training data., Comment: Accepted in Association for Computational Linguistics (ACL) 2024 Findings
Published: 2023

22. ZEST: Attention-based Zero-Shot Learning for Unseen IoT Device Classification

Author: Wu, Binghui, Gysel, Philipp, Divakaran, Dinil Mon, and Gurusamy, Mohan
Subjects: Computer Science - Networking and Internet Architecture, Computer Science - Cryptography and Security, Computer Science - Machine Learning
Abstract: Recent research works have proposed machine learning models for classifying IoT devices connected to a network. However, there is still a practical challenge of not having all devices (and hence their traffic) available during the training of a model. This essentially means, during the operational phase, we need to classify new devices not seen in the training phase. To address this challenge, we propose ZEST -- a ZSL (zero-shot learning) framework based on self-attention for classifying both seen and unseen devices. ZEST consists of i) a self-attention based network feature extractor, termed SANE, for extracting latent space representations of IoT traffic, ii) a generative model that trains a decoder using latent features to generate pseudo data, and iii) a supervised model that is trained on the generated pseudo data for classifying devices. We carry out extensive experiments on real IoT traffic data; our experiments demonstrate i) ZEST achieves significant improvement (in terms of accuracy) over the baselines; ii) SANE is able to better extract meaningful representations than LSTM which has been commonly used for modeling network traffic., Comment: 9 pages, 6 figures, 3 tables
Published: 2023

23. Confidence Calibration for Systems with Cascaded Predictive Modules

Author: Gong, Yunye, Yao, Yi, Lin, Xiao, Divakaran, Ajay, and Gervasio, Melinda
Subjects: Computer Science - Machine Learning
Abstract: Existing conformal prediction algorithms estimate prediction intervals at target confidence levels to characterize the performance of a regression model on new test samples. However, considering an autonomous system consisting of multiple modules, prediction intervals constructed for individual modules fall short of accommodating uncertainty propagation over different modules and thus cannot provide reliable predictions on system behavior. We address this limitation and present novel solutions based on conformal prediction to provide prediction intervals calibrated for a predictive system consisting of cascaded modules (e.g., an upstream feature extraction module and a downstream regression module). Our key idea is to leverage module-level validation data to characterize the system-level error distribution without direct access to end-to-end validation data. We provide theoretical justification and empirical experimental results to demonstrate the effectiveness of proposed solutions. In comparison to prediction intervals calibrated for individual modules, our solutions generate improved intervals with more accurate performance guarantees for system predictions, which are demonstrated on both synthetic systems and real-world systems performing overlap prediction for indoor navigation using the Matterport3D dataset.
Published: 2023

24. Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models

Author: Chen, Yangyi, Sikka, Karan, Cogswell, Michael, Ji, Heng, and Divakaran, Ajay
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Vision-language models (VLMs) have recently demonstrated strong efficacy as visual assistants that can parse natural queries about the visual content and generate human-like outputs. In this work, we explore the ability of these models to demonstrate human-like reasoning based on the perceived information. To address a crucial concern regarding the extent to which their reasoning capabilities are fully consistent and grounded, we also measure the reasoning consistency of these models. We achieve this by proposing a chain-of-thought (CoT) based consistency measure. However, such an evaluation requires a benchmark that encompasses both high-level inference and detailed reasoning chains, which is costly. We tackle this challenge by proposing a LLM-Human-in-the-Loop pipeline, which notably reduces cost while simultaneously ensuring the generation of a high-quality dataset. Based on this pipeline and the existing coarse-grained annotated dataset, we build the CURE benchmark to measure both the zero-shot reasoning performance and consistency of VLMs. We evaluate existing state-of-the-art VLMs, and find that even the best-performing model is unable to demonstrate strong visual reasoning capabilities and consistency, indicating that substantial efforts are required to enable VLMs to perform visual reasoning as systematically and consistently as humans. As an early step, we propose a two-stage training framework aimed at improving both the reasoning performance and consistency of VLMs. The first stage involves employing supervised fine-tuning of VLMs using step-by-step reasoning samples automatically generated by LLMs. In the second stage, we further augment the training process by incorporating feedback provided by LLMs to produce reasoning chains that are highly consistent and grounded. We empirically highlight the effectiveness of our framework in both reasoning performance and consistency., Comment: NAACL 2024 Main Conference. The data is released at https://github.com/Yangyi-Chen/CoTConsistency
Published: 2023

25. Attacking logo-based phishing website detectors with adversarial perturbations

Author: Lee, Jehyun, Xin, Zhe, See, Melanie Ng Pei, Sabharwal, Kanav, Apruzzese, Giovanni, and Divakaran, Dinil Mon
Subjects: Computer Science - Cryptography and Security
Abstract: Recent times have witnessed the rise of anti-phishing schemes powered by deep learning (DL). In particular, logo-based phishing detectors rely on DL models from Computer Vision to identify logos of well-known brands on webpages, to detect malicious webpages that imitate a given brand. For instance, Siamese networks have demonstrated notable performance for these tasks, enabling the corresponding anti-phishing solutions to detect even "zero-day" phishing webpages. In this work, we take the next step of studying the robustness of logo-based phishing detectors against adversarial ML attacks. We propose a novel attack exploiting generative adversarial perturbations to craft "adversarial logos" that evade phishing detectors. We evaluate our attacks through: (i) experiments on datasets containing real logos, to evaluate the robustness of state-of-the-art phishing detectors; and (ii) user studies to gauge whether our adversarial logos can deceive human eyes. The results show that our proposed attack is capable of crafting perturbed logos subtle enough to evade various DL models-achieving an evasion rate of up to 95%. Moreover, users are not able to spot significant differences between generated adversarial logos and original ones., Comment: To appear in ESORICS 2023
Published: 2023

26. TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models

Author: Sur, Indranil, Sikka, Karan, Walmer, Matthew, Koneripalli, Kaushik, Roy, Anirban, Lin, Xiao, Divakaran, Ajay, and Jha, Susmit
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We present a Multimodal Backdoor Defense technique TIJO (Trigger Inversion using Joint Optimization). Recent work arXiv:2112.07668 has demonstrated successful backdoor attacks on multimodal models for the Visual Question Answering task. Their dual-key backdoor trigger is split across two modalities (image and text), such that the backdoor is activated if and only if the trigger is present in both modalities. We propose TIJO that defends against dual-key attacks through a joint optimization that reverse-engineers the trigger in both the image and text modalities. This joint optimization is challenging in multimodal models due to the disconnected nature of the visual pipeline which consists of an offline feature extractor, whose output is then fused with the text using a fusion module. The key insight enabling the joint optimization in TIJO is that the trigger inversion needs to be carried out in the object detection box feature space as opposed to the pixel space. We demonstrate the effectiveness of our method on the TrojVQA benchmark, where TIJO improves upon the state-of-the-art unimodal methods from an AUC of 0.6 to 0.92 on multimodal dual-key backdoors. Furthermore, our method also improves upon the unimodal baselines on unimodal backdoors. We present ablation studies and qualitative results to provide insights into our algorithm such as the critical importance of overlaying the inverted feature triggers on all visual features during trigger inversion. The prototype implementation of TIJO is available at https://github.com/SRI-CSL/TIJO., Comment: Published as conference paper at ICCV 2023. 13 pages, 6 figures, 7 tables
Published: 2023

27. Predicting Information Pathways Across Online Communities

Author: Jin, Yiqiao, Lee, Yeon-Chang, Sharma, Kartik, Ye, Meng, Sikka, Karan, Divakaran, Ajay, and Kumar, Srijan
Subjects: Computer Science - Social and Information Networks, Computer Science - Computers and Society, J.4
Abstract: The problem of community-level information pathway prediction (CLIPP) aims at predicting the transmission trajectory of content across online communities. A successful solution to CLIPP holds significance as it facilitates the distribution of valuable information to a larger audience and prevents the proliferation of misinformation. Notably, solving CLIPP is non-trivial as inter-community relationships and influence are unknown, information spread is multi-modal, and new content and new communities appear over time. In this work, we address CLIPP by collecting large-scale, multi-modal datasets to examine the diffusion of online YouTube videos on Reddit. We analyze these datasets to construct community influence graphs (CIGs) and develop a novel dynamic graph framework, INPAC (Information Pathway Across Online Communities), which incorporates CIGs to capture the temporal variability and multi-modal nature of video propagation across communities. Experimental results in both warm-start and cold-start scenarios show that INPAC outperforms seven baselines in CLIPP., Comment: In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'23)
Published: 2023
Full Text: View/download PDF

28. Isolation of Microcrystalline Cellulose from Valoniopsis pachynema Green Macroalgae: Physicochemical, Thermal, Morphological, and Mechanical Characterization for Biofilm Applications

Author: Sunesh, Narayana Perumal, Suyambulingam, Indran, Divakaran, Divya, and Siengchin, Suchart
Published: 2024
Full Text: View/download PDF

29. Probing Conceptual Understanding of Large Visual-Language Models

Author: Schiappa, Madeline, Abdullah, Raiyaan, Azad, Shehreen, Claypoole, Jared, Cogswell, Michael, Divakaran, Ajay, and Rawat, Yogesh
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In recent years large visual-language (V+L) models have achieved great success in various downstream tasks. However, it is not well studied whether these models have a conceptual grasp of the visual content. In this work we focus on conceptual understanding of these large V+L models. To facilitate this study, we propose novel benchmarking datasets for probing three different aspects of content understanding, 1) \textit{relations}, 2) \textit{composition}, and 3) \textit{context}. Our probes are grounded in cognitive science and help determine if a V+L model can, for example, determine if snow garnished with a man is implausible, or if it can identify beach furniture by knowing it is located on a beach. We experimented with many recent state-of-the-art V+L models and observe that these models mostly \textit{fail to demonstrate} a conceptual understanding. This study reveals several interesting insights such as that \textit{cross-attention} helps learning conceptual understanding, and that CNNs are better with \textit{texture and patterns}, while Transformers are better at \textit{color and shape}. We further utilize some of these insights and investigate a \textit{simple finetuning technique} that rewards the three conceptual understanding measures with promising initial results. The proposed benchmarks will drive the community to delve deeper into conceptual understanding and foster advancements in the capabilities of large V+L models. The code and dataset is available at: \url{https://tinyurl.com/vlm-robustness}, Comment: All code and dataset is available at: https://tinyurl.com/vlm-robustness. Accepted in CVPRW 2024
Published: 2023

30. Lipoprotein(a) as a cardiovascular risk factor among patients with and without diabetes Mellitus: the Mass General Brigham Lp(a) Registry

Author: Arthur Shiyovich, Adam N. Berman, Stephanie A. Besser, David W. Biery, Rhanderson Cardoso, Sanjay Divakaran, Avinainder Singh, Daniel M. Huck, Brittany Weber, Jorge Plutzky, Christopher Cannon, Khurram Nasir, Marcelo F. Di Carli, James L. Januzzi, Deepak L. Bhatt, and Ron Blankstein
Subjects: Lipoprotein (a), Diabetes mellitus, Coronary artery disease outcomes, Diseases of the circulatory (Cardiovascular) system, RC666-701
Abstract: Abstract Background Diabetes mellitus (DM) and Lp(a) are well-established predictors of coronary artery disease (CAD) outcomes. However, their combined association remains poorly understood. Objective To investigate the relationship between elevated Lp(a) and DM with CAD outcomes. Methods Retrospective analysis of the MGB Lp(a) Registry involving patients ≥ 18 years who underwent Lp(a) measurements between 2000 and 2019. Exclusion criteria were severe kidney dysfunction, malignant neoplasms, and prior atherosclerotic cardiovascular disease (ASCVD). The primary outcome was a combination of cardiovascular death or myocardial infarction (MI). Elevated Lp(a) was defined as > 90th percentile (≥ 216 nmol/L). Results Among 6,238 patients who met the eligibility criteria, the median age was 54, 45% were women, and 12% had DM. Patients with DM were older, more frequently male, and had a higher prevalence of additional cardiovascular risk factors. Over a median follow-up of 12.9 years, patients with either DM or elevated Lp(a) experienced higher rates of the primary outcome. Notably, those with elevated Lp(a) had a higher incidence of the primary outcome regardless of their DM status. The annual event rates were as follows: No-DM and Lp(a) 90th% − 1.3%; DM and Lp(a) 90th% − 4.7% (p
Published: 2024
Full Text: View/download PDF

31. Quantum critical engine at finite temperatures

Author: Revathy, B. S., Mukherjee, Victor, and Divakaran, Uma
Published: 2024
Full Text: View/download PDF

32. Multilingual Content Moderation: A Case Study on Reddit

Author: Ye, Meng, Sikka, Karan, Atwell, Katherine, Hassan, Sabit, Divakaran, Ajay, and Alikhani, Malihe
Subjects: Computer Science - Computation and Language
Abstract: Content moderation is the process of flagging content based on pre-defined platform rules. There has been a growing need for AI moderators to safeguard users as well as protect the mental health of human moderators from traumatic content. While prior works have focused on identifying hateful/offensive language, they are not adequate for meeting the challenges of content moderation since 1) moderation decisions are based on violation of rules, which subsumes detection of offensive speech, and 2) such rules often differ across communities which entails an adaptive solution. We propose to study the challenges of content moderation by introducing a multilingual dataset of 1.8 Million Reddit comments spanning 56 subreddits in English, German, Spanish and French. We perform extensive experimental analysis to highlight the underlying challenges and suggest related research problems such as cross-lingual transfer, learning under label noise (human biases), transfer of moderation models, and predicting the violated rule. Our dataset and analysis can help better prepare for the challenges and opportunities of auto moderation.
Published: 2023

33. Improving Performance of Quantum Heat Engines using modified Otto cycle

Author: S, Revathy B., Sharma, Harsh, and Divakaran, Uma
Subjects: Quantum Physics
Abstract: The efficiency of a quantum heat engine is maximum when the unitary strokes are adiabatic. On the other hand, this may not be always possible due to small energy gaps in the system, especially at the critical point where the gap vanishes. With the aim to achieve this adiabaticity, we modify one of the unitary strokes of the cycle by allowing the system to evolve freely with a particular Hamiltonian till a time so that the system reaches a less excited state. This will help in increasing the magnitude of the heat absorbed from the hot bath so that the work output and efficiency of the engine can be increased. We demonstrate this method using an integrable model and a non-integrable model as the working medium. In the case of a two spin system, the optimal value for the time till which the system needs to be freely evolved is calculated analytically in the adiabatic limit. The results show that implementing this modified stroke significantly improves the work output and efficiency of the engine, especially when it crosses the critical point., Comment: 8 pages, 8 figures
Published: 2023
Full Text: View/download PDF

34. System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games

Author: Sur, Indranil, Daniels, Zachary, Rahman, Abrar, Faber, Kamil, Gallardo, Gianmarco J., Hayes, Tyler L., Taylor, Cameron E., Gurbuz, Mustafa Burak, Smith, James, Joshi, Sahana, Japkowicz, Nathalie, Baron, Michael, Kira, Zsolt, Kanan, Christopher, Corizzo, Roberto, Divakaran, Ajay, Piacentino, Michael, Hostetler, Jesse, and Raghavan, Aswin
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: As Artificial and Robotic Systems are increasingly deployed and relied upon for real-world applications, it is important that they exhibit the ability to continually learn and adapt in dynamically-changing environments, becoming Lifelong Learning Machines. Continual/lifelong learning (LL) involves minimizing catastrophic forgetting of old tasks while maximizing a model's capability to learn new tasks. This paper addresses the challenging lifelong reinforcement learning (L2RL) setting. Pushing the state-of-the-art forward in L2RL and making L2RL useful for practical applications requires more than developing individual L2RL algorithms; it requires making progress at the systems-level, especially research into the non-trivial problem of how to integrate multiple L2RL algorithms into a common framework. In this paper, we introduce the Lifelong Reinforcement Learning Components Framework (L2RLCF), which standardizes L2RL systems and assimilates different continual learning components (each addressing different aspects of the lifelong learning problem) into a unified system. As an instantiation of L2RLCF, we develop a standard API allowing easy integration of novel lifelong learning components. We describe a case study that demonstrates how multiple independently-developed LL components can be integrated into a single realized system. We also introduce an evaluation environment in order to measure the effect of combining various system components. Our evaluation environment employs different LL scenarios (sequences of tasks) consisting of Starcraft-2 minigames and allows for the fair, comprehensive, and quantitative comparison of different combinations of components within a challenging common evaluation environment., Comment: The Second International Conference on AIML Systems, October 12--15, 2022, Bangalore, India
Published: 2022
Full Text: View/download PDF

35. Unpacking Large Language Models with Conceptual Consistency

Author: Sahu, Pritish, Cogswell, Michael, Gong, Yunye, and Divakaran, Ajay
Subjects: Computer Science - Computation and Language
Abstract: If a Large Language Model (LLM) answers "yes" to the question "Are mountains tall?" then does it know what a mountain is? Can you rely on it responding correctly or incorrectly to other questions about mountains? The success of Large Language Models (LLMs) indicates they are increasingly able to answer queries like these accurately, but that ability does not necessarily imply a general understanding of concepts relevant to the anchor query. We propose conceptual consistency to measure a LLM's understanding of relevant concepts. This novel metric measures how well a model can be characterized by finding out how consistent its responses to queries about conceptually relevant background knowledge are. To compute it we extract background knowledge by traversing paths between concepts in a knowledge base and then try to predict the model's response to the anchor query from the background knowledge. We investigate the performance of current LLMs in a commonsense reasoning setting using the CSQA dataset and the ConceptNet knowledge base. While conceptual consistency, like other metrics, does increase with the scale of the LLM used, we find that popular models do not necessarily have high conceptual consistency. Our analysis also shows significant variation in conceptual consistency across different kinds of relations, concepts, and prompts. This serves as a step toward building models that humans can apply a theory of mind to, and thus interact with intuitively.
Published: 2022

36. Effective Utilization and Characterization of Carbon Derived from Non-biodegradable Waste Based Electrical Switches for Supercapacitor Applications: A Green Approach

Author: Prabhin, V. S., Benitha, V. S., Jeyasubramanian, K., Selvakumari, R. Shantha, and Divakaran, Divya
Published: 2024
Full Text: View/download PDF

37. Facile exfoliation and physicochemical characterization of biomass-based cellulose derived from Lantana aculeata leaves for sustainable environment

Author: Gokulkumar, S., Suyambulingam, Indran, Divakaran, Divya, Priyadharshini, G. Suganya, Aravindh, M., Iyyadurai, Jenish, Edwards, M. Sanju, and Siengchin, Suchart
Published: 2023
Full Text: View/download PDF

38. Bath engineering enhanced quantum critical engines

Author: S., Revathy B., Mukherjee, Victor, and Divakaran, Uma
Subjects: Quantum Physics, Condensed Matter - Statistical Mechanics
Abstract: Driving a quantum system across quantum critical points leads to non-adiabatic excitations in the system. This in turn may adversely affect the functioning of a quantum machine which uses a quantum critical substance as its working medium. Here we propose a bath-engineered quantum engine (BEQE), in which we use the Kibble--Zurek mechanism and critical scaling laws to formulate a protocol for enhancing the performance of finite-time quantum engines operating close to quantum phase transitions. In the case of free fermionic systems, BEQE enables finite-time engines to outperform engines operating in the presence of shortcuts to adiabaticity, and even infinite-time engines under suitable conditions, thus showing the remarkable advantages offered by this technique. Open questions remain regarding the use of BEQE based on non-integrable models.
Published: 2022
Full Text: View/download PDF

39. Model-Free Generative Replay for Lifelong Reinforcement Learning: Application to Starcraft-2

Author: Daniels, Zachary, Raghavan, Aswin, Hostetler, Jesse, Rahman, Abrar, Sur, Indranil, Piacentino, Michael, and Divakaran, Ajay
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: One approach to meet the challenges of deep lifelong reinforcement learning (LRL) is careful management of the agent's learning experiences, to learn (without forgetting) and build internal meta-models (of the tasks, environments, agents, and world). Generative replay (GR) is a biologically inspired replay mechanism that augments learning experiences with self-labelled examples drawn from an internal generative model that is updated over time. We present a version of GR for LRL that satisfies two desiderata: (a) Introspective density modelling of the latent representations of policies learned using deep RL, and (b) Model-free end-to-end learning. In this paper, we study three deep learning architectures for model-free GR, starting from a na\"ive GR and adding ingredients to achieve (a) and (b). We evaluate our proposed algorithms on three different scenarios comprising tasks from the Starcraft-2 and Minigrid domains. We report several key findings showing the impact of the design choices on quantitative metrics that include transfer learning, generalization to unseen tasks, fast adaptation after task change, performance wrt task expert, and catastrophic forgetting. We observe that our GR prevents drift in the features-to-action mapping from the latent vector space of a deep RL agent. We also show improvements in established lifelong learning metrics. We find that a small random replay buffer significantly increases the stability of training. Overall, we find that "hidden replay" (a well-known architecture for class-incremental classification) is the most promising approach that pushes the state-of-the-art in GR for LRL and observe that the architecture of the sleep model might be more important for improving performance than the types of replay used. Our experiments required only 6% of training samples to achieve 80-90% of expert performance in most Starcraft-2 scenarios., Comment: Accepted to the First Conference on Lifelong Learning Agents (CoLLAs 2022)
Published: 2022

40. Towards Understanding Confusion and Affective States Under Communication Failures in Voice-Based Human-Machine Interaction

Author: Kim, Sujeong, Garlapati, Abhinav, Lubin, Jonah, Tamrakar, Amir, and Divakaran, Ajay
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: We present a series of two studies conducted to understand user's affective states during voice-based human-machine interactions. Emphasis is placed on the cases of communication errors or failures. In particular, we are interested in understanding "confusion" in relation with other affective states. The studies consist of two types of tasks: (1) related to communication with a voice-based virtual agent: speaking to the machine and understanding what the machine says, (2) non-communication related, problem-solving tasks where the participants solve puzzles and riddles but are asked to verbally explain the answers to the machine. We collected audio-visual data and self-reports of affective states of the participants. We report results of two studies and analysis of the collected data. The first study was analyzed based on the annotator's observation, and the second study was analyzed based on the self-report.
Published: 2022
Full Text: View/download PDF

41. Isolation and characterization of novel bioplasticizers from rose (Rosa damascena Mill.) petals and its suitability investigation for poly (butylene adipate-co-terephthalate) biofilm applications

Author: Edayadulla, Naushad, Divakaran, Divya, Chandraraj, Shanmuga Sundari, Suyambulingam, Indran, Jayamani, Elammaran, Sanjay, M. R., and Siengchin, Suchart
Published: 2024
Full Text: View/download PDF

42. Characterization of Calotropis gigantiea plant leaves biomass-based bioplasticizers for biofilm applications

Author: Shanmuga Sundari Chandraraj, Indran Suyambulingam, Naushad Edayadulla, Divya Divakaran, Manoj Kumar Singh, M.R. Sanjay, and Suchart Siengchin
Subjects: Calotropis gigantiea, Bioplasticizer, Biofilm, Biofiller, Biomaterial, Biomass valorization, Science (General), Q1-390, Social sciences (General), H1-99
Abstract: The present surge in environmental consciousness has pushed for the use of biodegradable plasticizers, which are sustainable and abundant in plant resources. As a result of their biocompatibility and biodegradability, Calotropis gigantiea leaf plasticizers (CLP) serve as viable alternatives to chemical plasticizers. First time, the natural plasticizers from the Calotropis leaves were extracted for this study using a suitable chemical approach that was also environmentally friendly. The XRD results showed a reduced crystallinity index of 20.2 % and a crystalline size of 5.3 nm, respectively. TGA study revealed that the CLP has good thermal stability (244 °C). Through FT-IR study, the existence of organic compounds in CLP can be investigated by key functional groups such as alcohol, amine, amide, hydrocarbon, alkene, aromatic, etc. Further the presence of alcoholic, amino, and carboxyl constituents was confirmed by UV investigation. SEM, EDAX analysis, and AFM are used to examine the surface morphology of the isolated plasticizer. SEM pictures reveal rough surfaces on the CLP surface pores, which makes them suitable for plasticizing new bioplastics with improved mechanical properties. Poly (butylene adipate-co-terephthalate) (PBAT), a biodegradable polymer matrix, was used to investigate the plasticization impact after the macromolecules were characterised. The biofilm PBAT/CLP had a thickness of 0.8 mm. In addition, the reinforcement interface was examined using scanning electron microscopy. When CLP is loaded differently in PBAT, the tensile strength and young modulus change from 15.30 to 24.60 MPa and from 137 to 168 MPa, respectively. CLP-reinforced films demonstrated better surface compatibility and enhanced flexibility at a loading of 2 % when compared to pure PBAT films. Considering several documented characteristics, CLP may prove to be an excellent plasticizer for resolving environmental issues in the future.
Published: 2024
Full Text: View/download PDF

43. SIERRA: Ranking Anomalous Activities in Enterprise Networks

Author: Lee, Jehyun, Tang, Farren, Thet, Phyo May, Yeoh, Desmond, Rybczynski, Mitch, and Divakaran, Dinil Mon
Subjects: Computer Science - Cryptography and Security
Abstract: An enterprise today deploys multiple security middleboxes such as firewalls, IDS, IPS, etc. in its network to collect different kinds of events related to threats and attacks. These events are streamed into a SIEM (Security Information and Event Management) system for analysts to investigate and respond quickly with appropriate actions. However, the number of events collected for a single enterprise can easily run into hundreds of thousands per day, much more than what analysts can investigate under a given budget constraint (time). In this work, we look into the problem of prioritizing suspicious events or anomalies to analysts for further investigation. We develop SIERRA, a system that processes event logs from multiple and diverse middleboxes to detect and rank anomalous activities. SIERRA takes an unsupervised approach and therefore has no dependence on ground truth data. Different from other works, SIERRA defines contexts, that help it to provide visual explanations of highly-ranked anomalous points to analysts, despite employing unsupervised models. We evaluate SIERRA using months of logs from multiple security middleboxes of an enterprise network. The evaluations demonstrate the capability of SIERRA to detect top anomalies in a network while outperforming naive application of existing anomaly detection algorithms as well as a state-of-the-art SIEM-based anomaly detection solution., Comment: to appear in IEEE European Symposium on Security and Privacy (EuroS&P) 2022
Published: 2022

44. Can FDG PET/CT predict AV nodal recovery in cardiac sarcoidosis?

Author: John, Leah A., Tedrow, Usha B., and Divakaran, Sanjay
Published: 2023
Full Text: View/download PDF

45. The promises and challenges of many-body quantum technologies: A focus on quantum engines

Author: Victor Mukherjee and Uma Divakaran
Subjects: Science
Abstract: Can many-body systems be beneficial to designing quantum technologies? We address this question by examining quantum engines, where recent studies indicate potential benefits through the harnessing of many-body effects, such as divergences close to phase transitions. However, open questions remain regarding their real-world applications.
Published: 2024
Full Text: View/download PDF

46. A Survey on Identifying Empathic Expression in Remote Collaboration From Empathic Computing

Author: Prabesh Paudel, Hyung-Jeong Yang, Anjitha Divakaran, Seung-Won Kim, Ji-Eun Shin, and Soo Hyung Kim
Subjects: Empathy, empathic computing, empathic response and expression, empathic intelligence, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Abstract: Empathy has emerged as a valuable tool for communication in today’s digital world, with empathic computing serving as an integral component in facilitating human-like changes such as humanoid robots and fostering comfortable interactions. This article thoroughly explores the realms of empathy, empathic computing, emotion, and their intricate interconnections. Our primary focus is to investigate methods for measuring empathy between two remote collaborators. We review empathy usage and its measurement in remote collaboration and computing process. We analyze recent empathic computing methods in virtual reality and through gaze behavior. Our analysis explores empathy, and the application of empathic computing, and analyze their corresponding responses. We conclude by providing a comprehensive understanding of empathy’s multifaceted nature and its crucial role in establishing mutual understanding and connection between remote agents.
Published: 2024
Full Text: View/download PDF

47. Markov Chain Monte Carlo-Based Machine Unlearning: Unlearning What Needs to be Forgotten

Author: Nguyen, Quoc Phong, Oikawa, Ryutaro, Divakaran, Dinil Mon, Chan, Mun Choon, and Low, Bryan Kian Hsiang
Subjects: Computer Science - Machine Learning, Computer Science - Cryptography and Security
Abstract: As the use of machine learning (ML) models is becoming increasingly popular in many real-world applications, there are practical challenges that need to be addressed for model maintenance. One such challenge is to 'undo' the effect of a specific subset of dataset used for training a model. This specific subset may contain malicious or adversarial data injected by an attacker, which affects the model performance. Another reason may be the need for a service provider to remove data pertaining to a specific user to respect the user's privacy. In both cases, the problem is to 'unlearn' a specific subset of the training data from a trained model without incurring the costly procedure of retraining the whole model from scratch. Towards this goal, this paper presents a Markov chain Monte Carlo-based machine unlearning (MCU) algorithm. MCU helps to effectively and efficiently unlearn a trained model from subsets of training dataset. Furthermore, we show that with MCU, we are able to explain the effect of a subset of a training dataset on the model prediction. Thus, MCU is useful for examining subsets of data to identify the adversarial data to be removed. Similarly, MCU can be used to erase the lineage of a user's personal data from trained ML models, thus upholding a user's "right to be forgotten". We empirically evaluate the performance of our proposed MCU algorithm on real-world phishing and diabetes datasets. Results show that MCU can achieve a desirable performance by efficiently removing the effect of a subset of training dataset and outperform an existing algorithm that utilizes the remaining dataset., Comment: Proceedings of the 2022 ACM Asia Conference on Computer and Communications Security (ASIA CCS '22), May 30-June 3, 2022, Nagasaki, Japan
Published: 2022
Full Text: View/download PDF

48. Detecting out-of-context objects using contextual cues

Author: Acharya, Manoj, Roy, Anirban, Koneripalli, Kaushik, Jha, Susmit, Kanan, Christopher, and Divakaran, Ajay
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: This paper presents an approach to detect out-of-context (OOC) objects in an image. Given an image with a set of objects, our goal is to determine if an object is inconsistent with the scene context and detect the OOC object with a bounding box. In this work, we consider commonly explored contextual relations such as co-occurrence relations, the relative size of an object with respect to other objects, and the position of the object in the scene. We posit that contextual cues are useful to determine object labels for in-context objects and inconsistent context cues are detrimental to determining object labels for out-of-context objects. To realize this hypothesis, we propose a graph contextual reasoning network (GCRN) to detect OOC objects. GCRN consists of two separate graphs to predict object labels based on the contextual cues in the image: 1) a representation graph to learn object features based on the neighboring objects and 2) a context graph to explicitly capture contextual cues from the neighboring objects. GCRN explicitly captures the contextual cues to improve the detection of in-context objects and identify objects that violate contextual relations. In order to evaluate our approach, we create a large-scale dataset by adding OOC object instances to the COCO images. We also evaluate on recent OCD benchmark. Our results show that GCRN outperforms competitive baselines in detecting OOC objects and correctly detecting in-context objects.
Published: 2022

49. Exactly Solvable 1D Quantum Models with Gamma Matrices

Author: Chugh, Yash, Dhochak, Kusum, Divakaran, Uma, Narayan, Prithvi, and Pal, Amit Kumar
Subjects: Condensed Matter - Statistical Mechanics, Condensed Matter - Strongly Correlated Electrons, High Energy Physics - Theory, Quantum Physics
Abstract: In this paper, we write exactly solvable generalizations of 1-dimensional quantum XY and Ising-like models by using $2^d$-dimensional Gamma ($\Gamma$) matrices as the degrees of freedom on each site. We show that these models result in quadratic Fermionic Hamiltonians with Jordan-Wigner like transformations. We illustrate the techniques using a specific case of 4-dimensional $\Gamma$ matrices and explore the quantum phase transitions present in the model., Comment: 25 Pages, 4 Figures
Published: 2022
Full Text: View/download PDF

50. Comprehensive characterization of novel bioplasticizer from Pandanus tectorius leaves: a sustainable biomaterial for biofilm applications

Author: Boominathan, Senthilkumar, Suyambulingam, Indran, Narayanaperumal, Sunesh, Divakaran, Divya, Senthamaraikannan, P., and Siengchin, Suchart
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

441 results on '"P. P. Divakaran"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources