Author: "A. Jose Albin" / Search Limiters: Full Text - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"A. Jose Albin"' showing total 7 results

Start Over Author "A. Jose Albin" Search Limiters Full Text

7 results on '"A. Jose Albin"'

1. TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights

Author: Liu, Aiwei, Bai, Haoping, Lu, Zhiyun, Sun, Yanchao, Kong, Xiang, Wang, Simon, Shan, Jiulong, Jose, Albin Madappally, Liu, Xiaojiang, Wen, Lijie, Yu, Philip S., and Cao, Meng
Subjects: Computer Science - Computation and Language, 68T50, I.2.7
Abstract: Direct Preference Optimization (DPO) has been widely adopted for preference alignment of Large Language Models (LLMs) due to its simplicity and effectiveness. However, DPO is derived as a bandit problem in which the whole response is treated as a single arm, ignoring the importance differences between tokens, which may affect optimization efficiency and make it difficult to achieve optimal results. In this work, we propose that the optimal data for DPO has equal expected rewards for each token in winning and losing responses, as there is no difference in token importance. However, since the optimal dataset is unavailable in practice, we propose using the original dataset for importance sampling to achieve unbiased optimization. Accordingly, we propose a token-level importance sampling DPO objective named TIS-DPO that assigns importance weights to each token based on its reward. Inspired by previous works, we estimate the token importance weights using the difference in prediction probabilities from a pair of contrastive LLMs. We explore three methods to construct these contrastive LLMs: (1) guiding the original LLM with contrastive prompts, (2) training two separate LLMs using winning and losing responses, and (3) performing forward and reverse DPO training with winning and losing responses. Experiments show that TIS-DPO significantly outperforms various baseline methods on harmlessness and helpfulness alignment and summarization tasks. We also visualize the estimated weights, demonstrating their ability to identify key token positions., Comment: 27 pages, 7 figures, 2 tables
Published: 2024

2. Apple Intelligence Foundation Language Models

Author: Gunter, Tom, Wang, Zirui, Wang, Chong, Pang, Ruoming, Narayanan, Andy, Zhang, Aonan, Zhang, Bowen, Chen, Chen, Chiu, Chung-Cheng, Qiu, David, Gopinath, Deepak, Yap, Dian Ang, Yin, Dong, Nan, Feng, Weers, Floris, Yin, Guoli, Huang, Haoshuo, Wang, Jianyu, Lu, Jiarui, Peebles, John, Ye, Ke, Lee, Mark, Du, Nan, Chen, Qibin, Keunebroek, Quentin, Wiseman, Sam, Evans, Syd, Lei, Tao, Rathod, Vivek, Kong, Xiang, Du, Xianzhi, Li, Yanghao, Wang, Yongqiang, Gao, Yuan, Ahmed, Zaid, Xu, Zhaoyang, Lu, Zhiyun, Rashid, Al, Jose, Albin Madappally, Doane, Alec, Bencomo, Alfredo, Vanderby, Allison, Hansen, Andrew, Jain, Ankur, Anupama, Anupama Mann, Kamal, Areeba, Wu, Bugu, Brum, Carolina, Maalouf, Charlie, Erdenebileg, Chinguun, Dulhanty, Chris, Moritz, Dominik, Kang, Doug, Jimenez, Eduardo, Ladd, Evan, Shi, Fangping, Bai, Felix, Chu, Frank, Hohman, Fred, Kotek, Hadas, Coleman, Hannah Gillis, Li, Jane, Bigham, Jeffrey, Cao, Jeffery, Lai, Jeff, Cheung, Jessica, Shan, Jiulong, Zhou, Joe, Li, John, Qin, Jun, Singh, Karanjeet, Vega, Karla, Zou, Kelvin, Heckman, Laura, Gardiner, Lauren, Bowler, Margit, Cordell, Maria, Cao, Meng, Hay, Nicole, Shahdadpuri, Nilesh, Godwin, Otto, Dighe, Pranay, Rachapudi, Pushyami, Tantawi, Ramsey, Frigg, Roman, Davarnia, Sam, Shah, Sanskruti, Guha, Saptarshi, Sirovica, Sasha, Ma, Shen, Ma, Shuang, Wang, Simon, Kim, Sulgi, Jayaram, Suma, Shankar, Vaishaal, Paidi, Varsha, Kumar, Vivek, Wang, Xin, Zheng, Xin, Cheng, Walker, Shrager, Yael, Ye, Yang, Tanaka, Yasu, Guo, Yihao, Meng, Yunsong, Luo, Zhao Tang, Ouyang, Zhi, Aygar, Alp, Wan, Alvin, Walkingshaw, Andrew, Lin, Antonie, Farooq, Arsalan, Ramerth, Brent, Reed, Colorado, Bartels, Chris, Chaney, Chris, Riazati, David, Yang, Eric Liang, Feldman, Erin, Hochstrasser, Gabriel, Seguin, Guillaume, Belousova, Irina, Pelemans, Joris, Yang, Karen, Vahid, Keivan Alizadeh, Cao, Liangliang, Najibi, Mahyar, Zuliani, Marco, Horton, Max, Cho, Minsik, Bhendawade, Nikhil, Dong, Patrick, Maj, Piotr, Agrawal, Pulkit, Shan, Qi, Fu, Qichen, Poston, Regan, Xu, Sam, Liu, Shuangning, Rao, Sushma, Heeramun, Tashweena, Merth, Thomas, Rayala, Uday, Cui, Victor, Sridhar, Vivek Rangarajan, Zhang, Wencong, Zhang, Wenqi, Wu, Wentao, Zhou, Xingyu, Liu, Xinwen, Zhao, Yang, Xia, Yin, Ren, Zhile, and Ren, Zhongzheng
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute. These models are designed to perform a wide range of tasks efficiently, accurately, and responsibly. This report describes the model architecture, the data used to train the model, the training process, how the models are optimized for inference, and the evaluation results. We highlight our focus on Responsible AI and how the principles are applied throughout the model development.
Published: 2024

3. Data Filtering Networks

Author: Fang, Alex, Jose, Albin Madappally, Jain, Amit, Schmidt, Ludwig, Toshev, Alexander, and Shankar, Vaishaal
Subjects: Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Large training sets have become a cornerstone of machine learning and are the foundation for recent advances in language modeling and multimodal learning. While data curation for pre-training is often still ad-hoc, one common paradigm is to first collect a massive pool of data from the Web and then filter this candidate pool down to an actual training set via various heuristics. In this work, we study the problem of learning a data filtering network (DFN) for this second step of filtering a large uncurated dataset. Our key finding is that the quality of a network for filtering is distinct from its performance on downstream tasks: for instance, a model that performs well on ImageNet can yield worse training sets than a model with low ImageNet accuracy that is trained on a small amount of high-quality data. Based on our insights, we construct new data filtering networks that induce state-of-the-art image-text datasets. Specifically, our best performing dataset DFN-5B enables us to train state-of-the-art CLIP models for their compute budgets: among other improvements on a variety of tasks, a ViT-H trained on our dataset achieves 84.4% zero-shot transfer accuracy on ImageNet, out-performing models trained on other datasets such as LAION-2B, DataComp-1B, or OpenAI's WIT. In order to facilitate further research in dataset design, we also release a new 2 billion example dataset DFN-2B and show that high performance data filtering networks can be trained from scratch using only publicly available data.
Published: 2023

4. STAIR: Learning Sparse Text and Image Representation in Grounded Tokens

Author: Chen, Chen, Zhang, Bowen, Cao, Liangliang, Shen, Jiguang, Gunter, Tom, Jose, Albin Madappally, Toshev, Alexander, Shlens, Jonathon, Pang, Ruoming, and Yang, Yinfei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Image and text retrieval is one of the foundational tasks in the vision and language domain with multiple real-world applications. State-of-the-art approaches, e.g. CLIP, ALIGN, represent images and texts as dense embeddings and calculate the similarity in the dense embedding space as the matching score. On the other hand, sparse semantic features like bag-of-words models are more interpretable, but believed to suffer from inferior accuracy than dense representations. In this work, we show that it is possible to build a sparse semantic representation that is as powerful as, or even better than, dense presentations. We extend the CLIP model and build a sparse text and image representation (STAIR), where the image and text are mapped to a sparse token space. Each token in the space is a (sub-)word in the vocabulary, which is not only interpretable but also easy to integrate with existing information retrieval systems. STAIR model significantly outperforms a CLIP model with +$4.9\%$ and +$4.3\%$ absolute Recall@1 improvement on COCO-5k text$\rightarrow$image and image$\rightarrow$text retrieval respectively. It also achieved better performance on both of ImageNet zero-shot and linear probing compared to CLIP.
Published: 2023

5. Implications of an extensive salt water barrage on the distribution of black clam in a tropical estuarine system, Southwest coast of India

Author: Nagarathinam, Arunpandi, Retnamma, Jyothibabu, Loganathan, Jagadeesan, Singaram, Parthasarathi, Mohanan Kannampally Madam, Savitha, Jose, Albin Konnakkamannil, and Subramanian, Pandiyarajan Rethinam
Published: 2021
Full Text: View/download PDF

6. Impact of human-altered hydrographical setting on the Copepod community structure in an extensive tropical estuary along the southwest coast of India

Author: Nagarathinam, Arunpandi, Retnamma, Jyothibabu, Loganathan, Jagadeesan, Singaram, Parthasarathi, Arayillath, Anjusha, and Jose, Albin Konnakkamannil
Published: 2021
Full Text: View/download PDF

7. A Study on Impact of Self Medication in Adults and Paediatrics in COVID-19 Pandemic.

Author: S., Sathya Narayana, Jose, Albin, S., Arun, M., Prashanth, Mani, Muthukumar, R., Srinivasan, and V., Saikrupa B.
Subjects: SELF medication, COVID-19, COVID-19 pandemic, DRUG-food interactions, ADULTS
Abstract: Background: Self-medication is defined as the use of medicinal products by the consumer to treat self-recognized disorders or symptoms, or the intermittent or continued use of a medication prescribed by a physician for chronic or recurring diseases or symptoms, which often is accompanied by potential risks to the consumers, including toxicity, therapeutic failure and drug-drug as well as drug-food interactions. The practice of self-medication has exponentially risen during the Covid-19 pandemic due to fear of visiting healthcare setups and subsequently contracting infections. Aim: The present study aims to determine the prevalence of self-medication in adults and pediatric patients during Covid-19. settings and Design: A cross-sectional community-based survey study was conducted among 556 participants, including pediatric and adult populations, for a period of 6 months, through an online platform. Methods and Material: The data was collected using a self-administered questionnaire, which included informed consent, questions on patient demographics and self-medication practices, and plausible reasons for it. The collected data was analyzed using SPSS 22.0, and R environment ver.3.2.2 software. Results: Out of 556 participants the prevalence of self-medicating with paracetamol was considered to be high both in adults and paediatrics which was found to be 72.6% (319) and 66.6% (117) respectively. During Covid-19 symptoms, it was reported that Azithromycin consumption was found to be more than paracetamol. It also showed that 62.9% (439) were self-medicating before the Covid19 pandemic and 20.5 % (439) were not aware of the risks associated with self-medication. The major source of self-medication was found to be Google (46.5%) followed by using old prescriptions (121.5%) and using family or friends’ prescriptions (10.9%). Conclusions: The study showed a conclusive rise in self-medication practices during the Covid pandemic among the SM-naïve population. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

7 results on '"A. Jose Albin"'

1. TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights

2. Apple Intelligence Foundation Language Models

3. Data Filtering Networks

4. STAIR: Learning Sparse Text and Image Representation in Grounded Tokens

5. Implications of an extensive salt water barrage on the distribution of black clam in a tropical estuarine system, Southwest coast of India

6. Impact of human-altered hydrographical setting on the Copepod community structure in an extensive tropical estuary along the southwest coast of India

7. A Study on Impact of Self Medication in Adults and Paediatrics in COVID-19 Pandemic.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

7 results on '"A. Jose Albin"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources