Author: "Fuxman, Ariel" / Publication Type: Electronic Resources - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Fuxman, Ariel"' showing total 11 results

Start Over Author "Fuxman, Ariel" Publication Type Electronic Resources

11 results on '"Fuxman, Ariel"'

1. Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use

Author: Toubal, Imad Eddine, Avinash, Aditya, Alldrin, Neil Gordon, Dlabal, Jan, Zhou, Wenlei, Luo, Enming, Stretcu, Otilia, Xiong, Hao, Lu, Chun-Ta, Zhou, Howard, Krishna, Ranjay, Fuxman, Ariel, Duerig, Tom, Toubal, Imad Eddine, Avinash, Aditya, Alldrin, Neil Gordon, Dlabal, Jan, Zhou, Wenlei, Luo, Enming, Stretcu, Otilia, Xiong, Hao, Lu, Chun-Ta, Zhou, Howard, Krishna, Ranjay, Fuxman, Ariel, and Duerig, Tom
Abstract: From content moderation to wildlife conservation, the number of applications that require models to recognize nuanced or subjective visual concepts is growing. Traditionally, developing classifiers for such concepts requires substantial manual effort measured in hours, days, or even months to identify and annotate data needed for training. Even with recently proposed Agile Modeling techniques, which enable rapid bootstrapping of image classifiers, users are still required to spend 30 minutes or more of monotonous, repetitive data labeling just to train a single classifier. Drawing on Fiske's Cognitive Miser theory, we propose a new framework that alleviates manual effort by replacing human labeling with natural language interactions, reducing the total effort required to define a concept by an order of magnitude: from labeling 2,000 images to only 100 plus some natural language interactions. Our framework leverages recent advances in foundation models, both large language models and vision-language models, to carve out the concept space through conversation and by automatically labeling training data points. Most importantly, our framework eliminates the need for crowd-sourced annotations. Moreover, our framework ultimately produces lightweight classification models that are deployable in cost-sensitive scenarios. Across 15 subjective concepts and across 2 public image classification datasets, our trained models outperform traditional Agile Modeling as well as state-of-the-art zero-shot classification models like ALIGN, CLIP, CuPL, and large visual question-answering models like PaLI-X.
Published: 2024

2. Scaling Up LLM Reviews for Google Ads Content Moderation

Author: Qiao, Wei, Dogra, Tushar, Stretcu, Otilia, Lyu, Yu-Han, Fang, Tiantian, Kwon, Dongjin, Lu, Chun-Ta, Luo, Enming, Wang, Yuan, Chia, Chih-Chun, Fuxman, Ariel, Wang, Fangzhou, Krishna, Ranjay, Tek, Mehmet, Qiao, Wei, Dogra, Tushar, Stretcu, Otilia, Lyu, Yu-Han, Fang, Tiantian, Kwon, Dongjin, Lu, Chun-Ta, Luo, Enming, Wang, Yuan, Chia, Chih-Chun, Fuxman, Ariel, Wang, Fangzhou, Krishna, Ranjay, and Tek, Mehmet
Abstract: Large language models (LLMs) are powerful tools for content moderation, but their inference costs and latency make them prohibitive for casual use on large datasets, such as the Google Ads repository. This study proposes a method for scaling up LLM reviews for content moderation in Google Ads. First, we use heuristics to select candidates via filtering and duplicate removal, and create clusters of ads for which we select one representative ad per cluster. We then use LLMs to review only the representative ads. Finally, we propagate the LLM decisions for the representative ads back to their clusters. This method reduces the number of reviews by more than 3 orders of magnitude while achieving a 2x recall compared to a baseline non-LLM model. The success of this approach is a strong function of the representations used in clustering and label propagation; we found that cross-modal similarity representations yield better results than uni-modal representations.
Published: 2024
Full Text: View/download PDF

3. Agile Modeling: From Concept to Classifier in Minutes

Author: Stretcu, Otilia, Vendrow, Edward, Hata, Kenji, Viswanathan, Krishnamurthy, Ferrari, Vittorio, Tavakkol, Sasan, Zhou, Wenlei, Avinash, Aditya, Luo, Enming, Alldrin, Neil Gordon, Bateni, MohammadHossein, Berger, Gabriel, Bunner, Andrew, Lu, Chun-Ta, Rey, Javier A, DeSalvo, Giulia, Krishna, Ranjay, Fuxman, Ariel, Stretcu, Otilia, Vendrow, Edward, Hata, Kenji, Viswanathan, Krishnamurthy, Ferrari, Vittorio, Tavakkol, Sasan, Zhou, Wenlei, Avinash, Aditya, Luo, Enming, Alldrin, Neil Gordon, Bateni, MohammadHossein, Berger, Gabriel, Bunner, Andrew, Lu, Chun-Ta, Rey, Javier A, DeSalvo, Giulia, Krishna, Ranjay, and Fuxman, Ariel
Abstract: The application of computer vision to nuanced subjective use cases is growing. While crowdsourcing has served the vision community well for most objective tasks (such as labeling a "zebra"), it now falters on tasks where there is substantial subjectivity in the concept (such as identifying "gourmet tuna"). However, empowering any user to develop a classifier for their concept is technically difficult: users are neither machine learning experts, nor have the patience to label thousands of examples. In reaction, we introduce the problem of Agile Modeling: the process of turning any subjective visual concept into a computer vision model through a real-time user-in-the-loop interactions. We instantiate an Agile Modeling prototype for image classification and show through a user study (N=14) that users can create classifiers with minimal effort under 30 minutes. We compare this user driven process with the traditional crowdsourcing paradigm and find that the crowd's notion often differs from that of the user's, especially as the concepts become more subjective. Finally, we scale our experiments with simulations of users training classifiers for ImageNet21k categories to further demonstrate the efficacy.
Published: 2023

4. Benchmarking Robustness to Adversarial Image Obfuscations

Author: Stimberg, Florian, Chakrabarti, Ayan, Lu, Chun-Ta, Hazimeh, Hussein, Stretcu, Otilia, Qiao, Wei, Liu, Yintao, Kaya, Merve, Rashtchian, Cyrus, Fuxman, Ariel, Tek, Mehmet, Gowal, Sven, Stimberg, Florian, Chakrabarti, Ayan, Lu, Chun-Ta, Hazimeh, Hussein, Stretcu, Otilia, Qiao, Wei, Liu, Yintao, Kaya, Merve, Rashtchian, Cyrus, Fuxman, Ariel, Tek, Mehmet, and Gowal, Sven
Abstract: Automated content filtering and moderation is an important tool that allows online platforms to build striving user communities that facilitate cooperation and prevent abuse. Unfortunately, resourceful actors try to bypass automated filters in a bid to post content that violate platform policies and codes of conduct. To reach this goal, these malicious actors may obfuscate policy violating images (e.g. overlay harmful images by carefully selected benign images or visual patterns) to prevent machine learning models from reaching the correct decision. In this paper, we invite researchers to tackle this specific issue and present a new image benchmark. This benchmark, based on ImageNet, simulates the type of obfuscations created by malicious actors. It goes beyond ImageNet-$\textrm{C}$ and ImageNet-$\bar{\textrm{C}}$ by proposing general, drastic, adversarial modifications that preserve the original content intent. It aims to tackle a more common adversarial threat than the one considered by $\ell_p$-norm bounded adversaries. We evaluate 33 pretrained models on the benchmark and train models with different augmentations, architectures and training methods on subsets of the obfuscations to measure generalization. We hope this benchmark will encourage researchers to test their models and methods and try to find new approaches that are more robust to these obfuscations.
Published: 2023

5. Towards Understanding the Effect of Pretraining Label Granularity

Author: Hong, Guan Zhe, Cui, Yin, Fuxman, Ariel, Chan, Stanley H., Luo, Enming, Hong, Guan Zhe, Cui, Yin, Fuxman, Ariel, Chan, Stanley H., and Luo, Enming
Abstract: In this paper, we study how the granularity of pretraining labels affects the generalization of deep neural networks in image classification tasks. We focus on the "fine-to-coarse" transfer learning setting, where the pretraining label space is more fine-grained than that of the target problem. Empirically, we show that pretraining on the leaf labels of ImageNet21k produces better transfer results on ImageNet1k than pretraining on other coarser granularity levels, which supports the common practice used in the community. Theoretically, we explain the benefit of fine-grained pretraining by proving that, for a data distribution satisfying certain hierarchy conditions, 1) coarse-grained pretraining only allows a neural network to learn the "common" or "easy-to-learn" features well, while 2) fine-grained pretraining helps the network learn the "rarer" or "fine-grained" features in addition to the common ones, thus improving its accuracy on hard downstream test samples in which common features are missing or weak in strength. Furthermore, we perform comprehensive experiments using the label hierarchies of iNaturalist 2021 and observe that the following conditions, in addition to proper choice of label granularity, enable the transfer to work well in practice: 1) the pretraining dataset needs to have a meaningful label hierarchy, and 2) the pretraining and target label functions need to align well.
Published: 2023

6. Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models

Author: Hu, Yushi, Stretcu, Otilia, Lu, Chun-Ta, Viswanathan, Krishnamurthy, Hata, Kenji, Luo, Enming, Krishna, Ranjay, Fuxman, Ariel, Hu, Yushi, Stretcu, Otilia, Lu, Chun-Ta, Viswanathan, Krishnamurthy, Hata, Kenji, Luo, Enming, Krishna, Ranjay, and Fuxman, Ariel
Abstract: Solving complex visual tasks such as "Who invented the musical instrument on the right?" involves a composition of skills: understanding space, recognizing instruments, and also retrieving prior knowledge. Recent work shows promise by decomposing such tasks using a large language model (LLM) into an executable program that invokes specialized vision models. However, generated programs are error-prone: they omit necessary steps, include spurious ones, and are unable to recover when the specialized models give incorrect outputs. Moreover, they require loading multiple models, incurring high latency and computation costs. We propose Visual Program Distillation (VPD), an instruction tuning framework that produces a vision-language model (VLM) capable of solving complex visual tasks with a single forward pass. VPD distills the reasoning ability of LLMs by using them to sample multiple candidate programs, which are then executed and verified to identify a correct one. It translates each correct program into a language description of the reasoning steps, which are then distilled into a VLM. Extensive experiments show that VPD improves the VLM's ability to count, understand spatial relations, and reason compositionally. Our VPD-trained PaLI-X outperforms all prior VLMs, achieving state-of-the-art performance across complex vision tasks, including MMBench, OK-VQA, A-OKVQA, TallyQA, POPE, and Hateful Memes. An evaluation with human annotators also confirms that VPD improves model response factuality and consistency. Finally, experiments on content moderation demonstrate that VPD is also helpful for adaptation to real-world applications with limited data., Comment: CVPR 2024 Oral
Published: 2023

7. CARLS: Cross-platform Asynchronous Representation Learning System

Author: Lu, Chun-Ta, Zeng, Yun, Juan, Da-Cheng, Fan, Yicheng, Li, Zhe, Dlabal, Jan, Chen, Yi-Ting, Gopalan, Arjun, Heydon, Allan, Ferng, Chun-Sung, Miyara, Reah, Fuxman, Ariel, Peng, Futang, Li, Zhen, Duerig, Tom, Tomkins, Andrew, Lu, Chun-Ta, Zeng, Yun, Juan, Da-Cheng, Fan, Yicheng, Li, Zhe, Dlabal, Jan, Chen, Yi-Ting, Gopalan, Arjun, Heydon, Allan, Ferng, Chun-Sung, Miyara, Reah, Fuxman, Ariel, Peng, Futang, Li, Zhen, Duerig, Tom, and Tomkins, Andrew
Abstract: In this work, we propose CARLS, a novel framework for augmenting the capacity of existing deep learning frameworks by enabling multiple components -- model trainers, knowledge makers and knowledge banks -- to concertedly work together in an asynchronous fashion across hardware platforms. The proposed CARLS is particularly suitable for learning paradigms where model training benefits from additional knowledge inferred or discovered during training, such as node embeddings for graph neural networks or reliable pseudo labels from model predictions. We also describe three learning paradigms -- semi-supervised learning, curriculum learning and multimodal learning -- as examples that can be scaled up efficiently by CARLS. One version of CARLS has been open-sourced and available for download at: https://github.com/tensorflow/neural-structured-learning/tree/master/research/carls
Published: 2021

8. Synthesizing Products for Online Catalogs

Author: Nguyen, Hoa, Fuxman, Ariel, Paparizos, Stelios, Freire, Juliana, Agrawal, Rakesh, Nguyen, Hoa, Fuxman, Ariel, Paparizos, Stelios, Freire, Juliana, and Agrawal, Rakesh
Abstract: A high-quality, comprehensive product catalog is essential to the success of Product Search engines and shopping sites such as Yahoo! Shopping, Google Product Search or Bing Shopping. But keeping catalogs up-to-date becomes a challenging task, calling for the need of automated techniques. In this paper, we introduce the problem of product synthesis, a key component of catalog creation and maintenance. Given a set of offers advertised by merchants, the goal is to identify new products and add them to the catalog together with their (structured) attributes. A fundamental challenge is the scale of the problem: a Product Search engine receives data from thousands of merchants and millions of products; the product taxonomy contains thousands of categories, where each category comes in a different schema; and merchants use representations for products that are different from the ones used in the catalog of the Product Search engine. We propose a system that provides an end-to-end solution to the product synthesis problem, and includes components for extraction, and addresses issues involved in data extraction from offers, schema reconciliation, and data fusion. We developed a novel and scalable technique for schema matching which leverages knowledge about previously-known instance-level associations between offers and products; and it is trained using automatically created training sets (no manually-labeled data is needed). We present an experimental evaluation of our system using data from Bing Shopping for more than 800K offers, a thousand merchants, and 400 categories. The evaluation confirms that our approach is able to automatically generate a large number of accurate product specifications, and that our schema reconciliation component outperforms state-of-the-art schema matching techniques in terms of precision and recall., Comment: VLDB2011
Published: 2011

9. Schema Management

Author: Andritsos, Periklis, Fagin, Ronald, Fuxman, Ariel, Haas, Laura M., Hernandez, Mauricio A., Ho, Howard, Kementsietsidis, Anastasios, Kolaitis, Phokion G., Miller, Renée J., Naumann, Felix, Popa, Lucian, Velegrakis, Yannis, Andritsos, Periklis, Fagin, Ronald, Fuxman, Ariel, Haas, Laura M., Hernandez, Mauricio A., Ho, Howard, Kementsietsidis, Anastasios, Kolaitis, Phokion G., Miller, Renée J., Naumann, Felix, Popa, Lucian, and Velegrakis, Yannis
Abstract: Peer Reviewed
Published: 2002

10. Schema Management

Author: Andritsos, Periklis, Fagin, Ronald, Fuxman, Ariel, Haas, Laura M., Hernandez, Mauricio A., Ho, Howard, Kementsietsidis, Anastasios, Kolaitis, Phokion G., Miller, Renée J., Naumann, Felix, Popa, Lucian, Velegrakis, Yannis, Andritsos, Periklis, Fagin, Ronald, Fuxman, Ariel, Haas, Laura M., Hernandez, Mauricio A., Ho, Howard, Kementsietsidis, Anastasios, Kolaitis, Phokion G., Miller, Renée J., Naumann, Felix, Popa, Lucian, and Velegrakis, Yannis
Abstract: Peer Reviewed
Published: 2002

11. Information systems as social structures

Author: Louvain School of Management - Operations and Information, University of Toronto - Computer Science, University of Trento - Computer Science, UCL - SSH/ILSM/ILSM - Research Institute of Louvain School of Management, UCL - SST/ICTM/INGI - Pôle en ingénierie informatique, Fuxman, Ariel, Giorgini, Paolo, Kolp, Manuel, Mylopoulos, John, 2nd International Conference on Formal Ontology in Information Systems, FOIS 2001, Louvain School of Management - Operations and Information, University of Toronto - Computer Science, University of Trento - Computer Science, UCL - SSH/ILSM/ILSM - Research Institute of Louvain School of Management, UCL - SST/ICTM/INGI - Pôle en ingénierie informatique, Fuxman, Ariel, Giorgini, Paolo, Kolp, Manuel, Mylopoulos, John, and 2nd International Conference on Formal Ontology in Information Systems, FOIS 2001
Published: 2001

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

11 results on '"Fuxman, Ariel"'

1. Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use

2. Scaling Up LLM Reviews for Google Ads Content Moderation

3. Agile Modeling: From Concept to Classifier in Minutes

4. Benchmarking Robustness to Adversarial Image Obfuscations

5. Towards Understanding the Effect of Pretraining Label Granularity

6. Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models

7. CARLS: Cross-platform Asynchronous Representation Learning System

8. Synthesizing Products for Online Catalogs

9. Schema Management

10. Schema Management

11. Information systems as social structures

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Publication Year Range

Publication Type

Database

Publisher

11 results on '"Fuxman, Ariel"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources