Author: "Tomkins, Andrew" / Publication Type: Reports - Searchworks@Jio Institute Digital Library Search Results

1. Substance or Style: What Does Your Image Embedding Know?

Author: Rashtchian, Cyrus, Herrmann, Charles, Ferng, Chun-Sung, Chakrabarti, Ayan, Krishnan, Dilip, Sun, Deqing, Juan, Da-Cheng, and Tomkins, Andrew
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: Probes are small networks that predict properties of underlying data from embeddings, and they provide a targeted, effective way to illuminate the information contained in embeddings. While analysis through the use of probes has become standard in NLP, there has been much less exploration in vision. Image foundation models have primarily been evaluated for semantic content. Better understanding the non-semantic information in popular embeddings (e.g., MAE, SimCLR, or CLIP) will shed new light both on the training algorithms and on the uses for these foundation models. We design a systematic transformation prediction task and measure the visual content of embeddings along many axes, including image style, quality, and a range of natural and artificial transformations. Surprisingly, six embeddings (including SimCLR) encode enough non-semantic information to identify dozens of transformations. We also consider a generalization task, where we group similar transformations and hold out several for testing. We find that image-text models (CLIP and ALIGN) are better at recognizing new examples of style transfer than masking-based models (CAN and MAE). Overall, our results suggest that the choice of pre-training algorithm impacts the types of information in the embedding, and certain models are better than others for non-semantic downstream tasks., Comment: 27 pages, 9 figures
Published: 2023

2. Approximating a RUM from Distributions on k-Slates

Author: Chierichetti, Flavio, Giacchini, Mirko, Kumar, Ravi, Panconesi, Alessandro, and Tomkins, Andrew
Subjects: Computer Science - Machine Learning, Computer Science - Data Structures and Algorithms
Abstract: In this work we consider the problem of fitting Random Utility Models (RUMs) to user choices. Given the winner distributions of the subsets of size $k$ of a universe, we obtain a polynomial-time algorithm that finds the RUM that best approximates the given distribution on average. Our algorithm is based on a linear program that we solve using the ellipsoid method. Given that its corresponding separation oracle problem is NP-hard, we devise an approximate separation oracle that can be viewed as a generalization of the weighted feedback arc set problem to hypergraphs. Our theoretical result can also be made practical: we obtain a heuristic that is effective and scales to real-world datasets.
Published: 2023

3. Quantifying the sustainability impact of Google Maps: A case study of Salt Lake City

Author: Arora, Neha, Cabannes, Theophile, Ganapathy, Sanjay, Li, Yechen, McAfee, Preston, Nunkesser, Marc, Osorio, Carolina, Tomkins, Andrew, and Tsogsuren, Iveel
Subjects: Physics - Physics and Society, Computer Science - Multiagent Systems
Abstract: Google Maps uses current and historical traffic trends to provide routes to drivers. In this paper, we use microscopic traffic simulation to quantify the improvements to both travel time and CO$_2$ emissions from Google Maps real-time navigation. A case study in Salt Lake City shows that Google Maps users are, on average, saving 1.7% of CO$_2$ emissions and 6.5% travel time. If we restrict to the users for which Google Maps finds a different route than their original route, the average savings are 3.4% of CO$_2$ emissions and 12.5% of travel time. These results are based on traffic conditions observed during the Covid-19 pandemic. As congestion gradually builds back up to pre-pandemic levels, it is expected to lead to even greater savings in emissions., Comment: 7 pages, 3 figures
Published: 2021

4. An Efficient Simulation-Based Travel Demand Calibration Algorithm for Large-Scale Metropolitan Traffic Models

Author: Arora, Neha, Chen, Yi-fan, Ganapathy, Sanjay, Li, Yechen, Lin, Ziheng, Osorio, Carolina, Tomkins, Andrew, and Tsogsuren, Iveel
Subjects: Computer Science - Multiagent Systems
Abstract: Metropolitan scale vehicular traffic modeling is used by a variety of private and public sector urban mobility stakeholders to inform the design and operations of road networks. High-resolution stochastic traffic simulators are increasingly used to describe detailed demand-supply interactions. The design of efficient calibration techniques remains a major challenge. This paper considers a class of high-dimensional calibration problems known as origin-destination (OD) calibration. We formulate the problem as a continuous simulation-based optimization problem. Our proposed algorithm builds upon recent metamodel methods that tackle the simulation-based problem by solving a sequence of approximate analytical optimization problems, which rely on the use of analytical network models. In this paper, we formulate a network model defined as a system of linear equations, the dimension of which scales linearly with the number of roads with field data and independently of the dimension of the route choice set. This makes the approach suitable for large-scale metropolitan networks. The approach has enhanced efficiency compared with past metamodel formulations that are based on systems of nonlinear, rather than linear, equations. It also has enhanced efficiency compared to traditional calibration methods that resort to simulation-based estimates of traffic assignment matrices, while the proposed approach uses analytical approximations of these matrices. We benchmark the approach considering a peak period Salt Lake City case study and calibrate based on field vehicular count data. The new formulation yields solutions with good performance and is suitable for large-scale road networks., Comment: 14 pages, 4 figures
Published: 2021

5. CARLS: Cross-platform Asynchronous Representation Learning System

Author: Lu, Chun-Ta, Zeng, Yun, Juan, Da-Cheng, Fan, Yicheng, Li, Zhe, Dlabal, Jan, Chen, Yi-Ting, Gopalan, Arjun, Heydon, Allan, Ferng, Chun-Sung, Miyara, Reah, Fuxman, Ariel, Peng, Futang, Li, Zhen, Duerig, Tom, and Tomkins, Andrew
Subjects: Computer Science - Machine Learning
Abstract: In this work, we propose CARLS, a novel framework for augmenting the capacity of existing deep learning frameworks by enabling multiple components -- model trainers, knowledge makers and knowledge banks -- to concertedly work together in an asynchronous fashion across hardware platforms. The proposed CARLS is particularly suitable for learning paradigms where model training benefits from additional knowledge inferred or discovered during training, such as node embeddings for graph neural networks or reliable pseudo labels from model predictions. We also describe three learning paradigms -- semi-supervised learning, curriculum learning and multimodal learning -- as examples that can be scaled up efficiently by CARLS. One version of CARLS has been open-sourced and available for download at: https://github.com/tensorflow/neural-structured-learning/tree/master/research/carls
Published: 2021

6. Graph Autoencoders with Deconvolutional Networks

Author: Li, Jia, Yu, Tomas, Juan, Da-Cheng, Gopalan, Arjun, Cheng, Hong, and Tomkins, Andrew
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Recent studies have indicated that Graph Convolutional Networks (GCNs) act as a \emph{low pass} filter in spectral domain and encode smoothed node representations. In this paper, we consider their opposite, namely Graph Deconvolutional Networks (GDNs) that reconstruct graph signals from smoothed node representations. We motivate the design of Graph Deconvolutional Networks via a combination of inverse filters in spectral domain and de-noising layers in wavelet domain, as the inverse operation results in a \emph{high pass} filter and may amplify the noise. Based on the proposed GDN, we further propose a graph autoencoder framework that first encodes smoothed graph representations with GCN and then decodes accurate graph signals with GDN. We demonstrate the effectiveness of the proposed method on several tasks including unsupervised graph-level representation , social recommendation and graph generation
Published: 2020

7. Adversarial Robustness Across Representation Spaces

Author: Awasthi, Pranjal, Yu, George, Ferng, Chun-Sung, Tomkins, Andrew, and Juan, Da-Cheng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Adversarial robustness corresponds to the susceptibility of deep neural networks to imperceptible perturbations made at test time. In the context of image tasks, many algorithms have been proposed to make neural networks robust to adversarial perturbations made to the input pixels. These perturbations are typically measured in an $\ell_p$ norm. However, robustness often holds only for the specific attack used for training. In this work we extend the above setting to consider the problem of training of deep neural networks that can be made simultaneously robust to perturbations applied in multiple natural representation spaces. For the case of image data, examples include the standard pixel representation as well as the representation in the discrete cosine transform~(DCT) basis. We design a theoretically sound algorithm with formal guarantees for the above problem. Furthermore, our guarantees also hold when the goal is to require robustness with respect to multiple $\ell_p$ norm based attacks. We then derive an efficient practical implementation and demonstrate the effectiveness of our approach on standard datasets for image classification.
Published: 2020

8. Surprise: Result List Truncation via Extreme Value Theory

Author: Bahri, Dara, Zheng, Che, Tay, Yi, Metzler, Donald, and Tomkins, Andrew
Subjects: Computer Science - Information Retrieval, Computer Science - Machine Learning, Statistics - Applications
Abstract: Work in information retrieval has largely been centered around ranking and relevance: given a query, return some number of results ordered by relevance to the user. The problem of result list truncation, or where to truncate the ranked list of results, however, has received less attention despite being crucial in a variety of applications. Such truncation is a balancing act between the overall relevance, or usefulness of the results, with the user cost of processing more results. Result list truncation can be challenging because relevance scores are often not well-calibrated. This is particularly true in large-scale IR systems where documents and queries are embedded in the same metric space and a query's nearest document neighbors are returned during inference. Here, relevance is inversely proportional to the distance between the query and candidate document, but what distance constitutes relevance varies from query to query and changes dynamically as more documents are added to the index. In this work, we propose Surprise scoring, a statistical method that leverages the Generalized Pareto distribution that arises in extreme value theory to produce interpretable and calibrated relevance scores at query time using nothing more than the ranked scores. We demonstrate its effectiveness on the result list truncation task across image, text, and IR datasets and compare it to both classical and recent baselines. We draw connections to hypothesis testing and $p$-values.
Published: 2020

9. Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study

Author: Bahri, Dara, Tay, Yi, Zheng, Che, Metzler, Donald, Brunk, Cliff, and Tomkins, Andrew
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Large generative language models such as GPT-2 are well-known for their ability to generate text as well as their utility in supervised downstream tasks via fine-tuning. Our work is twofold: firstly we demonstrate via human evaluation that classifiers trained to discriminate between human and machine-generated text emerge as unsupervised predictors of "page quality", able to detect low quality content without any training. This enables fast bootstrapping of quality indicators in a low-resource setting. Secondly, curious to understand the prevalence and nature of low quality pages in the wild, we conduct extensive qualitative and quantitative analysis over 500 million web articles, making this the largest-scale study ever conducted on the topic.
Published: 2020

10. BusTr: Predicting Bus Travel Times from Real-Time Traffic

Author: Barnes, Richard, Buthpitiya, Senaka, Cook, James, Fabrikant, Alex, Tomkins, Andrew, and Xu, Fangzhou
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: We present BusTr, a machine-learned model for translating road traffic forecasts into predictions of bus delays, used by Google Maps to serve the majority of the world's public transit systems where no official real-time bus tracking is provided. We demonstrate that our neural sequence model improves over DeepTTE, the state-of-the-art baseline, both in performance (-30% MAPE) and training stability. We also demonstrate significant generalization gains over simpler models, evaluated on longitudinal data to cope with a constantly evolving world., Comment: 14 pages, 2 figures, 5 tables. Citation: "Richard Barnes, Senaka Buthpitiya, James Cook, Alex Fabrikant, Andrew Tomkins, Fangzhou Xu (2020). BusTr: Predicting Bus Travel Times from Real-Time Traffic. 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. doi: 10.1145/3394486.3403376"
Published: 2020
Full Text: View/download PDF

11. Choppy: Cut Transformer For Ranked List Truncation

Author: Bahri, Dara, Tay, Yi, Zheng, Che, Metzler, Donald, and Tomkins, Andrew
Subjects: Computer Science - Information Retrieval, Computer Science - Computation and Language, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Work in information retrieval has traditionally focused on ranking and relevance: given a query, return some number of results ordered by relevance to the user. However, the problem of determining how many results to return, i.e. how to optimally truncate the ranked result list, has received less attention despite being of critical importance in a range of applications. Such truncation is a balancing act between the overall relevance, or usefulness of the results, with the user cost of processing more results. In this work, we propose Choppy, an assumption-free model based on the widely successful Transformer architecture, to the ranked list truncation problem. Needing nothing more than the relevance scores of the results, the model uses a powerful multi-head attention mechanism to directly optimize any user-defined IR metric. We show Choppy improves upon recent state-of-the-art methods., Comment: SIGIR 2020
Published: 2020

12. Reverse Engineering Configurations of Neural Text Generation Models

Author: Tay, Yi, Bahri, Dara, Zheng, Che, Brunk, Clifford, Metzler, Donald, and Tomkins, Andrew
Subjects: Computer Science - Computation and Language, Computer Science - Information Retrieval, Computer Science - Machine Learning
Abstract: This paper seeks to develop a deeper understanding of the fundamental properties of neural text generations models. The study of artifacts that emerge in machine generated text as a result of modeling choices is a nascent research area. Previously, the extent and degree to which these artifacts surface in generated text has not been well studied. In the spirit of better understanding generative text models and their artifacts, we propose the new task of distinguishing which of several variants of a given model generated a piece of text, and we conduct an extensive suite of diagnostic tests to observe whether modeling choices (e.g., sampling methods, top-$k$ probabilities, model architectures, etc.) leave detectable artifacts in the text they generate. Our key finding, which is backed by a rigorous set of experiments, is that such artifacts are present and that different modeling choices can be inferred by observing the generated text alone. This suggests that neural text generators may be more sensitive to various modeling choices than previously thought., Comment: ACL 2020
Published: 2020

13. Preventing Adversarial Use of Datasets through Fair Core-Set Construction

Author: Spector, Benjamin, Kumar, Ravi, and Tomkins, Andrew
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: We propose improving the privacy properties of a dataset by publishing only a strategically chosen "core-set" of the data containing a subset of the instances. The core-set allows strong performance on primary tasks, but forces poor performance on unwanted tasks. We give methods for both linear models and neural networks and demonstrate their efficacy on data., Comment: 6 pages, 2 figures, NeurIPS 2019 Privacy In Machine Learning Workshop (PriML 2019)
Published: 2019

14. Graph-RISE: Graph-Regularized Image Semantic Embedding

Author: Juan, Da-Cheng, Lu, Chun-Ta, Li, Zhen, Peng, Futang, Timofeev, Aleksei, Chen, Yi-Ting, Gao, Yaxi, Duerig, Tom, Tomkins, Andrew, and Ravi, Sujith
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Learning image representations to capture fine-grained semantics has been a challenging and important task enabling many applications such as image search and clustering. In this paper, we present Graph-Regularized Image Semantic Embedding (Graph-RISE), a large-scale neural graph learning framework that allows us to train embeddings to discriminate an unprecedented O(40M) ultra-fine-grained semantic labels. Graph-RISE outperforms state-of-the-art image embedding algorithms on several evaluation tasks, including image classification and triplet ranking. We provide case studies to demonstrate that, qualitatively, image retrieval based on Graph-RISE effectively captures semantics and, compared to the state-of-the-art, differentiates nuances at levels that are closer to human-perception., Comment: 9 pages, 7 figures
Published: 2019

15. SCRank: Spammer and Celebrity Ranking in Directed Social Networks

Author: Fabrikant, Alex, Mahdian, Mohammad, and Tomkins, Andrew
Subjects: Computer Science - Social and Information Networks
Abstract: Many online social networks allow directed edges: Alice can unilaterally add an "edge" to Bob, typically indicating interest in Bob or Bob's content, without Bob's permission or reciprocation. In directed social networks we observe the rise of two distinctive classes of users: celebrities who accrue unreciprocated incoming links, and follow spammers, who generate unreciprocated outgoing links. Identifying users in these two classes is important for abuse detection, user and content ranking, privacy choices, and other social network features. In this paper we develop SCRank, an iterative algorithm to identify such users. We analyze SCRank both theoretically and experimentally. The spammer-celebrity definition is not amenable to analysis using standard power iteration, so we develop a novel potential function argument to show convergence to an approximate equilibrium point for a class of algorithms including SCRank. We then use experimental evaluation on a real global-scale social network and on synthetically generated graphs to observe that the algorithm converges quickly and consistently. Using synthetic data with built-in ground truth, we also experimentally show that the algorithm provides a good approximation to planted celebrities and spammers.
Published: 2018

16. Linear Additive Markov Processes

Author: Kumar, Ravi, Raghu, Maithra, Sarlos, Tamas, and Tomkins, Andrew
Subjects: Computer Science - Learning, Statistics - Machine Learning
Abstract: We introduce LAMP: the Linear Additive Markov Process. Transitions in LAMP may be influenced by states visited in the distant history of the process, but unlike higher-order Markov processes, LAMP retains an efficient parametrization. LAMP also allows the specific dependence on history to be learned efficiently from data. We characterize some theoretical properties of LAMP, including its steady-state and mixing time. We then give an algorithm based on alternating minimization to learn LAMP models from data. Finally, we perform a series of real-world experiments to show that LAMP is more powerful than first-order Markov processes, and even holds its own against deep sequential models (LSTMs) with a negligible increase in parameter complexity., Comment: Accepted to WWW 2017
Published: 2017

17. Single versus Double Blind Reviewing at WSDM 2017

Author: Tomkins, Andrew, Zhang, Min, and Heavlin, William D.
Subjects: Computer Science - Digital Libraries, Computer Science - Social and Information Networks
Abstract: In this paper we study the implications for conference program committees of using single-blind reviewing, in which committee members are aware of the names and affiliations of paper authors, versus double-blind reviewing, in which this information is not visible to committee members. WSDM 2017, the 10th ACM International ACM Conference on Web Search and Data Mining, performed a controlled experiment in which each paper was reviewed by four committee members. Two of these four reviewers were chosen from a pool of committee members who had access to author information; the other two were chosen from a disjoint pool who did not have access to this information. This information asymmetry persisted through the process of bidding for papers, reviewing papers, and entering scores. Reviewers in the single-blind condition typically bid for 22% fewer papers, and preferentially bid for papers from top institutions. Once papers were allocated to reviewers, single-blind reviewers were significantly more likely than their double-blind counterparts to recommend for acceptance papers from famous authors and top institutions. The estimated odds multipliers are 1.63 for famous authors and 1.58 and 2.10 for top universities and companies respectively, so the result is tangible. For female authors, the associated odds multiplier of 0.78 is not statistically significant in our study. However, a meta-analysis places this value in line with that of other experiments, and in the context of this larger aggregate the gender effect is also statistically significant.
Published: 2017

18. Smart Reply: Automated Response Suggestion for Email

Author: Kannan, Anjuli, Kurach, Karol, Ravi, Sujith, Kaufmann, Tobias, Tomkins, Andrew, Miklos, Balint, Corrado, Greg, Lukacs, Laszlo, Ganea, Marina, Young, Peter, and Ramavajjala, Vivek
Subjects: Computer Science - Computation and Language
Abstract: In this paper we propose and investigate a novel end-to-end method for automatically generating short email responses, called Smart Reply. It generates semantically diverse suggestions that can be used as complete email responses with just one tap on mobile. The system is currently used in Inbox by Gmail and is responsible for assisting with 10% of all mobile responses. It is designed to work at very high throughput and process hundreds of millions of messages daily. The system exploits state-of-the-art, large-scale deep learning. We describe the architecture of the system as well as the challenges that we faced while building it, like response diversity and scalability. We also introduce a new method for semantic clustering of user-generated content that requires only a modest amount of explicitly labeled data., Comment: Accepted to KDD 2016
Published: 2016

19. Your Two Weeks of Fame and Your Grandmother's

Author: Cook, James, Sarma, Atish Das, Fabrikant, Alex, and Tomkins, Andrew
Subjects: Computer Science - Digital Libraries, Computer Science - Computation and Language, Computer Science - Social and Information Networks, Physics - Physics and Society, J.4
Abstract: Did celebrity last longer in 1929, 1992 or 2009? We investigate the phenomenon of fame by mining a collection of news articles that spans the twentieth century, and also perform a side study on a collection of blog posts from the last 10 years. By analyzing mentions of personal names, we measure each person's time in the spotlight, using two simple metrics that evaluate, roughly, the duration of a single news story about a person, and the overall duration of public interest in a person. We watched the distribution evolve from 1895 to 2010, expecting to find significantly shortening fame durations, per the much popularly bemoaned shortening of society's attention spans and quickening of media's news cycles. Instead, we conclusively demonstrate that, through many decades of rapid technological and societal change, through the appearance of Twitter, communication satellites, and the Internet, fame durations did not decrease, neither for the typical case nor for the extremely famous, with the last statistically significant fame duration decreases coming in the early 20th century, perhaps from the spread of telegraphy and telephony. Furthermore, while median fame durations stayed persistently constant, for the most famous of the famous, as measured by either volume or duration of media attention, fame durations have actually trended gently upward since the 1940s, with statistically significant increases on 40-year timescales. Similar studies have been done with much shorter timescales specifically in the context of information spreading on Twitter and similar social networking sites. To the best of our knowledge, this is the first massive scale study of this nature that spans over a century of archived data, thereby allowing us to track changes across decades., Comment: This version supercedes the short version of this paper published in the proceedings of WWW 2012
Published: 2012

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

19 results on '"Tomkins, Andrew"'

1. Substance or Style: What Does Your Image Embedding Know?

2. Approximating a RUM from Distributions on k-Slates

3. Quantifying the sustainability impact of Google Maps: A case study of Salt Lake City

4. An Efficient Simulation-Based Travel Demand Calibration Algorithm for Large-Scale Metropolitan Traffic Models

5. CARLS: Cross-platform Asynchronous Representation Learning System

6. Graph Autoencoders with Deconvolutional Networks

7. Adversarial Robustness Across Representation Spaces

8. Surprise: Result List Truncation via Extreme Value Theory

9. Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study

10. BusTr: Predicting Bus Travel Times from Real-Time Traffic

11. Choppy: Cut Transformer For Ranked List Truncation

12. Reverse Engineering Configurations of Neural Text Generation Models

13. Preventing Adversarial Use of Datasets through Fair Core-Set Construction

14. Graph-RISE: Graph-Regularized Image Semantic Embedding

15. SCRank: Spammer and Celebrity Ranking in Directed Social Networks

16. Linear Additive Markov Processes

17. Single versus Double Blind Reviewing at WSDM 2017

18. Smart Reply: Automated Response Suggestion for Email

19. Your Two Weeks of Fame and Your Grandmother's

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Publication Type

Database

19 results on '"Tomkins, Andrew"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources