Author: "Tung, Frederick" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Tung, Frederick"' showing total 176 results

Start Over Author "Tung, Frederick"

176 results on '"Tung, Frederick"'

1. Were RNNs All We Needed?

Author: Feng, Leo, Tung, Frederick, Ahmed, Mohamed Osama, Bengio, Yoshua, and Hajimirsadegh, Hossein
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: The scalability limitations of Transformers regarding sequence length have renewed interest in recurrent sequence models that are parallelizable during training. As a result, many novel recurrent architectures, such as S4, Mamba, and Aaren, have been proposed that achieve comparable performance. In this work, we revisit traditional recurrent neural networks (RNNs) from over a decade ago: LSTMs (1997) and GRUs (2014). While these models were slow due to requiring to backpropagate through time (BPTT), we show that by removing their hidden state dependencies from their input, forget, and update gates, LSTMs and GRUs no longer need to BPTT and can be efficiently trained in parallel. Building on this, we introduce minimal versions (minLSTMs and minGRUs) that (1) use significantly fewer parameters than their traditional counterparts and (2) are fully parallelizable during training (175x faster for a sequence of length 512). Lastly, we show that these stripped-down versions of decade-old RNNs match the empirical performance of recent sequence models.
Published: 2024

2. Forget Sharpness: Perturbed Forgetting of Model Biases Within SAM Dynamics

Author: Vani, Ankit, Tung, Frederick, Oliveira, Gabriel L., and Sharifi-Noghabi, Hossein
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Despite attaining high empirical generalization, the sharpness of models trained with sharpness-aware minimization (SAM) do not always correlate with generalization error. Instead of viewing SAM as minimizing sharpness to improve generalization, our paper considers a new perspective based on SAM's training dynamics. We propose that perturbations in SAM perform perturbed forgetting, where they discard undesirable model biases to exhibit learning signals that generalize better. We relate our notion of forgetting to the information bottleneck principle, use it to explain observations like the better generalization of smaller perturbation batches, and show that perturbed forgetting can exhibit a stronger correlation with generalization than flatness. While standard SAM targets model biases exposed by the steepest ascent directions, we propose a new perturbation that targets biases exposed through the model's outputs. Our output bias forgetting perturbations outperform standard SAM, GSAM, and ASAM on ImageNet, robustness benchmarks, and transfer to CIFAR-{10,100}, while sometimes converging to sharper regions. Our results suggest that the benefits of SAM can be explained by alternative mechanistic principles that do not require flatness of the loss surface., Comment: Published as a conference paper at ICML 2024. 9 pages main, 15 pages total including references and appendix
Published: 2024

3. Attention as an RNN

Author: Feng, Leo, Tung, Frederick, Hajimirsadeghi, Hossein, Ahmed, Mohamed Osama, Bengio, Yoshua, and Mori, Greg
Subjects: Computer Science - Machine Learning
Abstract: The advent of Transformers marked a significant breakthrough in sequence modelling, providing a highly performant architecture capable of leveraging GPU parallelism. However, Transformers are computationally expensive at inference time, limiting their applications, particularly in low-resource settings (e.g., mobile and embedded devices). Addressing this, we (1) begin by showing that attention can be viewed as a special Recurrent Neural Network (RNN) with the ability to compute its \textit{many-to-one} RNN output efficiently. We then (2) show that popular attention-based models such as Transformers can be viewed as RNN variants. However, unlike traditional RNNs (e.g., LSTMs), these models cannot be updated efficiently with new tokens, an important property in sequence modelling. Tackling this, we (3) introduce a new efficient method of computing attention's \textit{many-to-many} RNN output based on the parallel prefix scan algorithm. Building on the new attention formulation, we (4) introduce \textbf{Aaren}, an attention-based module that can not only (i) be trained in parallel (like Transformers) but also (ii) be updated efficiently with new tokens, requiring only constant memory for inferences (like traditional RNNs). Empirically, we show Aarens achieve comparable performance to Transformers on $38$ datasets spread across four popular sequential problem settings: reinforcement learning, event forecasting, time series classification, and time series forecasting tasks while being more time and memory-efficient.
Published: 2024

4. Pretext Training Algorithms for Event Sequence Data

Author: Wang, Yimu, Zhao, He, Deng, Ruizhi, Tung, Frederick, and Mori, Greg
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Pretext training followed by task-specific fine-tuning has been a successful approach in vision and language domains. This paper proposes a self-supervised pretext training framework tailored to event sequence data. We introduce a novel alignment verification task that is specialized to event sequences, building on good practices in masked reconstruction and contrastive learning. Our pretext tasks unlock foundational representations that are generalizable across different down-stream tasks, including next-event prediction for temporal point process models, event sequence classification, and missing event interpolation. Experiments on popular public benchmarks demonstrate the potential of the proposed method across different tasks and data domains.
Published: 2024

5. AdaFlood: Adaptive Flood Regularization

Author: Bae, Wonho, Ren, Yi, Ahmed, Mohamad Osama, Tung, Frederick, Sutherland, Danica J., and Oliveira, Gabriel L.
Subjects: Computer Science - Machine Learning
Abstract: Although neural networks are conventionally optimized towards zero training loss, it has been recently learned that targeting a non-zero training loss threshold, referred to as a flood level, often enables better test time generalization. Current approaches, however, apply the same constant flood level to all training samples, which inherently assumes all the samples have the same difficulty. We present AdaFlood, a novel flood regularization method that adapts the flood level of each training sample according to the difficulty of the sample. Intuitively, since training samples are not equal in difficulty, the target training loss should be conditioned on the instance. Experiments on datasets covering four diverse input modalities - text, images, asynchronous event sequences, and tabular - demonstrate the versatility of AdaFlood across data domains and noise levels.
Published: 2023

6. Prompting-based Temporal Domain Generalization

Author: Hosseini, Sepidehsadat, Zhai, Mengyao, Hajimirsadegh, Hossein, and Tung, Frederick
Subjects: Computer Science - Machine Learning
Abstract: Machine learning traditionally assumes that the training and testing data are distributed independently and identically. However, in many real-world settings, the data distribution can shift over time, leading to poor generalization of trained models in future time periods. This paper presents a novel prompting-based approach to temporal domain generalization that is parameter-efficient, time-efficient, and does not require access to future data during training. Our method adapts a trained model to temporal drift by learning global prompts, domain-specific prompts, and drift-aware prompts that capture underlying temporal dynamics. Experiments on classification, regression, and time series forecasting tasks demonstrate the generality of the proposed approach. The code repository will be publicly shared.
Published: 2023

7. Tree Cross Attention

Author: Feng, Leo, Tung, Frederick, Hajimirsadeghi, Hossein, Bengio, Yoshua, and Ahmed, Mohamed Osama
Subjects: Computer Science - Machine Learning
Abstract: Cross Attention is a popular method for retrieving information from a set of context tokens for making predictions. At inference time, for each prediction, Cross Attention scans the full set of $\mathcal{O}(N)$ tokens. In practice, however, often only a small subset of tokens are required for good performance. Methods such as Perceiver IO are cheap at inference as they distill the information to a smaller-sized set of latent tokens $L < N$ on which cross attention is then applied, resulting in only $\mathcal{O}(L)$ complexity. However, in practice, as the number of input tokens and the amount of information to distill increases, the number of latent tokens needed also increases significantly. In this work, we propose Tree Cross Attention (TCA) - a module based on Cross Attention that only retrieves information from a logarithmic $\mathcal{O}(\log(N))$ number of tokens for performing inference. TCA organizes the data in a tree structure and performs a tree search at inference time to retrieve the relevant tokens for prediction. Leveraging TCA, we introduce ReTreever, a flexible architecture for token-efficient inference. We show empirically that Tree Cross Attention (TCA) performs comparable to Cross Attention across various classification and uncertainty regression tasks while being significantly more token-efficient. Furthermore, we compare ReTreever against Perceiver IO, showing significant gains while using the same number of tokens for inference., Comment: Accepted by ICLR 2024
Published: 2023

8. Constant Memory Attention Block

Author: Feng, Leo, Tung, Frederick, Hajimirsadeghi, Hossein, Bengio, Yoshua, and Ahmed, Mohamed Osama
Subjects: Computer Science - Machine Learning
Abstract: Modern foundation model architectures rely on attention mechanisms to effectively capture context. However, these methods require linear or quadratic memory in terms of the number of inputs/datapoints, limiting their applicability in low-compute domains. In this work, we propose Constant Memory Attention Block (CMAB), a novel general-purpose attention block that computes its output in constant memory and performs updates in constant computation. Highlighting CMABs efficacy, we introduce methods for Neural Processes and Temporal Point Processes. Empirically, we show our proposed methods achieve results competitive with state-of-the-art while being significantly more memory efficient., Comment: Workshop version of arXiv:2305.14567
Published: 2023

9. DynaShare: Task and Instance Conditioned Parameter Sharing for Multi-Task Learning

Author: Rahimian, Elahe, Javadi, Golara, Tung, Frederick, and Oliveira, Gabriel
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Multi-task networks rely on effective parameter sharing to achieve robust generalization across tasks. In this paper, we present a novel parameter sharing method for multi-task learning that conditions parameter sharing on both the task and the intermediate feature representations at inference time. In contrast to traditional parameter sharing approaches, which fix or learn a deterministic sharing pattern during training and apply the same pattern to all examples during inference, we propose to dynamically decide which parts of the network to activate based on both the task and the input instance. Our approach learns a hierarchical gating policy consisting of a task-specific policy for coarse layer selection and gating units for individual input instances, which work together to determine the execution path at inference time. Experiments on the NYU v2, Cityscapes and MIMIC-III datasets demonstrate the potential of the proposed approach and its applicability across problem domains.
Published: 2023

10. Memory Efficient Neural Processes via Constant Memory Attention Block

Author: Feng, Leo, Tung, Frederick, Hajimirsadeghi, Hossein, Bengio, Yoshua, and Ahmed, Mohamed Osama
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Neural Processes (NPs) are popular meta-learning methods for efficiently modelling predictive uncertainty. Recent state-of-the-art methods, however, leverage expensive attention mechanisms, limiting their applications, particularly in low-resource settings. In this work, we propose Constant Memory Attentive Neural Processes (CMANPs), an NP variant that only requires constant memory. To do so, we first propose an efficient update operation for Cross Attention. Leveraging the update operation, we propose Constant Memory Attention Block (CMAB), a novel attention block that (i) is permutation invariant, (ii) computes its output in constant memory, and (iii) performs constant computation updates. Finally, building on CMAB, we detail Constant Memory Attentive Neural Processes. Empirically, we show CMANPs achieve state-of-the-art results on popular NP benchmarks while being significantly more memory efficient than prior methods.
Published: 2023

11. Ranking Regularization for Critical Rare Classes: Minimizing False Positives at a High True Positive Rate

Author: Kiarash, Mohammadi, He, Zhao, Zhai, Mengyao, and Tung, Frederick
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: In many real-world settings, the critical class is rare and a missed detection carries a disproportionately high cost. For example, tumors are rare and a false negative diagnosis could have severe consequences on treatment outcomes; fraudulent banking transactions are rare and an undetected occurrence could result in significant losses or legal penalties. In such contexts, systems are often operated at a high true positive rate, which may require tolerating high false positives. In this paper, we present a novel approach to address the challenge of minimizing false positives for systems that need to operate at a high true positive rate. We propose a ranking-based regularization (RankReg) approach that is easy to implement, and show empirically that it not only effectively reduces false positives, but also complements conventional imbalanced learning losses. With this novel technique in hand, we conduct a series of experiments on three broadly explored datasets (CIFAR-10&100 and Melanoma) and show that our approach lifts the previous state-of-the-art performance by notable margins.
Published: 2023

12. Meta Temporal Point Processes

Author: Bae, Wonho, Ahmed, Mohamed Osama, Tung, Frederick, and Oliveira, Gabriel L.
Subjects: Computer Science - Machine Learning
Abstract: A temporal point process (TPP) is a stochastic process where its realization is a sequence of discrete events in time. Recent work in TPPs model the process using a neural network in a supervised learning framework, where a training set is a collection of all the sequences. In this work, we propose to train TPPs in a meta learning framework, where each sequence is treated as a different task, via a novel framing of TPPs as neural processes (NPs). We introduce context sets to model TPPs as an instantiation of NPs. Motivated by attentive NP, we also introduce local history matching to help learn more informative features. We demonstrate the potential of the proposed method on popular public benchmark datasets and tasks, and compare with state-of-the-art TPP methods., Comment: Accepted to ICLR2023
Published: 2023

13. Gumbel-Softmax Selective Networks

Author: Salem, Mahmoud, Ahmed, Mohamed Osama, Tung, Frederick, and Oliveira, Gabriel
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: ML models often operate within the context of a larger system that can adapt its response when the ML model is uncertain, such as falling back on safe defaults or a human in the loop. This commonly encountered operational context calls for principled techniques for training ML models with the option to abstain from predicting when uncertain. Selective neural networks are trained with an integrated option to abstain, allowing them to learn to recognize and optimize for the subset of the data distribution for which confident predictions can be made. However, optimizing selective networks is challenging due to the non-differentiability of the binary selection function (the discrete decision of whether to predict or abstain). This paper presents a general method for training selective networks that leverages the Gumbel-softmax reparameterization trick to enable selection within an end-to-end differentiable training framework. Experiments on public datasets demonstrate the potential of Gumbel-softmax selective networks for selective regression and classification.
Published: 2022

14. RankSim: Ranking Similarity Regularization for Deep Imbalanced Regression

Author: Gong, Yu, Mori, Greg, and Tung, Frederick
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: Data imbalance, in which a plurality of the data samples come from a small proportion of labels, poses a challenge in training deep neural networks. Unlike classification, in regression the labels are continuous, potentially boundless, and form a natural ordering. These distinct features of regression call for new techniques that leverage the additional information encoded in label-space relationships. This paper presents the RankSim (ranking similarity) regularizer for deep imbalanced regression, which encodes an inductive bias that samples that are closer in label space should also be closer in feature space. In contrast to recent distribution smoothing based approaches, RankSim captures both nearby and distant relationships: for a given data sample, RankSim encourages the sorted list of its neighbors in label space to match the sorted list of its neighbors in feature space. RankSim is complementary to conventional imbalanced learning techniques, including re-weighting, two-stage training, and distribution smoothing, and lifts the state-of-the-art performance on three imbalanced regression benchmarks: IMDB-WIKI-DIR, AgeDB-DIR, and STS-B-DIR., Comment: Accepted to ICML 2022
Published: 2022

15. Heterogeneous Multi-task Learning with Expert Diversity

Author: Aoki, Raquel, Tung, Frederick, and Oliveira, Gabriel L.
Subjects: Computer Science - Machine Learning
Abstract: Predicting multiple heterogeneous biological and medical targets is a challenge for traditional deep learning models. In contrast to single-task learning, in which a separate model is trained for each target, multi-task learning (MTL) optimizes a single model to predict multiple related targets simultaneously. To address this challenge, we propose the Multi-gate Mixture-of-Experts with Exclusivity (MMoEEx). Our work aims to tackle the heterogeneous MTL setting, in which the same model optimizes multiple tasks with different characteristics. Such a scenario can overwhelm current MTL approaches due to the challenges in balancing shared and task-specific representations and the need to optimize tasks with competing optimization paths. Our method makes two key contributions: first, we introduce an approach to induce more diversity among experts, thus creating representations more suitable for highly imbalanced and heterogenous MTL learning; second, we adopt a two-step optimization [6, 11] approach to balancing the tasks at the gradient level. We validate our method on three MTL benchmark datasets, including Medical Information Mart for Intensive Care (MIMIC-III) and PubChem BioAssay (PCBA)., Comment: 10 pages, 7 figures, BIOKDD, IEEE/ACM
Published: 2021
Full Text: View/download PDF

16. Piggyback GAN: Efficient Lifelong Learning for Image Conditioned Generation

Author: Zhai, Mengyao, Chen, Lei, He, Jiawei, Nawhal, Megha, Tung, Frederick, and Mori, Greg
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Humans accumulate knowledge in a lifelong fashion. Modern deep neural networks, on the other hand, are susceptible to catastrophic forgetting: when adapted to perform new tasks, they often fail to preserve their performance on previously learned tasks. Given a sequence of tasks, a naive approach addressing catastrophic forgetting is to train a separate standalone model for each task, which scales the total number of parameters drastically without efficiently utilizing previous models. In contrast, we propose a parameter efficient framework, Piggyback GAN, which learns the current task by building a set of convolutional and deconvolutional filters that are factorized into filters of the models trained on previous tasks. For the current task, our model achieves high generation quality on par with a standalone model at a lower number of parameters. For previous tasks, our model can also preserve generation quality since the filters for previous tasks are not altered. We validate Piggyback GAN on various image-conditioned generation tasks across different domains, and provide qualitative and quantitative results to show that the proposed approach can address catastrophic forgetting effectively and efficiently., Comment: Accepted to ECCV 2020
Published: 2021

17. Learning Discriminative Prototypes with Dynamic Time Warping

Author: Chang, Xiaobin, Tung, Frederick, and Mori, Greg
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Dynamic Time Warping (DTW) is widely used for temporal data processing. However, existing methods can neither learn the discriminative prototypes of different classes nor exploit such prototypes for further analysis. We propose Discriminative Prototype DTW (DP-DTW), a novel method to learn class-specific discriminative prototypes for temporal recognition tasks. DP-DTW shows superior performance compared to conventional DTWs on time series classification benchmarks. Combined with end-to-end deep learning, DP-DTW can handle challenging weakly supervised action segmentation problems and achieves state of the art results on standard benchmarks. Moreover, detailed reasoning on the input video is enabled by the learned action prototypes. Specifically, an action-based video summarization can be obtained by aligning the input sequence with action prototypes., Comment: CVPR'21 preview, 10 pages, 8 figures
Published: 2021

18. Similarity-Preserving Knowledge Distillation

Author: Tung, Frederick and Mori, Greg
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Knowledge distillation is a widely applicable technique for training a student neural network under the guidance of a trained teacher network. For example, in neural network compression, a high-capacity teacher is distilled to train a compact student; in privileged learning, a teacher trained with privileged data is distilled to train a student without access to that data. The distillation loss determines how a teacher's knowledge is captured and transferred to the student. In this paper, we propose a new form of knowledge distillation loss that is inspired by the observation that semantically similar inputs tend to elicit similar activation patterns in a trained network. Similarity-preserving knowledge distillation guides the training of a student network such that input pairs that produce similar (dissimilar) activations in the teacher network produce similar (dissimilar) activations in the student network. In contrast to previous distillation methods, the student is not required to mimic the representation space of the teacher, but rather to preserve the pairwise similarities in its own representation space. Experiments on three public datasets demonstrate the potential of our approach., Comment: ICCV 2019 camera ready
Published: 2019

19. Autonomous Mobile Robot Navigation in Uneven and Unstructured Indoor Environments

Author: Wang, Chaoqun, Meng, Lili, She, Sizhen, Mitchell, Ian M., Li, Teng, Tung, Frederick, Wan, Weiwei, Meng, Max. Q. -H., and de Silva, Clarence W.
Subjects: Computer Science - Robotics
Abstract: Robots are increasingly operating in indoor environments designed for and shared with people. However, robots working safely and autonomously in uneven and unstructured environments still face great challenges. Many modern indoor environments are designed with wheelchair accessibility in mind. This presents an opportunity for wheeled robots to navigate through sloped areas while avoiding staircases. In this paper, we present an integrated software and hardware system for autonomous mobile robot navigation in uneven and unstructured indoor environments. This modular and reusable software framework incorporates capabilities of perception and navigation. Our robot first builds a 3D OctoMap representation for the uneven environment with the 3D mapping using wheel odometry, 2D laser and RGB-D data. Then we project multilayer 2D occupancy maps from OctoMap to generate the the traversable map based on layer differences. The safe traversable map serves as the input for efficient autonomous navigation. Furthermore, we employ a variable step size Rapidly Exploring Random Trees that could adjust the step size automatically, eliminating tuning step sizes according to environments. We conduct extensive experiments in simulation and real-world, demonstrating the efficacy and efficiency of our system., Comment: 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Published: 2017

20. Exploiting Points and Lines in Regression Forests for RGB-D Camera Relocalization

Author: Meng, Lili, Tung, Frederick, Little, James J., Valentin, Julien, and de Silva, Clarence
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Camera relocalization plays a vital role in many robotics and computer vision tasks, such as global localization, recovery from tracking failure and loop closure detection. Recent random forests based methods exploit randomly sampled pixel comparison features to predict 3D world locations for 2D image locations to guide the camera pose optimization. However, these image features are only sampled randomly in the images, without considering the spatial structures or geometric information, leading to large errors or failure cases with the existence of poorly textured areas or in motion blur. Line segment features are more robust in these environments. In this work, we propose to jointly exploit points and lines within the framework of uncertainty driven regression forests. The proposed approach is thoroughly evaluated on three publicly available datasets against several strong state-of-the-art baselines in terms of several different error metrics. Experimental results prove the efficacy of our method, showing superior or on-par state-of-the-art performance., Comment: published as a conference paper at 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Published: 2017

21. Multi-level Residual Networks from Dynamical Systems View

Author: Chang, Bo, Meng, Lili, Haber, Eldad, Tung, Frederick, and Begert, David
Subjects: Statistics - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Deep residual networks (ResNets) and their variants are widely used in many computer vision applications and natural language processing tasks. However, the theoretical principles for designing and training ResNets are still not fully understood. Recently, several points of view have emerged to try to interpret ResNet theoretically, such as unraveled view, unrolled iterative estimation and dynamical systems view. In this paper, we adopt the dynamical systems point of view, and analyze the lesioning properties of ResNet both theoretically and experimentally. Based on these analyses, we additionally propose a novel method for accelerating ResNet training. We apply the proposed method to train ResNets and Wide ResNets for three image classification benchmarks, reducing training time by more than 40% with superior or on-par accuracy., Comment: Published as a conference paper at ICLR 2018
Published: 2017

22. Backtracking Regression Forests for Accurate Camera Relocalization

Author: Meng, Lili, Chen, Jianhui, Tung, Frederick, Little, James J., Valentin, Julien, and de Silva, Clarence W.
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Camera relocalization plays a vital role in many robotics and computer vision tasks, such as global localization, recovery from tracking failure, and loop closure detection. Recent random forests based methods directly predict 3D world locations for 2D image locations to guide the camera pose optimization. During training, each tree greedily splits the samples to minimize the spatial variance. However, these greedy splits often produce uneven sub-trees in training or incorrect 2D-3D correspondences in testing. To address these problems, we propose a sample-balanced objective to encourage equal numbers of samples in the left and right sub-trees, and a novel backtracking scheme to remedy the incorrect 2D-3D correspondence predictions. Furthermore, we extend the regression forests based methods to use local features in both training and testing stages for outdoor RGB-only applications. Experimental results on publicly available indoor and outdoor datasets demonstrate the efficacy of our approach, which shows superior or on-par accuracy with several state-of-the-art methods., Comment: 8 pages. Appear in IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2017
Published: 2017

23. Fine-Pruning: Joint Fine-Tuning and Compression of a Convolutional Network with Bayesian Optimization

Author: Tung, Frederick, Muralidharan, Srikanth, and Mori, Greg
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: When approaching a novel visual recognition problem in a specialized image domain, a common strategy is to start with a pre-trained deep neural network and fine-tune it to the specialized domain. If the target domain covers a smaller visual space than the source domain used for pre-training (e.g. ImageNet), the fine-tuned network is likely to be over-parameterized. However, applying network pruning as a post-processing step to reduce the memory requirements has drawbacks: fine-tuning and pruning are performed independently; pruning parameters are set once and cannot adapt over time; and the highly parameterized nature of state-of-the-art pruning methods make it prohibitive to manually search the pruning parameter space for deep networks, leading to coarse approximations. We propose a principled method for jointly fine-tuning and compressing a pre-trained convolutional network that overcomes these limitations. Experiments on two specialized image domains (remote sensing images and describable textures) demonstrate the validity of the proposed approach., Comment: BMVC 2017 oral
Published: 2017

24. Learning Person Trajectory Representations for Team Activity Analysis

Author: Mehrasa, Nazanin, Zhong, Yatao, Tung, Frederick, Bornn, Luke, and Mori, Greg
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Activity analysis in which multiple people interact across a large space is challenging due to the interplay of individual actions and collective group dynamics. We propose an end-to-end approach for learning person trajectory representations for group activity analysis. The learned representations encode rich spatio-temporal dependencies and capture useful motion patterns for recognizing individual events, as well as characteristic group dynamics that can be used to identify groups from their trajectories alone. We develop our deep learning approach in the context of team sports, which provide well-defined sets of events (e.g. pass, shot) and groups of people (teams). Analysis of events and team formations using NHL hockey and NBA basketball datasets demonstrate the generality of our approach.
Published: 2017

25. Piggyback GAN: Efficient Lifelong Learning for Image Conditioned Generation

Author: Zhai, Mengyao, Chen, Lei, He, Jiawei, Nawhal, Megha, Tung, Frederick, Mori, Greg, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Vedaldi, Andrea, editor, Bischof, Horst, editor, Brox, Thomas, editor, and Frahm, Jan-Michael, editor
Published: 2020
Full Text: View/download PDF

26. Do Lenders Still Monitor? Leveraged Lending and the Search for Covenants.

Author: Tung, Frederick
Subjects: Corporate governance -- Laws, regulations and rules -- Research, Bank loans -- Contracts -- Laws, regulations and rules -- Research, Junk bonds -- Laws, regulations and rules -- Research, Leverage (Finance) -- Laws, regulations and rules -- Research, Deregulation -- Laws, regulations and rules -- Research, Collateralized loan obligations -- Laws, regulations and rules -- Research, Covenants -- Laws, regulations and rules -- Research, Government regulation, Contract agreement
Abstract: I. INTRODUCTION 155 II. TRADITIONAL BANK LENDING: COVENANTS AND MONITORING 159 A. The Structure of Bank Lending 159 B. Traditional Covenant Structure 161 1. Covenant Setting to Control Agency Costs [...], It was once conventional wisdom that lenders routinely influenced corporate managers' decision making. Covenants constrained borrower risk taking and compelled specific affirmative obligations to protect lenders. Recent policy discussion, however, laments loan markets' turn to various forms of high-risk lending. So-called leveraged loans-relatively risky, below-investment-grade loans--more than doubled in outstanding dollar terms, growing from about $550 billion in 2010 to $1.2 trillion by 2019. These risky loans have taken up a larger and larger share of the loan markets over time. More leveraged loans are also "covenant-lite," issued without traditional financial maintenance covenants. And regulators worry about "add-backs "--borrowers' growing practice of making upward adjustments to projected earnings that tend to weaken leverage constraints. Moreover, bank regulatory changes have incentivized "originate-to-distribute" loan syndications that enable non-bank lenders to hold and trade leveraged loans too risky for banks to keep. Syndicated lending now involves greater and greater participation by non-bank or "institutional" lenders like hedge funds, CLOs (collateralized loan obligations), and mutual funds. Commentators worry about the new species of risky loans, with their dearth of traditional covenants and the fewer instances of lender intervention, which may portend instability in debt markets. At the same time, weakened covenant protections may lead to weakened corporate governance. In this Article, I respond to these fears, arguing that they may be somewhat overblown. The increasing share of leveraged and covenant-lite loans may not necessarily evidence undisciplined debt issuance. Many seemingly troublesome loans are issued as subparts of deals that include loans with traditional covenants and cross-default provisions, which effectively constrain borrower behavior. Though add-backs typically increase firm leverage--which is worrisome--they may also improve the informativeness of earnings-based financial covenants. In addition, while the incidence of loan covenant violations has dropped dramatically across U.S. public firms, recent research suggests that covenants have become more efficient. In effect, covenants are doing more with less. Financial covenants have generally become less restrictive and more discriminating in differentiating distress from non-distress situations.
Published: 2021

27. Constraint-Aware Deep Neural Network Compression

Author: Chen, Changan, Tung, Frederick, Vedula, Naveen, Mori, Greg, Hutchison, David, Series Editor, Kanade, Takeo, Series Editor, Kittler, Josef, Series Editor, Kleinberg, Jon M., Series Editor, Mattern, Friedemann, Series Editor, Mitchell, John C., Series Editor, Naor, Moni, Series Editor, Pandu Rangan, C., Series Editor, Steffen, Bernhard, Series Editor, Terzopoulos, Demetri, Series Editor, Tygar, Doug, Series Editor, Weikum, Gerhard, Series Editor, Ferrari, Vittorio, editor, Hebert, Martial, editor, Sminchisescu, Cristian, editor, and Weiss, Yair, editor
Published: 2018
Full Text: View/download PDF

28. Piggyback GAN: Efficient Lifelong Learning for Image Conditioned Generation

Author: Zhai, Mengyao, primary, Chen, Lei, additional, He, Jiawei, additional, Nawhal, Megha, additional, Tung, Frederick, additional, and Mori, Greg, additional
Published: 2020
Full Text: View/download PDF

29. SSP: Supervised Sparse Projections for Large-Scale Retrieval in High Dimensions

Author: Tung, Frederick, Little, James J., Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Lai, Shang-Hong, editor, Lepetit, Vincent, editor, Nishino, Ko, editor, and Sato, Yoichi, editor
Published: 2017
Full Text: View/download PDF

30. Ranking Regularization for Critical Rare Classes: Minimizing False Positives at a High True Positive Rate

Author: Mohammadi, Kiarash, primary, Zhao, He, additional, Zhai, Mengyao, additional, and Tung, Frederick, additional
Published: 2023
Full Text: View/download PDF

31. DynaShare: Task and Instance Conditioned Parameter Sharing for Multi-Task Learning

Author: Rahimian, Elahe, primary, Javadi, Golara, additional, Tung, Frederick, additional, and Oliveira, Gabriel, additional
Published: 2023
Full Text: View/download PDF

32. Skepticism about Universalism: International Bankruptcy and International Relations

Author: Tung, Frederick
Abstract: There is no international bankruptcy law, but only the national bankruptcy laws of various states. The failure of a multinational firm therefore raises difficult questions of conflict and cooperation among national bankruptcy regimes. Theorists have proposed various reforms to the uncoordinated territorial approach that most states pursue when a multinational firm suffers financial distress. Among these reform proposals, universalism has long been the dominant idea. Under universalism, the bankruptcy regime of the debtor firm’s home country would govern, and that regime would have extraterritorial reach to treat all of the debtor’s assets and claimants worldwide.Despite its conceptual dominance, universalism has yet to find vindication in any concrete policy enactments. No universalist arrangements exist. While recent challenges to universalism have emerged, the current lively debate over universalism and rival proposals focuses almost exclusively on their comparative efficiencies. This article provides an entirely new perspective. Applying insights from elementary game theory and international relations theory, I show that universalism is politically implausible. Even for states interested in establishing universalist arrangements, they will be unable to do so. They will find themselves caught in a prisoners’ dilemma with no ready solution. I conclude therefore that universalism holds only dubious promise as a prescription for international bankruptcy cooperation.
Published: 2002

33. Skepticism about Universalism: International Bankruptcy and International Relations

Author: Tung, Frederick
Published: 2001

34. Scene parsing by nonparametric label transfer of content-adaptive windows

Author: Tung, Frederick and Little, James J.
Published: 2016
Full Text: View/download PDF

35. Law and Project Finance

Author: Subramanian, Krishnamurthy V. and Tung, Frederick
Published: 2016
Full Text: View/download PDF

36. CollageParsing: Nonparametric Scene Parsing by Adaptive Overlapping Windows

Author: Tung, Frederick, Little, James J., Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Kobsa, Alfred, Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Fleet, David, editor, Pajdla, Tomas, editor, Schiele, Bernt, editor, and Tuytelaars, Tinne, editor
Published: 2014
Full Text: View/download PDF

37. Constraint-Aware Deep Neural Network Compression

Author: Chen, Changan, primary, Tung, Frederick, additional, Vedula, Naveen, additional, and Mori, Greg, additional
Published: 2018
Full Text: View/download PDF

38. Improving scene attribute recognition using web-scale object detectors

Author: Tung, Frederick and Little, James J.
Published: 2015
Full Text: View/download PDF

39. Constant Memory Attentive Neural Processes

Author: Feng, Leo, Tung, Frederick, Hajimirsadeghi, Hossein, Bengio, Yoshua, and Ahmed, Mohamed Osama
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Machine Learning (cs.LG)
Abstract: Neural Processes (NPs) are efficient methods for estimating predictive uncertainties. NPs comprise of a conditioning phase where a context dataset is encoded, a querying phase where the model makes predictions using the context dataset encoding, and an updating phase where the model updates its encoding with newly received datapoints. However, state-of-the-art methods require additional memory which scales linearly or quadratically with the size of the dataset, limiting their applications, particularly in low-resource settings. In this work, we propose Constant Memory Attentive Neural Processes (CMANPs), an NP variant which only requires constant memory for the conditioning, querying, and updating phases. In building CMANPs, we propose Constant Memory Attention Block (CMAB), a novel general-purpose attention block that can compute its output in constant memory and perform updates in constant computation. Empirically, we show CMANPs achieve state-of-the-art results on meta-regression and image completion tasks while being (1) significantly more memory efficient than prior methods and (2) more scalable to harder settings.
Published: 2023
Full Text: View/download PDF

40. SSP: Supervised Sparse Projections for Large-Scale Retrieval in High Dimensions

Author: Tung, Frederick, primary and Little, James J., additional
Published: 2017
Full Text: View/download PDF

41. BREAKING BANKRUPTCY PRIORITY: HOW RENT-SEEKING UPENDS THE CREDITORS' BARGAIN

Author: Roe, Mark J. and Tung, Frederick
Published: 2013

42. Goal-based trajectory analysis for unusual behaviour detection in intelligent surveillance

Author: Tung, Frederick, Zelek, John S., and Clausi, David A.
Published: 2011
Full Text: View/download PDF

43. Heterogeneous Multi-task Learning with Expert Diversity

Author: Aoki, Raquel, primary, Tung, Frederick, additional, and Oliveira, Gabriel L., additional
Published: 2022
Full Text: View/download PDF

44. CollageParsing: Nonparametric Scene Parsing by Adaptive Overlapping Windows

Author: Tung, Frederick, primary and Little, James J., additional
Published: 2014
Full Text: View/download PDF

45. Learning Discriminative Prototypes with Dynamic Time Warping

Author: Chang, Xiaobin, primary, Tung, Frederick, additional, and Mori, Greg, additional
Published: 2021
Full Text: View/download PDF

46. Paying bank examiners for performance: should regulators receive bonuses for effectively guarding the public interest?

Author: Henderson, M. Todd and Tung, Frederick
Subjects: Banking industry -- Laws, regulations and rules, Bonuses -- Laws, regulations and rules, Government regulation, Banking industry, Business, Government, Law
Abstract: Few doubt that executive compensation arrangements encouraged the excessive risk taking by banks that led to the financial crisis of 2008. Accordingly, academics and lawmakers have called for the reform [...]
Published: 2012

47. Leverage in the board room: the unsung influence of private lenders in corporate governance.

Author: Tung, Frederick
Subjects: Corporate governance -- Research, Debtor and creditor -- Influence
Published: 2009

48. The great bailout of 2008-2009.

Author: Tung, Frederick
Subjects: Government lending -- Laws, regulations and rules, Mortgages -- Laws, regulations and rules, Subprime loans -- Laws, regulations and rules, Mortgages, Government regulation, Emergency Economic Stabilization Act of 2008
Published: 2009

49. The new death of contract: creeping corporate fiduciary duties for creditors.

Author: Tung, Frederick
Subjects: Fiduciary duties -- Laws, regulations and rules, Contracts -- Laws, regulations and rules, Debtor and creditor -- Laws, regulations and rules, Government regulation
Published: 2008

50. Do Lenders Still Monitor? Leveraged Lending and the Search for Covenants

Author: Tung, Frederick, primary
Published: 2021
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

176 results on '"Tung, Frederick"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources