Author: "Cox, David Daniel" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Cox, David Daniel"' showing total 17 results

1. Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following

Author: Ding, Mingyu, Xu, Yan, Chen, Zhenfang, Cox, David Daniel, Luo, Ping, Tenenbaum, Joshua B., and Gan, Chuang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Humans, even at a very early age, can learn visual concepts and understand geometry and layout through active interaction with the environment, and generalize their compositions to complete tasks described by natural languages in novel scenes. To mimic such capability, we propose Embodied Concept Learner (ECL) in an interactive 3D environment. Specifically, a robot agent can ground visual concepts, build semantic maps and plan actions to complete tasks by learning purely from human demonstrations and language instructions, without access to ground-truth semantic and depth supervisions from simulations. ECL consists of: (i) an instruction parser that translates the natural languages into executable programs; (ii) an embodied concept learner that grounds visual concepts based on language descriptions; (iii) a map constructor that estimates depth and constructs semantic maps by leveraging the learned concepts; and (iv) a program executor with deterministic policies to execute each program. ECL has several appealing benefits thanks to its modularized design. Firstly, it enables the robotic agent to learn semantics and depth unsupervisedly acting like babies, e.g., ground concepts through active interaction and perceive depth by disparities when moving forward. Secondly, ECL is fully transparent and step-by-step interpretable in long-term planning. Thirdly, ECL could be beneficial for the embodied instruction following (EIF), outperforming previous works on the ALFRED benchmark when the semantic label is not provided. Also, the learned concept can be reused for other downstream tasks, such as reasoning of object states. Project page: http://ecl.csail.mit.edu/
Comment: CoRL 2022
Published: 2023

2. Learning to Grow Pretrained Models for Efficient Transformer Training

Author: Wang, Peihao, Panda, Rameswar, Hennigen, Lucas Torroba, Greengard, Philip, Karlinsky, Leonid, Feris, Rogerio, Cox, David Daniel, Wang, Zhangyang, and Kim, Yoon
Subjects: Computer Science - Machine Learning
Abstract: Scaling transformers has led to significant breakthroughs in many domains, leading to a paradigm in which larger versions of existing models are trained and released on a periodic basis. New instances of such models are typically trained completely from scratch, despite the fact that they are often just scaled-up versions of their smaller counterparts. How can we use the implicit knowledge in the parameters of smaller, extant models to enable faster training of newer, larger models? This paper describes an approach for accelerating transformer training by learning to grow pretrained transformers, where we learn to linearly map the parameters of the smaller model to initialize the larger model. For tractable learning, we factorize the linear transformation as a composition of (linear) width- and depth-growth operators, and further employ a Kronecker factorization of these growth operators to encode architectural knowledge. Extensive experiments across both language and vision transformers demonstrate that our learned Linear Growth Operator (LiGO) can save up to 50% computational cost of training from scratch, while also consistently outperforming strong baselines that also reuse smaller pretrained models to initialize larger models.
Comment: International Conference on Learning Representations (ICLR), 2023
Published: 2023

3. Fine-tuning Deep Belief Networks using Harmony Search

Author: Papa, João Paulo, Scheirer, Walter, and Cox, David Daniel
Published: 2016
Full Text: View/download PDF

4. RNA-guided editing of bacterial genomes using CRISPR-Cas systems

Author: Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences, McGovern Institute for Brain Research at MIT, Cox, David Daniel, Zhang, Feng, Jiang, Wenyan, Bikard, David, and Marraffini, Luciano A.
Abstract: Here we use the clustered, regularly interspaced, short palindromic repeats (CRISPR)–associated Cas9 endonuclease complexed with dual-RNAs to introduce precise mutations in the genomes of Streptococcus pneumoniae and Escherichia coli. The approach relies on dual-RNA:Cas9-directed cleavage at the targeted genomic site to kill unmutated cells and circumvents the need for selectable markers or counter-selection systems. We reprogram dual-RNA:Cas9 specificity by changing the sequence of short CRISPR RNA (crRNA) to make single- and multinucleotide changes carried on editing templates. Simultaneous use of two crRNAs enables multiplex mutagenesis. In S. pneumoniae, nearly 100% of cells that were recovered using our approach contained the desired mutation, and in E. coli, 65% that were recovered contained the mutation, when the approach was used in combination with recombineering. We exhaustively analyze dual-RNA:Cas9 target requirements to define the range of targetable sequences and show strategies for editing sites that do not meet these requirements, suggesting the versatility of this technique for bacterial genome engineering., National Institutes of Health (U.S.) (NIH Director's Pioneer Award (DP1MH100706)), National Institutes of Health (U.S.) (NIH Director's New Innovator Award (DP2AI104556)), National Institutes of Health (U.S.) (NIH Transformative R01 grant)
Published: 2016

5. Multiplex Genome Engineering Using CRISPR/Cas Systems

Author: Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences, McGovern Institute for Brain Research at MIT, Cong, Le, Ran, F. Ann, Cox, David Daniel, Lin, Shuailiang, Habib, Naomi, Hsu, Patrick, Zhang, Feng, Barretto, Robert, Wu, Xuebing, Jiang, Wenyan, and Marraffini, Luciano A.
Abstract: Functional elucidation of causal genetic variants and elements requires precise genome editing technologies. The type II prokaryotic CRISPR (clustered regularly interspaced short palindromic repeats)/Cas adaptive immune system has been shown to facilitate RNA-guided site-specific DNA cleavage. We engineered two different type II CRISPR/Cas systems and demonstrate that Cas9 nucleases can be directed by short RNAs to induce precise cleavage at endogenous genomic loci in human and mouse cells. Cas9 can also be converted into a nicking enzyme to facilitate homology-directed repair with minimal mutagenic activity. Lastly, multiple guide sequences can be encoded into a single CRISPR array to enable simultaneous editing of several sites within the mammalian genome, demonstrating easy programmability and wide applicability of the RNA-guided nuclease technology., National Institutes of Health (U.S.) (NIH grant R01-GM34277), National Institutes of Health (U.S.) (NIH grant R01-CA133404), National Institutes of Health (U.S.) (NIH Director's New Innovator Award (DP2AI104556)), National Institutes of Health (U.S.) (NIH Director's Pioneer Award (DP1MH100706))
Published: 2016

6. On the Model Selection of Bernoulli Restricted Boltzmann Machines Through Harmony Search

Author: Papa, João Paulo, primary, Rosa, Gustavo Henrique, additional, Costa, Kelton A., additional, Marana, Nilceu A., additional, Scheirer, Walter, additional, and Cox, David Daniel, additional
Published: 2015
Full Text: View/download PDF

7. Neural Networks and Neuroscience-Inspired Computer Vision

Author: Cox, David Daniel, primary and Dean, Thomas, additional
Published: 2014
Full Text: View/download PDF

8. Do we understand high-level vision?

Author: Cox, David Daniel, primary
Published: 2014
Full Text: View/download PDF

9. Reverse engineering object recognition

Author: James J. DiCarlo, Massachusetts Institute of Technology. Dept. of Brain and Cognitive Sciences, and Cox, David Daniel
Abstract: Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Brain and Cognitive Sciences, 2007. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Page 95 blank. Includes bibliographical references (p. 83-94). Any given object in the world can cast an effectively infinite number of different images onto the retina, depending on its position relative to the viewer, the configuration of light sources, and the presence of other objects in the visual field. In spite of this, primates can robustly recognize a multitude of objects in a fraction of a second, with no apparent effort. The computational mechanisms underlying these amazing abilities are poorly understood. This thesis presents a collection of work from human psychophysics, monkey electrophysiology, and computational modelling in an effort to reverse-engineer the key computational components that enable this amazing ability in the primate visual system. By David Daniel Cox. Ph.D.
Published: 2008

10. Making a Science of Model Search: Hyperparameter Optimization in Hundreds of Dimensions for Vision Architectures

Author: Bergstra, J., Yamins, D., and Cox, David Daniel
Abstract: Many computer vision algorithms depend on configuration settings that are typically hand-tuned in the course of evaluating the algorithm for a particular data set. While such parameter tuning is often presented as being incidental to the algorithm, correctly setting these parameter choices is frequently critical to realizing a method’s full potential. Compounding matters, these parameters often must be re-tuned when the algorithm is applied to a new problem domain, and the tuning process itself often depends on personal experience and intuition in ways that are hard to quantify or describe. Since the performance of a given technique depends on both the fundamental quality of the algorithm and the details of its tuning, it is sometimes difficult to know whether a given technique is genuinely better, or simply better tuned. In this work, we propose a meta-modeling approach to support automated hyperparameter optimization, with the goal of providing practical tools that replace hand-tuning with a reproducible and unbiased optimization process. Our approach is to expose the underlying expression graph of how a performance metric (e.g. classification accuracy on validation examples) is computed from hyperparameters that govern not only how individual processing steps are applied, but even which processing steps are included. A hyperparameter optimization algorithm transforms this graph into a program for optimizing that performance metric. Our approach yields state of the art results on three disparate computer vision problems: a face-matching verification task (LFW), a face identification task (PubFig83) and an object recognition task (CIFAR-10), using a single broad class of feed-forward vision architectures., Engineering and Applied Sciences
Published: 2013

11. Hyperparameter Optimization and Boosting for Classifying Facial Expressions: How good can a 'Null' Model be?

Author: Bergstra, James and Cox, David Daniel
Abstract: One of the goals of the ICML workshop on representation and learning is to establish benchmark scores for a new data set of labeled facial expressions. This paper presents the performance of a "Null" model consisting of convolutions with random weights, PCA, pooling, normalization, and a linear readout. Our approach focused on hyperparameter optimization rather than novel model components. On the Facial Expression Recognition Challenge held by the Kaggle website, our hyperparameter optimization approach achieved a score of 60% accuracy on the test data. This paper also introduces a new ensemble construction variant that combines hyperparameter optimization with the construction of ensembles. This algorithm constructed an ensemble of four models that scored 65.5% accuracy. These scores rank 12th and 5th respectively among the 56 challenge participants. It is worth noting that our approach was developed prior to the release of the data set, and applied without modification; our strong competition performance suggests that the TPE hyperparameter optimization algorithm and domain expertise encoded in our Null model can generalize to new image classification data sets., Engineering and Applied Sciences
Published: 2013

12. Machine Learning for Predictive Auto-Tuning with Boosted Regression Trees

Author: Bergstra, James, Pinto, Nicolas, and Cox, David Daniel
Subjects: correlation, graphics processing unit, instruction sets, kernel, libraries, optimization, regression tree analysis
Abstract: The rapidly evolving landscape of multicore architectures makes the construction of efficient libraries a daunting task. A family of methods known collectively as “auto-tuning” has emerged to address this challenge. Two major approaches to auto-tuning are empirical and model-based: empirical autotuning is a generic but slow approach that works by measuring runtimes of candidate implementations, model-based auto-tuning predicts those runtimes using simplified abstractions designed by hand. We show that machine learning methods for non-linear regression can be used to estimate timing models from data, capturing the best of both approaches. A statistically-derived model offers the speed of a model-based approach, with the generality and simplicity of empirical auto-tuning. We validate our approach using the filterbank correlation kernel described in Pinto and Cox [2012], where we find that 0.1 seconds of hill climbing on the regression model (“predictive auto-tuning”) can achieve almost the same speed-up as is brought by minutes of empirical auto-tuning. Our approach is not specific to filterbank correlation, nor even to GPU kernel auto-tuning, and can be applied to almost any templated-code optimization problem, spanning a wide variety of problem types, kernel types, and platforms., Engineering and Applied Sciences, Molecular and Cellular Biology
Published: 2012
Full Text: View/download PDF

13. Perceptual Annotation: Measuring Human Vision to Improve Computer Vision

Author: Scheirer, Walter Jerome, Anthony, Samuel English, Nakayama, Ken, and Cox, David Daniel
Subjects: machine learning, citizen science, face detection, psychology, psychometrics, psychophysics, regularization, support vector machines, visual recognition
Abstract: For many problems in computer vision, human learners are considerably better than machines. Humans possess highly accurate internal recognition and learning mechanisms that are not yet understood, and they frequently have access to more extensive training data through a lifetime of unbiased experience with the visual world. We propose to use visual psychophysics to directly leverage the abilities of human subjects to build better machine learning systems. First, we use an advanced online psychometric testing platform to make new kinds of annotation data available for learning. Second, we develop a technique for harnessing these new kinds of information – “perceptual annotations” – for support vector machines. A key intuition for this approach is that while it may remain infeasible to dramatically increase the amount of data and high-quality labels available for the training of a given system, measuring the exemplar-by-exemplar difficulty and pattern of errors of human annotators can provide important information for regularizing the solution of the system at hand. A case study for the problem of face detection demonstrates that this approach yields state-of-the-art results on the challenging FDDB data set., Engineering and Applied Sciences, Molecular and Cellular Biology
Published: 2014
Full Text: View/download PDF

14. A High-Throughput Screening Approach to Discovering Good Forms of Biologically Inspired Visual Representation

Author: Doukhan, David, DiCarlo, James J., Pinto, Nicolas, and Cox, David Daniel
Subjects: neuroscience, sensory systems, computer science, natural and synthetic vision
Abstract: While many models of biological object recognition share a common set of “broad-stroke” properties, the performance of any one model depends strongly on the choice of parameters in a particular instantiation of that model—e.g., the number of units per layer, the size of pooling kernels, exponents in normalization operations, etc. Since the number of such parameters (explicit or implicit) is typically large and the computational cost of evaluating one particular parameter set is high, the space of possible model instantiations goes largely unexplored. Thus, when a model fails to approach the abilities of biological visual systems, we are left uncertain whether this failure is because we are missing a fundamental idea or because the correct “parts” have not been tuned correctly, assembled at sufficient scale, or provided with enough training. Here, we present a high-throughput approach to the exploration of such parameter sets, leveraging recent advances in stream processing hardware (high-end NVIDIA graphic cards and the PlayStation 3's IBM Cell Processor). In analogy to high-throughput screening approaches in molecular biology and genetics, we explored thousands of potential network architectures and parameter instantiations, screening those that show promising object recognition performance for further analysis. We show that this approach can yield significant, reproducible gains in performance across an array of basic object recognition tasks, consistently outperforming a variety of state-of-the-art purpose-built vision systems from the literature. As the scale of available computational power continues to expand, we argue that this approach has the potential to greatly accelerate progress in both artificial vision and our understanding of the computational underpinning of biological vision., Molecular and Cellular Biology
Published: 2009
Full Text: View/download PDF

15. Why is Real-World Visual Object Recognition Hard?

Author: DiCarlo, James J, Pinto, Nicolas, and Cox, David Daniel
Subjects: computational biology, neuroscience, homo (human)
Abstract: Progress in understanding the brain mechanisms underlying vision requires the construction of computational models that not only emulate the brain's anatomy and physiology, but ultimately match its performance on visual tasks. In recent years, “natural” images have become popular in the study of vision and have been used to show apparently impressive progress in building such models. Here, we challenge the use of uncontrolled “natural” images in guiding that progress. In particular, we show that a simple V1-like model—a neuroscientist's “null” model, which should perform poorly at real-world visual object recognition tasks—outperforms state-of-the-art object recognition systems (biologically inspired and otherwise) on a standard, ostensibly natural image recognition test. As a counterpoint, we designed a “simpler” recognition test to better span the real-world variation in object pose, position, and scale, and we show that this test correctly exposes the inadequacy of the V1-like model. Taken together, these results demonstrate that tests based on uncontrolled natural images can be seriously misleading, potentially guiding progress in the wrong direction. Instead, we reexamine what it means for images to be natural and argue for a renewed focus on the core problem of object recognition—real-world image variation., Molecular and Cellular Biology
Published: 2008
Full Text: View/download PDF

16. Multiplex Genome Engineering Using CRISPR/Cas Systems

Author: David M. Cox, Le Cong, Robert P. J. Barretto, Xuebing Wu, Feng Zhang, Luciano A. Marraffini, Shuailiang Lin, F. Ann Ran, Naomi Habib, Patrick D. Hsu, Wenyan Jiang, Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences, McGovern Institute for Brain Research at MIT, Cong, Le, Ran, F. Ann, Cox, David Daniel, Lin, Shuailiang, Habib, Naomi, Hsu, Patrick, and Zhang, Feng
Subjects: CRISPR/Cpf1, Streptococcus pyogenes, Molecular Sequence Data, Biology, Article, Mice, Genome editing, Animals, Humans, CRISPR, DNA Cleavage, Genetics, Trans-activating crRNA, CRISPR interference, Transcription activator-like effector nuclease, Genome, Multidisciplinary, Base Sequence, Cas9, Inverted Repeat Sequences, Recombinational DNA Repair, DNA, Microarray Analysis, Protospacer adjacent motif, Genetic Loci, Mutagenesis, RNA, CRISPR-Cas Systems, Genetic Engineering
Abstract: Functional elucidation of causal genetic variants and elements requires precise genome editing technologies. The type II prokaryotic CRISPR (clustered regularly interspaced short palindromic repeats)/Cas adaptive immune system has been shown to facilitate RNA-guided site-specific DNA cleavage. We engineered two different type II CRISPR/Cas systems and demonstrate that Cas9 nucleases can be directed by short RNAs to induce precise cleavage at endogenous genomic loci in human and mouse cells. Cas9 can also be converted into a nicking enzyme to facilitate homology-directed repair with minimal mutagenic activity. Lastly, multiple guide sequences can be encoded into a single CRISPR array to enable simultaneous editing of several sites within the mammalian genome, demonstrating easy programmability and wide applicability of the RNA-guided nuclease technology., National Institutes of Health (U.S.) (NIH grant R01-GM34277), National Institutes of Health (U.S.) (NIH grant R01-CA133404), National Institutes of Health (U.S.) (NIH Director's New Innovator Award (DP2AI104556)), National Institutes of Health (U.S.) (NIH Director's Pioneer Award (DP1MH100706))
Published: 2013

17. RNA-guided editing of bacterial genomes using CRISPR-Cas systems

Author: Jiang, Wenyan, Cox, David Daniel, Marraffini, Luciano A., Bikard, David, Zhang, Feng, Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences, and McGovern Institute for Brain Research at MIT
Subjects: Molecular Sequence Data, Biomedical Engineering, Bioengineering, Biology, Applied Microbiology and Biotechnology, Article, Recombineering, Genome engineering, 03 medical and health sciences, Genome editing, Escherichia coli, CRISPR, DNA Cleavage, 030304 developmental biology, Trans-activating crRNA, Genetics, 0303 health sciences, CRISPR interference, Base Sequence, 030306 microbiology, Cas9, Endonucleases, 3. Good health, Protospacer adjacent motif, Streptococcus pneumoniae, Mutagenesis, Site-Directed, Molecular Medicine, Genetic Engineering, Genome, Bacterial, RNA, Guide, Kinetoplastida, Biotechnology
Abstract: Here we use the clustered, regularly interspaced, short palindromic repeats (CRISPR)–associated Cas9 endonuclease complexed with dual-RNAs to introduce precise mutations in the genomes of Streptococcus pneumoniae and Escherichia coli. The approach relies on dual-RNA:Cas9-directed cleavage at the targeted genomic site to kill unmutated cells and circumvents the need for selectable markers or counter-selection systems. We reprogram dual-RNA:Cas9 specificity by changing the sequence of short CRISPR RNA (crRNA) to make single- and multinucleotide changes carried on editing templates. Simultaneous use of two crRNAs enables multiplex mutagenesis. In S. pneumoniae, nearly 100% of cells that were recovered using our approach contained the desired mutation, and in E. coli, 65% that were recovered contained the mutation, when the approach was used in combination with recombineering. We exhaustively analyze dual-RNA:Cas9 target requirements to define the range of targetable sequences and show strategies for editing sites that do not meet these requirements, suggesting the versatility of this technique for bacterial genome engineering., National Institutes of Health (U.S.) (NIH Director's Pioneer Award (DP1MH100706)), National Institutes of Health (U.S.) (NIH Director's New Innovator Award (DP2AI104556)), National Institutes of Health (U.S.) (NIH Transformative R01 grant)
Published: 2012
