Author: "Christopher Jacob" / Database: arXiv - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Christopher Jacob"' showing total 4 results

Start Over Author "Christopher Jacob" Database arXiv

Author: Christopher, Jacob K, Bartoldson, Brian R, Kailkhura, Bhavya, and Fioretto, Ferdinando
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Speculative decoding has emerged as a widely adopted method to accelerate large language model inference without sacrificing the quality of the model outputs. While this technique has facilitated notable speed improvements by enabling parallel sequence verification, its efficiency remains inherently limited by the reliance on incremental token generation in existing draft models. To overcome this limitation, this paper proposes an adaptation of speculative decoding which uses discrete diffusion models to generate draft sequences. This allows parallelization of both the drafting and verification steps, providing significant speed-ups to the inference process. Our proposed approach, Speculative Diffusion Decoding (SpecDiff), is validated on standard language generation benchmarks and empirically demonstrated to provide a up to 8.7x speed-up over standard generation processes and up to 2.5x speed-up over existing speculative decoding approaches.
Published: 2024

Author: Christopher, Jacob K, Baek, Stephen, and Fioretto, Ferdinando
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: This paper introduces an approach to endow generative diffusion processes the ability to satisfy and certify compliance with constraints and physical principles. The proposed method recast the traditional sampling process of generative diffusion models as a constrained optimization problem, steering the generated data distribution to remain within a specified region to ensure adherence to the given constraints. These capabilities are validated on applications featuring both convex and challenging, non-convex, constraints as well as ordinary differential equations, in domains spanning from synthesizing new materials with precise morphometric properties, generating physics-informed motion, optimizing paths in planning scenarios, and human motion synthesis., Comment: Published at the 38th Conference on Neural Information Processing Systems (NeurIPS 2024)
Published: 2024

Author: Kotary, James, Christopher, Jacob, Dinh, My H, and Fioretto, Ferdinando
Subjects: Computer Science - Machine Learning, Mathematics - Optimization and Control
Abstract: The integration of constrained optimization models as components in deep networks has led to promising advances on many specialized learning tasks. A central challenge in this setting is backpropagation through the solution of an optimization problem, which often lacks a closed form. One typical strategy is algorithm unrolling, which relies on automatic differentiation through the entire chain of operations executed by an iterative optimization solver. This paper provides theoretical insights into the backward pass of unrolled optimization, showing that it is asymptotically equivalent to the solution of a linear system by a particular iterative method. Several practical pitfalls of unrolling are demonstrated in light of these insights, and a system called Folded Optimization is proposed to construct more efficient backpropagation rules from unrolled solver implementations. Experiments over various end-to-end optimization and learning tasks demonstrate the advantages of this system both computationally, and in terms of flexibility over various optimization problem forms., Comment: arXiv admin note: text overlap with arXiv:2301.12047
Published: 2023

Author: Kotary, James, Di Vito, Vincenzo, Christopher, Jacob, Van Hentenryck, Pascal, and Fioretto, Ferdinando
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Many real-world decision processes are modeled by optimization problems whose defining parameters are unknown and must be inferred from observable data. The Predict-Then-Optimize framework uses machine learning models to predict unknown parameters of an optimization problem from features before solving. Recent works show that decision quality can be improved in this setting by solving and differentiating the optimization problem in the training loop, enabling end-to-end training with loss functions defined directly on the resulting decisions. However, this approach can be inefficient and requires handcrafted, problem-specific rules for backpropagation through the optimization step. This paper proposes an alternative method, in which optimal solutions are learned directly from the observable features by predictive models. The approach is generic, and based on an adaptation of the Learning-to-Optimize paradigm, from which a rich variety of existing techniques can be employed. Experimental evaluations show the ability of several Learning-to-Optimize methods to provide efficient, accurate, and flexible solutions to an array of challenging Predict-Then-Optimize problems.
Published: 2023

Books, media, physical & digital resources

Searchworks