Search

Your search keyword '"Jaggi A"' showing total 247 results

Search Constraints

Start Over You searched for: Author "Jaggi A" Remove constraint Author: "Jaggi A" Publication Type Reports Remove constraint Publication Type: Reports
247 results on '"Jaggi A"'

Search Results

1. Digital Twin Ecosystem for Oncology Clinical Operations

2. On-Device Collaborative Language Modeling via a Mixture of Generalists and Specialists

3. CoBo: Collaborative Learning via Bilevel Optimization

4. A New First-Order Meta-Learning Algorithm with Convergence Guarantees

5. Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants

6. Effective Interplay between Sparsity and Quantization: From Theory to Practice

7. Deep Grokking: Would Deep Neural Networks Generalize Better?

8. Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

9. The Privacy Power of Correlated Noise in Decentralized Learning

10. Personalized Collaborative Fine-Tuning for On-Device Large Language Models

11. QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs

12. Towards an empirical understanding of MoE design choices

13. Spectral Preconditioning for Gradient Methods on Graded Non-convex Functions

14. Attention with Markov: A Framework for Principled Analysis of Transformers via Markov Chains

15. InterpretCC: Intrinsic User-Centric Interpretability through Global Mixture of Experts

16. DenseFormer: Enhancing Information Flow in Transformers via Depth Weighted Averaging

17. Controllable Topic-Focused Abstractive Summarization

18. DoGE: Domain Reweighting with Generalization Estimation

19. Irreducible Curriculum for Language Model Pretraining

20. LASER: Linear Compression in Wireless Distributed Optimization

21. CoTFormer: A Chain-of-Thought Driven Architecture with Budget-Adaptive Computation Cost at Inference

22. MultiModN- Multimodal, Multi-Task, Interpretable Modular Networks

23. Layer-wise Linear Mode Connectivity

24. Provably Personalized and Robust Federated Learning

25. Faster Causal Attention Over Large Sequences Through Sparse Flash Attention

26. On Convergence of Incremental Gradient for Non-Convex Smooth Functions

27. Collaborative Learning via Prediction Consensus

28. Rotational Equilibrium: How Weight Decay Balances Learning Across Neural Networks

29. Ghost Noise for Regularizing Deep Neural Networks

30. Multiplication-Free Transformer Training via Piecewise Affine Operations

31. Landmark Attention: Random-Access Infinite Context Length for Transformers

32. Linearization Algorithms for Fully Composite Optimization

33. Unified Convergence Theory of Stochastic and Variance-Reduced Cubic Newton Methods

34. Learning Analytics from Spoken Discussion Dialogs in Flipped Classroom

35. Beyond spectral gap (extended): The role of the topology in decentralized learning

36. Second-order optimization with lazy Hessians

37. Scalable Collaborative Learning via Representation Sharing

38. Accuracy Booster: Enabling 4-bit Fixed-point Arithmetic for DNN Training

39. Modular Clinical Decision Support Networks (MoDN) -- Updatable, Interpretable, and Portable Predictions for Evolving Clinical Environments

40. FLamby: Datasets and Benchmarks for Cross-Silo Federated Learning in Realistic Healthcare Settings

41. Sharper Convergence Guarantees for Asynchronous SGD for Distributed and Federated Learning

42. Beyond spectral gap: The role of the topology in decentralized learning

43. Special Properties of Gradient Descent with Large Learning Rates

44. SKILL: Structured Knowledge Infusion for Large Language Models

45. The Capacity of Causal Adversarial Channels

46. Data-heterogeneity-aware Mixing for Decentralized Learning

47. Improving Generalization via Uncertainty Driven Perturbations

48. Agree to Disagree: Diversity through Disagreement for Better Transferability

49. Characterizing & Finding Good Data Orderings for Fast Convergence of Sequential Gradient Methods

50. Byzantine-Robust Decentralized Learning via ClippedGossip

Catalog

Books, media, physical & digital resources