Search

Your search keyword '"Mishra, Mayank"' showing total 517 results

Search Constraints

Start Over You searched for: Author "Mishra, Mayank" Remove constraint Author: "Mishra, Mayank"
517 results on '"Mishra, Mayank"'

Search Results

2. Selective Self-Rehearsal: A Fine-Tuning Approach to Improve Generalization in Large Language Models

3. Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler

4. Scaling Granite Code Models to 128K Context

5. Enhancing Training Efficiency Using Packing with Flash Attention

6. The infrastructure powering IBM's Gen AI model development

7. Reducing Transformer Key-Value Cache Size with Cross-Layer Attention

8. Granite Code Models: A Family of Open Foundation Models for Code Intelligence

9. Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models

10. Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization

11. Direct visualization of local magnetic domain dynamics in a 2D Van der Walls material/ferromagnet interface

12. DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets

13. Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

14. StarCoder 2 and The Stack v2: The Next Generation

15. BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback

16. Dense plasma irradiated platinum with improved spin Hall effect

17. Prompting with Pseudo-Code Instructions

18. StarCoder: may the source be with you!

19. SantaCoder: don't reach for the stars!

20. Hybrid Materials for Energy Storage

21. Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data

22. Deep Learning Based Surface Crack Detection in Battledore of Darbhanga Fort

24. BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

25. Joint Reasoning on Hybrid-knowledge sources for Task-Oriented Dialog

30. A Closer Look at Smoothness in Domain Adversarial Training

32. Demonstration of Entanglement-Enhanced Covert Sensing

33. Variational Learning for Unsupervised Knowledge Grounded Dialogs

34. Accelerating Gradient-based Meta Learner

38. Lambda Numbers of Finite $p$-Groups

41. A Hybrid Feature Based Approach of Facial Images for the Detection of Autism Spectrum Disorder

42. Neural Network-Based Flow Curve Modeling of High-Nitrogen Austenitic Stainless Steel

43. Increasing distillable key rate from bound entangled states by using local filtration

46. Adversarial Approximate Inference for Speech to Electroglottograph Conversion

47. Variational Inference with Latent Space Quantization for Adversarial Resilience

50. Structural Damage Identification in GFRP Composite Plates Using TLBO Algorithm

Catalog

Books, media, physical & digital resources