Search

Your search for "Xiong, Wei" returned 17,176 results

Search Results

2. RRM: Robust Reward Model Training Mitigates Reward Hacking

3. From Lists to Emojis: How Format Bias Affects Model Alignment

4. Semantics Preserving Emoji Recommendation with Large Language Models

5. GroundingBooth: Grounding Text-to-Image Customization

6. Building Math Agents with Multi-Turn Iterative Preference Learning

7. When do molecular polaritons behave like optical filters?

8. WAS: Dataset and Methods for Artistic Text Segmentation

9. Suppression of quantum dissipation: A cooperative effect of quantum squeezing and quantum measurement

11. Quantum teleportation between a continuous-variable optical qumode and a discrete-variable solid-state qubit

12. Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts

13. Mechanical dynamics around higher-order exceptional point in magno-optomechanics

14. BeamVQ: Aligning Space-Time Forecasting Model via Self-training on Physics-aware Metrics

15. Atomic transport dynamics in crossed optical dipole trap

16. RLHF Workflow: From Reward Modeling to Online RLHF

17. Harnessing metastability for grain size control in multiprincipal element alloys during additive manufacturing

18. DPO Meets PPO: Reinforced Token Optimization for RLHF

19. Tunable Entanglement in Cavity-Magnon Optomechanics

20. SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing

21. IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation

22. Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization

23. Exploring global symmetry-breaking superradiant phase via phase competition

24. Coherent competition and control between three-wave mixing and four-wave mixing in superconducting circuits

25. Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards

26. Diffusion Model-Based Image Editing: A Survey

27. Giant enhancement of higher-order harmonics of an optical-tweezer phonon laser

28. Online Iterative Reinforcement Learning from Human Feedback with General Preference Model

29. Unitary and efficient spin squeezing in cavity optomechanics

30. Unraveling collisional energy loss of a heavy quark in quark-gluon plasma

31. Nonreciprocal Unconventional Photon Blockade with Kerr Magnons

45. Initial hemodynamic status and Acute Mortality in Cancer patients with Acute Pulmonary Embolism: from the COMMAND VTE Registry

46. PAFAH2 suppresses synchronized ferroptosis to ameliorate acute kidney injury

48. Multidimensional Widefield Infrared-Encoded Spontaneous Emission Microscopy: Distinguishing Chromophores by Ultrashort Infrared Pulses

49. The rotating solutions beyond the spontaneous scalarization in Einstein-Maxwell-scalar theory

50. Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint
