Search for author "Chang, Xiaojun" in the arXiv database: 130 results in total

Search Results

1. Dual Conditional Diffusion Models for Sequential Recommendation

2. ContextDet: Temporal Action Detection with Adaptive Context Aggregation

3. Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes

4. Flexiffusion: Segment-wise Neural Architecture Search for Flexible Denoising Schedule

5. Normalized ground state solutions of Schrödinger-KdV system in $\mathbb{R}^3$

6. Efficient Training of Large Vision Models via Advanced Automated Progressive Learning

7. Medical Report Generation Is A Multi-label Classification Problem

8. Normalized solutions of $L^2$-supercritical Kirchhoff equations in bounded domains

9. RealCustom++: Representing Images as Real-Word for Real-Time Customization

10. Disentangled Noisy Correspondence Learning

11. Contrastive Learning with Counterfactual Explanations for Radiology Report Generation

12. Label-anticipated Event Disentanglement for Audio-Visual Video Parsing

13. Teaching with Uncertainty: Unleashing the Potential of Knowledge Distillation in Object Detection

14. Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification

15. MLP Can Be A Good Transformer Learner

16. LongVLM: Efficient Long Video Understanding via Large Language Models

17. Self-Supervised Multi-Frame Neural Scene Flow

18. Unified Static and Dynamic Network: Efficient Temporal Filtering for Video Grounding

19. NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning

20. SWAP-NAS: Sample-Wise Activation Patterns for Ultra-fast NAS

21. DNA Family: Boosting Weight-Sharing NAS with Block-Wise Supervisions

22. MatchNAS: Optimizing Edge AI in Sparse-Label Data Contexts via Automating Deep Neural Network Porting for Mobile Deployment

23. Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation

24. Video Recognition in Portrait Mode

25. Shot2Story20K: A New Benchmark for Comprehensive Understanding of Multi-shot Videos

26. Generating Action-conditioned Prompts for Open-vocabulary Video Action Recognition

27. Disentangled Representation Learning with Transmitted Information Bottleneck

28. Mask Propagation for Efficient Video Semantic Segmentation

29. No Token Left Behind: Efficient Vision Transformer via Dynamic Token Idling

30. PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement

31. Normalized solutions for Sobolev critical Schrödinger-Bopp-Podolsky systems

32. ProAgent: Building Proactive Cooperative Agents with Large Language Models

33. SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation

34. FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration

35. Two-stream Multi-level Dynamic Point Transformer for Two-person Interaction Recognition

36. Convergence of least energy sign-changing solutions for logarithmic Schrödinger equations on locally finite graphs

37. Maximum Entropy Heterogeneous-Agent Reinforcement Learning

38. RealignDiff: Boosting Text-to-Image Diffusion Model with Coarse-to-fine Semantic Re-alignment

39. Act Like a Radiologist: Radiology Report Generation across Anatomical Regions

40. Toward the Automated Construction of Probabilistic Knowledge Graphs for the Maritime Domain

41. Existence and instability of standing waves for the biharmonic nonlinear Schroedinger equation with combined nonlinearities

42. Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining

43. A Benchmark for Cycling Close Pass Near Miss Event Detection from Video Streams

44. Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation

45. Guided Image-to-Image Translation by Discriminator-Generator Communication

46. ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency

47. Normalized solutions of $L^2$-supercritical NLS equations on noncompact metric graphs with localized nonlinearities

48. 3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation

49. Ground states for logarithmic Schrödinger equations on locally finite graphs

50. Simple Primitives with Feasibility- and Contextuality-Dependence for Open-World Compositional Zero-shot Learning
