Search

Your search keyword '"Chang, Xiaojun"' showing total 849 results

Search Constraints

Start Over You searched for: Author "Chang, Xiaojun" Remove constraint Author: "Chang, Xiaojun"
849 results on '"Chang, Xiaojun"'

Search Results

1. Medical Report Generation Is A Multi-label Classification Problem

2. Normalized solutions of $L^2$-supercritical Kirchhoff equations in bounded domains

3. RealCustom++: Representing Images as Real-Word for Real-Time Customization

4. Disentangled Noisy Correspondence Learning

5. Contrastive Learning with Counterfactual Explanations for Radiology Report Generation

6. Label-anticipated Event Disentanglement for Audio-Visual Video Parsing

7. Teaching with Uncertainty: Unleashing the Potential of Knowledge Distillation in Object Detection

8. Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification

9. MLP Can Be A Good Transformer Learner

10. LongVLM: Efficient Long Video Understanding via Large Language Models

11. Self-Supervised Multi-Frame Neural Scene Flow

12. Unified Static and Dynamic Network: Efficient Temporal Filtering for Video Grounding

13. NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning

14. SWAP-NAS: Sample-Wise Activation Patterns for Ultra-fast NAS

15. DNA Family: Boosting Weight-Sharing NAS with Block-Wise Supervisions

16. MatchNAS: Optimizing Edge AI in Sparse-Label Data Contexts via Automating Deep Neural Network Porting for Mobile Deployment

17. Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation

18. Video Recognition in Portrait Mode

19. Shot2Story20K: A New Benchmark for Comprehensive Understanding of Multi-shot Videos

20. Generating Action-conditioned Prompts for Open-vocabulary Video Action Recognition

21. Disentangled Representation Learning with Transmitted Information Bottleneck

22. Mask Propagation for Efficient Video Semantic Segmentation

23. No Token Left Behind: Efficient Vision Transformer via Dynamic Token Idling

26. PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement

27. Normalized solutions for Sobolev critical Schr\'odinger-Bopp-Podolsky systems

28. ProAgent: Building Proactive Cooperative Agents with Large Language Models

29. SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation

30. FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration

31. Two-stream Multi-level Dynamic Point Transformer for Two-person Interaction Recognition

32. Convergence of least energy sign-changing solutions for logarithmic Schr\'{o}dinger equations on locally finite graphs

33. Maximum Entropy Heterogeneous-Agent Reinforcement Learning

34. Toward the Automated Construction of Probabilistic Knowledge Graphs for the Maritime Domain

35. Existence and instability of standing waves for the biharmonic nonlinear Schroedinger equation with combined nonlinearities

36. Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining

37. A Benchmark for Cycling Close Pass Near Miss Event Detection from Video Streams

38. No Token Left Behind: Efficient Vision Transformer via Dynamic Token Idling

39. Origin and evolution of the triploid cultivated banana genome

40. Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation

41. Guided Image-to-Image Translation by Discriminator-Generator Communication

42. ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency

43. Normalized solutions of $L^2$-supercritical NLS equations on noncompact metric graphs with localized nonlinearities

44. 3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation

45. Ground states for logarithmic Schr\'{o}dinger equations on locally finite graphs

46. Simple Primitives with Feasibility- and Contextuality-Dependence for Open-World Compositional Zero-shot Learning

47. Bounded Palais-Smale sequences with Morse type information for some constrained functionals

48. Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers

49. PAR: Political Actor Representation Learning with Social Context and Expert Knowledge

50. ViLPAct: A Benchmark for Compositional Generalization on Multimodal Human Activities

Catalog

Books, media, physical & digital resources