397 results on '"Wei, Yichen"'
Search Results
2. Power-Domain Interference Graph Estimation for Multi-hop BLE Networks
- Author
-
Jia, Haifeng, Wei, Yichen, Pi, Yibo, and Chen, Cailian
- Subjects
Computer Science - Networking and Internet Architecture - Abstract
Traditional wisdom for network management allocates network resources separately for the measurement and communication tasks. Heavy measurement tasks may compete limited resources with communication tasks and significantly degrade overall network performance. It is therefore challenging for the interference graph, deemed as incurring heavy measurement overhead, to be used in practice in wireless networks. To address this challenge in wireless sensor networks, our core insight is to use power as a new dimension for interference graph estimation (IGE) such that IGE can be done simultaneously with the communication tasks using the same frequency-time resources. We propose to marry power-domain IGE with concurrent flooding to achieve simultaneous measurement and communication in BLE networks, where the power linearity prerequisite for power-domain IGE holds naturally true in concurrent flooding. With extensive experiments, we conclude the necessary conditions for the power linearity to hold and analyze several nonlinearity issues of power related to hardware imperfections. We design and implement network protocols and power control algorithms for IGE in multi-hop BLE networks and conduct experiments to show that the marriage is mutually beneficial for both IGE and concurrent flooding. Furthermore, we demonstrate the potential of IGE in improving channel map convergence and convergecast in BLE networks., Comment: This paper is accepted for publication in the ACM Transactions on Sensor Networks (TOSN), and is an extension of our conference paper accepted at EWSN'23 (arXiv:2312.16807)
- Published
- 2024
- Full Text
- View/download PDF
3. Evaluating Cognitive and Neuropsychological Assessments -- A Comprehensive Review
- Author
-
Li, Chuang, Lin, Rubing, Liu, Yantong, and Wei, Yichen
- Subjects
Quantitative Biology - Neurons and Cognition - Abstract
Cognitive impairments in older adults represent a significant public health concern, necessitating accurate diagnostic and monitoring strategies. In this study, the principal cognitive and neuropsychological evaluations employed for the diagnosis and longitudinal observation of cognitive deficits in the elderly are investigated. An analytical review of instruments including the Mini-Mental State Examination (MMSE), Digit Symbol Substitution Test (DSST), Montreal Cognitive Assessment (MoCA), and Trail Making Test (TMT) is conducted. This examination encompasses an assessment of each instrument's methodology, efficacy, advantages, and limitations. The objective is to enhance comprehension of these assessments for the early identification and effective management of conditions such as dementia and mild cognitive impairment, thereby contributing to the advancement of cognitive health within the geriatric population.
- Published
- 2024
4. Immunogenic cell death triggered by pathogen ligands via host germ line-encoded receptors
- Author
-
Li, Chuang, Wei, Yichen, Qin, Chao, Chen, Shifan, and Shao, Xiaolong
- Subjects
Quantitative Biology - Molecular Networks ,Quantitative Biology - Subcellular Processes - Abstract
The strategic induction of cell death serves as a crucial immune defense mechanism for the eradication of pathogenic infections within host cells. Investigating the molecular mechanisms underlying immunogenic cell pathways has significantly enhanced our understanding of the host's immunity. This review provides a comprehensive overview of the immunogenic cell death mechanisms triggered by pathogen infections, focusing on the critical role of pattern recognition receptors. In response to infections, host cells dictate a variety of cell death pathways, including apoptosis, pyroptosis, necrosis, and lysosomal cell death, which are essential for amplifying immune responses and controlling pathogen dissemination. Key components of these mechanisms are host cellular receptors that recognize pathogen-associated ligands. These receptors activate downstream signaling cascades, leading to the expression of immunoregulatory genes and the production of antimicrobial cytokines and chemokines. Particularly, the inflammasome, a multi-protein complex, plays a pivotal role in these responses by processing pro-inflammatory cytokines and inducing pyroptotic cell death. Pathogens, in turn, have evolved strategies to manipulate these cell death pathways, either by inhibiting them to facilitate their replication or by triggering them to evade host defenses. This dynamic interplay between host immune mechanisms and pathogen strategies highlights the intricate co-evolution of microbial virulence and host immunity., Comment: 30 pages, 3 figures
- Published
- 2024
5. NovelGym: A Flexible Ecosystem for Hybrid Planning and Learning Agents Designed for Open Worlds
- Author
-
Goel, Shivam, Wei, Yichen, Lymperopoulos, Panagiotis, Chura, Klara, Scheutz, Matthias, and Sinapov, Jivko
- Subjects
Computer Science - Artificial Intelligence - Abstract
As AI agents leave the lab and venture into the real world as autonomous vehicles, delivery robots, and cooking robots, it is increasingly necessary to design and comprehensively evaluate algorithms that tackle the ``open-world''. To this end, we introduce NovelGym, a flexible and adaptable ecosystem designed to simulate gridworld environments, serving as a robust platform for benchmarking reinforcement learning (RL) and hybrid planning and learning agents in open-world contexts. The modular architecture of NovelGym facilitates rapid creation and modification of task environments, including multi-agent scenarios, with multiple environment transformations, thus providing a dynamic testbed for researchers to develop open-world AI agents., Comment: Accepted at AAMAS-2024
- Published
- 2024
6. Efficient Interference Graph Estimation via Concurrent Flooding
- Author
-
Jia, Haifeng, Wei, Yichen, Wang, Zhan, Jin, Jiani, Li, Haorui, and Pi, Yibo
- Subjects
Computer Science - Networking and Internet Architecture ,Electrical Engineering and Systems Science - Systems and Control ,C.2 - Abstract
Traditional wisdom for network management allocates network resources separately for the measurement and data transmission tasks. Heavy measurement tasks may take up resources for data transmission and significantly reduce network performance. It is therefore challenging for interference graphs, deemed as incurring heavy measurement overhead, to be used in practice in wireless networks. To address this challenge in wireless sensor networks, we propose to use power as a new dimension for interference graph estimation (IGE) and integrate IGE with concurrent flooding such that IGE can be done simultaneously with flooding using the same frequency-time resources. With controlled and real-world experiments, we show that it is feasible to efficiently achieve IGE via concurrent flooding on the commercial off-the-shelf (COTS) devices by controlling the transmit powers of nodes. We believe that efficient IGE would be a key enabler for the practical use of the existing scheduling algorithms assuming known interference graphs., Comment: Accepted by International Conference on Embedded Wireless Systems and Networking 2023 (EWSN'23), 7 pages with 9 figures, equal contribution by Haifeng Jia and Yichen Wei
- Published
- 2023
7. Dynamic Causal Disentanglement Model for Dialogue Emotion Detection
- Author
-
Su, Yuting, Wei, Yichen, Nie, Weizhi, Zhao, Sicheng, and Liu, Anan
- Subjects
Computer Science - Computation and Language ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Emotion detection is a critical technology extensively employed in diverse fields. While the incorporation of commonsense knowledge has proven beneficial for existing emotion detection methods, dialogue-based emotion detection encounters numerous difficulties and challenges due to human agency and the variability of dialogue content.In dialogues, human emotions tend to accumulate in bursts. However, they are often implicitly expressed. This implies that many genuine emotions remain concealed within a plethora of unrelated words and dialogues.In this paper, we propose a Dynamic Causal Disentanglement Model based on hidden variable separation, which is founded on the separation of hidden variables. This model effectively decomposes the content of dialogues and investigates the temporal accumulation of emotions, thereby enabling more precise emotion recognition. First, we introduce a novel Causal Directed Acyclic Graph (DAG) to establish the correlation between hidden emotional information and other observed elements. Subsequently, our approach utilizes pre-extracted personal attributes and utterance topics as guiding factors for the distribution of hidden variables, aiming to separate irrelevant ones. Specifically, we propose a dynamic temporal disentanglement model to infer the propagation of utterances and hidden variables, enabling the accumulation of emotion-related information throughout the conversation. To guide this disentanglement process, we leverage the ChatGPT-4.0 and LSTM networks to extract utterance topics and personal attributes as observed information.Finally, we test our approach on two popular datasets in dialogue emotion detection and relevant experimental results verified the model's superiority.
- Published
- 2023
8. Fine-grained recognition of bitter gourd maturity based on Improved YOLOv5-seg model
- Author
-
Jiang, Sheng, Ao, Jiangbo, Yang, Hualin, Xie, Fangnan, Liu, Ziyi, Yang, Shanglin, Wei, Yichen, and Deng, Xijin
- Published
- 2024
- Full Text
- View/download PDF
9. Disrupted morphological brain network organization in subjective cognitive decline and mild cognitive impairment
- Author
-
Chen, Yuxin, Liang, Lingyan, Wei, Yichen, Liu, Ying, Li, Xiaocheng, Zhang, Zhiguo, Li, Linling, and Deng, Demao
- Published
- 2024
- Full Text
- View/download PDF
10. Altered regional homogeneity following moxibustion in mild cognitive impairment
- Author
-
Zhang, Qingping, Liang, Lingyan, Lai, Ziyan, Wei, Yichen, Duan, Gaoxiong, Lai, Yinqi, Liu, Peng, and Deng, Demao
- Published
- 2024
- Full Text
- View/download PDF
11. Category Query Learning for Human-Object Interaction Classification
- Author
-
Xie, Chi, Zeng, Fangao, Hu, Yue, Liang, Shuang, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Unlike most previous HOI methods that focus on learning better human-object features, we propose a novel and complementary approach called category query learning. Such queries are explicitly associated to interaction categories, converted to image specific category representation via a transformer decoder, and learnt via an auxiliary image-level classification task. This idea is motivated by an earlier multi-label image classification method, but is for the first time applied for the challenging human-object interaction classification task. Our method is simple, general and effective. It is validated on three representative HOI baselines and achieves new state-of-the-art results on two benchmarks., Comment: Accepted by CVPR 2023
- Published
- 2023
12. HC-MVSNet: A probability sampling-based multi-view-stereo network with hybrid cascade structure for 3D reconstruction
- Author
-
Gao, Tianxiang, Hong, Zijian, Tan, Yixing, Sun, Lizhuo, Wei, Yichen, and Ma, Jianwei
- Published
- 2024
- Full Text
- View/download PDF
13. Numerical modeling on high-temperature and high-pressure gas condensate recovery considering the viscosity variation and dynamic relative permeability
- Author
-
Shao, Lihua, Wei, Yichen, and Wang, Yuhe
- Published
- 2023
- Full Text
- View/download PDF
14. Ambipolar tribotronic transistor of MoTe2
- Author
-
Li, Yonghai, Yu, Jinran, Wei, Yichen, Wang, Yifei, Cheng, Liuqi, Feng, Zhenyu, Yang, Ya, Wang, Zhong Lin, and Sun, Qijun
- Published
- 2023
- Full Text
- View/download PDF
15. Mechano-driven logic-in-memory with neuromorphic triboelectric charge-trapping transistor
- Author
-
Wei, Yichen, Yu, Jinran, Li, Yonghai, Wang, Yifei, Huo, Ziwei, Cheng, Liuqi, Yue, Dewu, Zhang, Keteng, Gong, Jie, Wang, Jie, Wang, Zhong Lin, and Sun, Qijun
- Published
- 2024
- Full Text
- View/download PDF
16. Towards multi-views cloud retrieval accounting for the 3-D structure collected by directional polarization camera
- Author
-
Yu, Haixiao, Sun, Xiaobing, Tu, Bihai, Ti, Rufang, Ma, Jinji, Hong, Jin, Chen, Cheng, Liu, Xiao, Huang, Honglian, Wang, Zeling, Ahmad, Safura, Wang, Yi, Fan, Yizhe, Li, Yiqi, Wei, Yichen, Wang, Yuxuan, and Wang, Yuyao
- Published
- 2024
- Full Text
- View/download PDF
17. A neurosymbolic cognitive architecture framework for handling novelties in open worlds
- Author
-
Goel, Shivam, Lymperopoulos, Panagiotis, Thielstrom, Ravenna, Krause, Evan, Feeney, Patrick, Lorang, Pierrick, Schneider, Sarah, Wei, Yichen, Kildebeck, Eric, Goss, Stephen, Hughes, Michael C., Liu, Liping, Sinapov, Jivko, and Scheutz, Matthias
- Published
- 2024
- Full Text
- View/download PDF
18. SOLQ: Segmenting Objects by Learning Queries
- Author
-
Dong, Bin, Zeng, Fangao, Wang, Tiancai, Zhang, Xiangyu, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
In this paper, we propose an end-to-end framework for instance segmentation. Based on the recently introduced DETR [1], our method, termed SOLQ, segments objects by learning unified queries. In SOLQ, each query represents one object and has multiple representations: class, location and mask. The object queries learned perform classification, box regression and mask encoding simultaneously in an unified vector form. During training phase, the mask vectors encoded are supervised by the compression coding of raw spatial masks. In inference time, mask vectors produced can be directly transformed to spatial masks by the inverse process of compression coding. Experimental results show that SOLQ can achieve state-of-the-art performance, surpassing most of existing approaches. Moreover, the joint learning of unified query representation can greatly improve the detection performance of DETR. We hope our SOLQ can serve as a strong baseline for the Transformer-based instance segmentation. Code is available at https://github.com/megvii-research/SOLQ., Comment: Accepted by NeurlPS 2021. Code is available at https://github.com/megvii-research/SOLQ
- Published
- 2021
19. MOTR: End-to-End Multiple-Object Tracking with Transformer
- Author
-
Zeng, Fangao, Dong, Bin, Zhang, Yuang, Wang, Tiancai, Zhang, Xiangyu, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Temporal modeling of objects is a key challenge in multiple object tracking (MOT). Existing methods track by associating detections through motion-based and appearance-based similarity heuristics. The post-processing nature of association prevents end-to-end exploitation of temporal variations in video sequence. In this paper, we propose MOTR, which extends DETR and introduces track query to model the tracked instances in the entire video. Track query is transferred and updated frame-by-frame to perform iterative prediction over time. We propose tracklet-aware label assignment to train track queries and newborn object queries. We further propose temporal aggregation network and collective average loss to enhance temporal relation modeling. Experimental results on DanceTrack show that MOTR significantly outperforms state-of-the-art method, ByteTrack by 6.5% on HOTA metric. On MOT17, MOTR outperforms our concurrent works, TrackFormer and TransTrack, on association performance. MOTR can serve as a stronger baseline for future research on temporal modeling and Transformer-based trackers. Code is available at https://github.com/megvii-research/MOTR., Comment: Accepted by ECCV 2022. Code is available at https://github.com/megvii-research/MOTR
- Published
- 2021
20. Dynamic Metric Learning: Towards a Scalable Metric Space to Accommodate Multiple Semantic Scales
- Author
-
Sun, Yifan, Zhu, Yuke, Zhang, Yuhan, Zheng, Pengkun, Qiu, Xi, Zhang, Chi, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
This paper introduces a new fundamental characteristic, \ie, the dynamic range, from real-world metric tools to deep visual recognition. In metrology, the dynamic range is a basic quality of a metric tool, indicating its flexibility to accommodate various scales. Larger dynamic range offers higher flexibility. In visual recognition, the multiple scale problem also exist. Different visual concepts may have different semantic scales. For example, ``Animal'' and ``Plants'' have a large semantic scale while ``Elk'' has a much smaller one. Under a small semantic scale, two different elks may look quite \emph{different} to each other . However, under a large semantic scale (\eg, animals and plants), these two elks should be measured as being \emph{similar}. %We argue that such flexibility is also important for deep metric learning, because different visual concepts indeed correspond to different semantic scales. Introducing the dynamic range to deep metric learning, we get a novel computer vision task, \ie, the Dynamic Metric Learning. It aims to learn a scalable metric space to accommodate visual concepts across multiple semantic scales. Based on three types of images, \emph{i.e.}, vehicle, animal and online products, we construct three datasets for Dynamic Metric Learning. We benchmark these datasets with popular deep metric learning methods and find Dynamic Metric Learning to be very challenging. The major difficulty lies in a conflict between different scales: the discriminative ability under a small scale usually compromises the discriminative ability under a large one, and vice versa. As a minor contribution, we propose Cross-Scale Learning (CSL) to alleviate such conflict. We show that CSL consistently improves the baseline on all the three datasets. The datasets and the code will be publicly available at https://github.com/SupetZYK/DynamicMetricLearning., Comment: 8pages, accepted by CVPR 2021
- Published
- 2021
21. End-to-End Human Object Interaction Detection with HOI Transformer
- Author
-
Zou, Cheng, Wang, Bohan, Hu, Yue, Liu, Junqi, Wu, Qian, Zhao, Yu, Li, Boxun, Zhang, Chenguang, Zhang, Chi, Wei, Yichen, and Sun, Jian
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
We propose HOI Transformer to tackle human object interaction (HOI) detection in an end-to-end manner. Current approaches either decouple HOI task into separated stages of object detection and interaction classification or introduce surrogate interaction problem. In contrast, our method, named HOI Transformer, streamlines the HOI pipeline by eliminating the need for many hand-designed components. HOI Transformer reasons about the relations of objects and humans from global image context and directly predicts HOI instances in parallel. A quintuple matching loss is introduced to force HOI predictions in a unified way. Our method is conceptually much simpler and demonstrates improved accuracy. Without bells and whistles, HOI Transformer achieves $26.61\% $ $ AP $ on HICO-DET and $52.9\%$ $AP_{role}$ on V-COCO, surpassing previous methods with the advantage of being much simpler. We hope our approach will serve as a simple and effective alternative for HOI tasks. Code is available at https://github.com/bbepoch/HoiTransformer ., Comment: Accepted to CVPR2021
- Published
- 2021
22. Measuring congestion-induced performance imbalance in Internet load balancing at scale
- Author
-
Pi, Yibo, Jamin, Sugih, and Wei, Yichen
- Published
- 2024
- Full Text
- View/download PDF
23. Alteration of white matter microstructure in patients with sleep disorders after COVID-19 infection
- Author
-
Qin, Haixia, Duan, Gaoxiong, Zhou, Kaixuan, Qin, Lixia, Lai, Yinqi, Liu, Ying, Lu, Yian, Peng, Bei, Zhang, Yan, Zhou, Xiaoyan, Huang, Jiazhu, Huang, Jinli, Liang, Lingyan, Wei, Yichen, Zhang, Qingping, Li, Xiaocheng, OuYang, Yinfei, Bin, Bolin, Zhao, Mingming, Yang, Jianrong, and Deng, Demao
- Published
- 2024
- Full Text
- View/download PDF
24. Spherical Feature Transform for Deep Metric Learning
- Author
-
Zhu, Yuke, Bai, Yan, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Data augmentation in feature space is effective to increase data diversity. Previous methods assume that different classes have the same covariance in their feature distributions. Thus, feature transform between different classes is performed via translation. However, this approach is no longer valid for recent deep metric learning scenarios, where feature normalization is widely adopted and all features lie on a hypersphere. This work proposes a novel spherical feature transform approach. It relaxes the assumption of identical covariance between classes to an assumption of similar covariances of different classes on a hypersphere. Consequently, the feature transform is performed by a rotation that respects the spherical data distributions. We provide a simple and effective training method, and in depth analysis on the relation between the two different transforms. Comprehensive experiments on various deep metric learning benchmarks and different baselines verify that our method achieves consistent performance improvement and state-of-the-art results., Comment: ECCV2020
- Published
- 2020
25. Prime-Aware Adaptive Distillation
- Author
-
Zhang, Youcai, Lan, Zhonghao, Dai, Yuchen, Zeng, Fangao, Bai, Yan, Chang, Jie, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Knowledge distillation(KD) aims to improve the performance of a student network by mimicing the knowledge from a powerful teacher network. Existing methods focus on studying what knowledge should be transferred and treat all samples equally during training. This paper introduces the adaptive sample weighting to KD. We discover that previous effective hard mining methods are not appropriate for distillation. Furthermore, we propose Prime-Aware Adaptive Distillation (PAD) by the incorporation of uncertainty learning. PAD perceives the prime samples in distillation and then emphasizes their effect adaptively. PAD is fundamentally different from and would refine existing methods with the innovative view of unequal training. For this reason, PAD is versatile and has been applied in various tasks including classification, metric learning, and object detection. With ten teacher-student combinations on six datasets, PAD promotes the performance of existing distillation methods and outperforms recent state-of-the-art methods., Comment: Accepted by ECCV 2020
- Published
- 2020
26. Joint Multi-Dimension Pruning via Numerical Gradient Update
- Author
-
Liu, Zechun, Zhang, Xiangyu, Shen, Zhiqiang, Li, Zhe, Wei, Yichen, Cheng, Kwang-Ting, and Sun, Jian
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning ,Electrical Engineering and Systems Science - Image and Video Processing - Abstract
We present joint multi-dimension pruning (abbreviated as JointPruning), an effective method of pruning a network on three crucial aspects: spatial, depth and channel simultaneously. To tackle these three naturally different dimensions, we proposed a general framework by defining pruning as seeking the best pruning vector (i.e., the numerical value of layer-wise channel number, spacial size, depth) and construct a unique mapping from the pruning vector to the pruned network structures. Then we optimize the pruning vector with gradient update and model joint pruning as a numerical gradient optimization process. To overcome the challenge that there is no explicit function between the loss and the pruning vectors, we proposed self-adapted stochastic gradient estimation to construct a gradient path through network loss to pruning vectors and enable efficient gradient update. We show that the joint strategy discovers a better status than previous studies that focused on individual dimensions solely, as our method is optimized collaboratively across the three dimensions in a single end-to-end training and it is more efficient than the previous exhaustive methods. Extensive experiments on large-scale ImageNet dataset across a variety of network architectures MobileNet V1&V2&V3 and ResNet demonstrate the effectiveness of our proposed method. For instance, we achieve significant margins of 2.5% and 2.6% improvement over the state-of-the-art approach on the already compact MobileNet V1&V2 under an extremely large compression ratio., Comment: Accepted to IEEE Transactions on Image Processing (TIP) 2021
- Published
- 2020
- Full Text
- View/download PDF
27. Angle-based Search Space Shrinking for Neural Architecture Search
- Author
-
Hu, Yiming, Liang, Yuding, Guo, Zichao, Wan, Ruosi, Zhang, Xiangyu, Wei, Yichen, Gu, Qingyi, and Sun, Jian
- Subjects
Computer Science - Neural and Evolutionary Computing ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
In this work, we present a simple and general search space shrinking method, called Angle-Based search space Shrinking (ABS), for Neural Architecture Search (NAS). Our approach progressively simplifies the original search space by dropping unpromising candidates, thus can reduce difficulties for existing NAS methods to find superior architectures. In particular, we propose an angle-based metric to guide the shrinking process. We provide comprehensive evidences showing that, in weight-sharing supernet, the proposed metric is more stable and accurate than accuracy-based and magnitude-based metrics to predict the capability of child models. We also show that the angle-based metric can converge fast while training supernet, enabling us to get promising shrunk search spaces efficiently. ABS can easily apply to most of NAS approaches (e.g. SPOS, FairNAS, ProxylessNAS, DARTS and PDARTS). Comprehensive experiments show that ABS can dramatically enhance existing NAS approaches by providing a promising shrunk search space., Comment: Accepted in ECCV 2020
- Published
- 2020
28. Data Uncertainty Learning in Face Recognition
- Author
-
Chang, Jie, Lan, Zhonghao, Cheng, Changmao, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Modeling data uncertainty is important for noisy images, but seldom explored for face recognition. The pioneer work, PFE, considers uncertainty by modeling each face image embedding as a Gaussian distribution. It is quite effective. However, it uses fixed feature (mean of the Gaussian) from an existing model. It only estimates the variance and relies on an ad-hoc and costly metric. Thus, it is not easy to use. It is unclear how uncertainty affects feature learning. This work applies data uncertainty learning to face recognition, such that the feature (mean) and uncertainty (variance) are learnt simultaneously, for the first time. Two learning methods are proposed. They are easy to use and outperform existing deterministic methods as well as PFE on challenging unconstrained scenarios. We also provide insightful analysis on how incorporating uncertainty estimation helps reducing the adverse effects of noisy samples and affects the feature learning., Comment: Accepted as poster by CVPR2020
- Published
- 2020
29. Balanced Alignment for Face Recognition: A Joint Learning Approach
- Author
-
Wei, Huawei, Lu, Peng, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Face alignment is crucial for face recognition and has been widely adopted. However, current practice is too simple and under-explored. There lacks an understanding of how important face alignment is and how it should be performed, for recognition. This work studies these problems and makes two contributions. First, it provides an in-depth and quantitative study of how alignment strength affects recognition accuracy. Our results show that excessive alignment is harmful and an optimal balanced point of alignment is in need. To strike the balance, our second contribution is a novel joint learning approach where alignment learning is controllable with respect to its strength and driven by recognition. Our proposed method is validated by comprehensive experiments on several benchmarks, especially the challenging ones with large pose., Comment: 17 pages, 9 figures
- Published
- 2020
30. Circle Loss: A Unified Perspective of Pair Similarity Optimization
- Author
-
Sun, Yifan, Cheng, Changmao, Zhang, Yuhan, Zhang, Chi, Zheng, Liang, Wang, Zhongdao, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
This paper provides a pair similarity optimization viewpoint on deep feature learning, aiming to maximize the within-class similarity $s_p$ and minimize the between-class similarity $s_n$. We find a majority of loss functions, including the triplet loss and the softmax plus cross-entropy loss, embed $s_n$ and $s_p$ into similarity pairs and seek to reduce $(s_n-s_p)$. Such an optimization manner is inflexible, because the penalty strength on every single similarity score is restricted to be equal. Our intuition is that if a similarity score deviates far from the optimum, it should be emphasized. To this end, we simply re-weight each similarity to highlight the less-optimized similarity scores. It results in a Circle loss, which is named due to its circular decision boundary. The Circle loss has a unified formula for two elemental deep feature learning approaches, i.e. learning with class-level labels and pair-wise labels. Analytically, we show that the Circle loss offers a more flexible optimization approach towards a more definite convergence target, compared with the loss functions optimizing $(s_n-s_p)$. Experimentally, we demonstrate the superiority of the Circle loss on a variety of deep feature learning tasks. On face recognition, person re-identification, as well as several fine-grained image retrieval datasets, the achieved performance is on par with the state of the art.
- Published
- 2020
31. Towards Stabilizing Batch Statistics in Backward Propagation of Batch Normalization
- Author
-
Yan, Junjie, Wan, Ruosi, Zhang, Xiangyu, Zhang, Wei, Wei, Yichen, and Sun, Jian
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
Batch Normalization (BN) is one of the most widely used techniques in Deep Learning field. But its performance can awfully degrade with insufficient batch size. This weakness limits the usage of BN on many computer vision tasks like detection or segmentation, where batch size is usually small due to the constraint of memory consumption. Therefore many modified normalization techniques have been proposed, which either fail to restore the performance of BN completely, or have to introduce additional nonlinear operations in inference procedure and increase huge consumption. In this paper, we reveal that there are two extra batch statistics involved in backward propagation of BN, on which has never been well discussed before. The extra batch statistics associated with gradients also can severely affect the training of deep neural network. Based on our analysis, we propose a novel normalization method, named Moving Average Batch Normalization (MABN). MABN can completely restore the performance of vanilla BN in small batch cases, without introducing any additional nonlinear operations in inference procedure. We prove the benefits of MABN by both theoretical analysis and experiments. Our experiments demonstrate the effectiveness of MABN in multiple computer vision tasks including ImageNet and COCO. The code has been released in https://github.com/megvii-model/MABN., Comment: ICLR2020; https://github.com/megvii-model/MABN
- Published
- 2020
32. Hier-SFL: Client-edge-cloud collaborative traffic classification framework based on hierarchical federated split learning
- Author
-
Qin, Tian, Cheng, Guang, Wei, Yichen, and Yao, Zifan
- Published
- 2023
- Full Text
- View/download PDF
33. Potential use of nano calcium carbonate in polypropylene fiber reinforced recycled aggregate concrete: Microstructures and properties evaluation
- Author
-
Chen, Xuyong, Ai, Yangzheng, Wu, Qiaoyun, Cheng, Shukai, Wei, Yichen, Xu, Xiong, and Fan, Tao
- Published
- 2023
- Full Text
- View/download PDF
34. Vehicle Re-identification with Viewpoint-aware Metric Learning
- Author
-
Chu, Ruihang, Sun, Yifan, Li, Yadong, Liu, Zheng, Zhang, Chi, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
This paper considers vehicle re-identification (re-ID) problem. The extreme viewpoint variation (up to 180 degrees) poses great challenges for existing approaches. Inspired by the behavior in human's recognition process, we propose a novel viewpoint-aware metric learning approach. It learns two metrics for similar viewpoints and different viewpoints in two feature spaces, respectively, giving rise to viewpoint-aware network (VANet). During training, two types of constraints are applied jointly. During inference, viewpoint is firstly estimated and the corresponding metric is used. Experimental results confirm that VANet significantly improves re-ID accuracy, especially when the pair is observed from different viewpoints. Our method establishes the new state-of-the-art on two benchmarks., Comment: Accepted by ICCV 2019
- Published
- 2019
35. 3D Dense Face Alignment via Graph Convolution Networks
- Author
-
Wei, Huawei, Liang, Shuang, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Recently, 3D face reconstruction and face alignment tasks are gradually combined into one task: 3D dense face alignment. Its goal is to reconstruct the 3D geometric structure of face with pose information. In this paper, we propose a graph convolution network to regress 3D face coordinates. Our method directly performs feature learning on the 3D face mesh, where the geometric structure and details are well preserved. Extensive experiments show that our approach gains superior performance over state-of-the-art methods on several challenging datasets.
- Published
- 2019
36. Re-Identification Supervised Texture Generation
- Author
-
Wang, Jian, Zhong, Yunshan, Li, Yachun, Zhang, Chi, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
The estimation of 3D human body pose and shape from a single image has been extensively studied in recent years. However, the texture generation problem has not been fully discussed. In this paper, we propose an end-to-end learning strategy to generate textures of human bodies under the supervision of person re-identification. We render the synthetic images with textures extracted from the inputs and maximize the similarity between the rendered and input images by using the re-identification network as the perceptual metrics. Experiment results on pedestrian images show that our model can generate the texture from a single image and demonstrate that our textures are of higher quality than those generated by other available methods. Furthermore, we extend the application scope to other categories and explore the possible utilization of our generated textures., Comment: Accepted by CVPR2019
- Published
- 2019
37. Single Path One-Shot Neural Architecture Search with Uniform Sampling
- Author
-
Guo, Zichao, Zhang, Xiangyu, Mu, Haoyuan, Heng, Wen, Liu, Zechun, Wei, Yichen, and Sun, Jian
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
We revisit the one-shot Neural Architecture Search (NAS) paradigm and analyze its advantages over existing NAS approaches. Existing one-shot method, however, is hard to train and not yet effective on large scale datasets like ImageNet. This work propose a Single Path One-Shot model to address the challenge in the training. Our central idea is to construct a simplified supernet, where all architectures are single paths so that weight co-adaption problem is alleviated. Training is performed by uniform path sampling. All architectures (and their weights) are trained fully and equally. Comprehensive experiments verify that our approach is flexible and effective. It is easy to train and fast to search. It effortlessly supports complex search spaces (e.g., building blocks, channel, mixed-precision quantization) and different search constraints (e.g., FLOPs, latency). It is thus convenient to use for various needs. It achieves start-of-the-art performance on the large dataset ImageNet., Comment: ECCV 2020
- Published
- 2019
38. Rethinking on Multi-Stage Networks for Human Pose Estimation
- Author
-
Li, Wenbo, Wang, Zhicheng, Yin, Binyi, Peng, Qixiang, Du, Yuming, Xiao, Tianzi, Yu, Gang, Lu, Hongtao, Wei, Yichen, and Sun, Jian
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Existing pose estimation approaches fall into two categories: single-stage and multi-stage methods. While multi-stage methods are seemingly more suited for the task, their performance in current practice is not as good as single-stage methods. This work studies this issue. We argue that the current multi-stage methods' unsatisfactory performance comes from the insufficiency in various design choices. We propose several improvements, including the single-stage module design, cross stage feature aggregation, and coarse-to-fine supervision. The resulting method establishes the new state-of-the-art on both MS COCO and MPII Human Pose dataset, justifying the effectiveness of a multi-stage architecture. The source code is publicly available for further research.
- Published
- 2019
39. MOTR: End-to-End Multiple-Object Tracking with Transformer
- Author
-
Zeng, Fangao, Dong, Bin, Zhang, Yuang, Wang, Tiancai, Zhang, Xiangyu, Wei, Yichen, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Avidan, Shai, editor, Brostow, Gabriel, editor, Cissé, Moustapha, editor, Farinella, Giovanni Maria, editor, and Hassner, Tal, editor
- Published
- 2022
- Full Text
- View/download PDF
40. Simple Baselines for Human Pose Estimation and Tracking
- Author
-
Xiao, Bin, Wu, Haiping, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
There has been significant progress on pose estimation and increasing interests on pose tracking in recent years. At the same time, the overall algorithm and system complexity increases as well, making the algorithm analysis and comparison more difficult. This work provides simple and effective baseline methods. They are helpful for inspiring and evaluating new ideas for the field. State-of-the-art results are achieved on challenging benchmarks. The code will be available at https://github.com/leoxiaobin/pose.pytorch., Comment: Accepted by ECCV 2018
- Published
- 2018
41. Towards High Performance Video Object Detection for Mobiles
- Author
-
Zhu, Xizhou, Dai, Jifeng, Zhu, Xingchi, Wei, Yichen, and Yuan, Lu
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Despite the recent success of video object detection on Desktop GPUs, its architecture is still far too heavy for mobiles. It is also unclear whether the key principles of sparse feature propagation and multi-frame feature aggregation apply at very limited computational resources. In this paper, we present a light weight network architecture for video object detection on mobiles. Light weight image object detector is applied on sparse key frames. A very small network, Light Flow, is designed for establishing correspondence across frames. A flow-guided GRU module is designed to effectively aggregate features on key frames. For non-key frames, sparse feature propagation is performed. The whole network can be trained end-to-end. The proposed system achieves 60.2% mAP score at speed of 25.6 fps on mobiles (e.g., HuaWei Mate 8).
- Published
- 2018
42. Learning Region Features for Object Detection
- Author
-
Gu, Jiayuan, Hu, Han, Wang, Liwei, Wei, Yichen, and Dai, Jifeng
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
While most steps in the modern object detection methods are learnable, the region feature extraction step remains largely hand-crafted, featured by RoI pooling methods. This work proposes a general viewpoint that unifies existing region feature extraction methods and a novel method that is end-to-end learnable. The proposed method removes most heuristic choices and outperforms its RoI pooling counterparts. It moves further towards fully learnable object detection.
- Published
- 2018
43. Pseudo Mask Augmented Object Detection
- Author
-
Zhao, Xiangyun, Liang, Shuang, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
In this work, we present a novel and effective framework to facilitate object detection with the instance-level segmentation information that is only supervised by bounding box annotation. Starting from the joint object detection and instance segmentation network, we propose to recursively estimate the pseudo ground-truth object masks from the instance-level object segmentation network training, and then enhance the detection network with top-down segmentation feedbacks. The pseudo ground truth mask and network parameters are optimized alternatively to mutually benefit each other. To obtain the promising pseudo masks in each iteration, we embed a graphical inference that incorporates the low-level image appearance consistency and the bounding box annotations to refine the segmentation masks predicted by the segmentation network. Our approach progressively improves the object detection performance by incorporating the detailed pixel-wise information learned from the weakly-supervised segmentation network. Extensive evaluation on the detection task in PASCAL VOC 2007 and 2012 [12] verifies that the proposed approach is effective.
- Published
- 2018
44. Relation Networks for Object Detection
- Author
-
Hu, Han, Gu, Jiayuan, Zhang, Zheng, Dai, Jifeng, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Although it is well believed for years that modeling relations between objects would help object recognition, there has not been evidence that the idea is working in the deep learning era. All state-of-the-art object detection systems still rely on recognizing object instances individually, without exploiting their relations during learning. This work proposes an object relation module. It processes a set of objects simultaneously through interaction between their appearance feature and geometry, thus allowing modeling of their relations. It is lightweight and in-place. It does not require additional supervision and is easy to embed in existing networks. It is shown effective on improving object recognition and duplicate removal steps in the modern object detection pipeline. It verifies the efficacy of modeling object relations in CNN based detection. It gives rise to the first fully end-to-end object detector.
- Published
- 2017
45. Towards High Performance Video Object Detection
- Author
-
Zhu, Xizhou, Dai, Jifeng, Yuan, Lu, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
There has been significant progresses for image object detection in recent years. Nevertheless, video object detection has received little attention, although it is more challenging and more important in practical scenarios. Built upon the recent works, this work proposes a unified approach based on the principle of multi-frame end-to-end learning of features and cross-frame motion. Our approach extends prior works with three new techniques and steadily pushes forward the performance envelope (speed-accuracy tradeoff), towards high performance video object detection.
- Published
- 2017
46. Integral Human Pose Regression
- Author
-
Sun, Xiao, Xiao, Bin, Wei, Fangyin, Liang, Shuang, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
State-of-the-art human pose estimation methods are based on heat map representation. In spite of the good performance, the representation has a few issues in nature, such as not differentiable and quantization error. This work shows that a simple integral operation relates and unifies the heat map representation and joint regression, thus avoiding the above issues. It is differentiable, efficient, and compatible with any heat map based methods. Its effectiveness is convincingly validated via comprehensive ablation experiments under various settings, specifically on 3D pose estimation, for the first time.
- Published
- 2017
47. Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach
- Author
-
Zhou, Xingyi, Huang, Qixing, Sun, Xiao, Xue, Xiangyang, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
In this paper, we study the task of 3D human pose estimation in the wild. This task is challenging due to lack of training data, as existing datasets are either in the wild images with 2D pose or in the lab images with 3D pose. We propose a weakly-supervised transfer learning method that uses mixed 2D and 3D labels in a unified deep neutral network that presents two-stage cascaded structure. Our network augments a state-of-the-art 2D pose estimation sub-network with a 3D depth regression sub-network. Unlike previous two stage approaches that train the two sub-networks sequentially and separately, our training is end-to-end and fully exploits the correlation between the 2D pose and depth estimation sub-tasks. The deep features are better learnt through shared representations. In doing so, the 3D pose labels in controlled lab environments are transferred to in the wild images. In addition, we introduce a 3D geometric constraint to regularize the 3D pose prediction, which is effective in the absence of ground truth depth labels. Our method achieves competitive results on both 2D and 3D benchmarks., Comment: Accepted to ICCV 2017
- Published
- 2017
48. Compositional Human Pose Regression
- Author
-
Sun, Xiao, Shang, Jiaxiang, Liang, Shuang, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Regression based methods are not performing as well as detection based methods for human pose estimation. A central problem is that the structural information in the pose is not well exploited in the previous regression methods. In this work, we propose a structure-aware regression approach. It adopts a reparameterized pose representation using bones instead of joints. It exploits the joint connection structure to define a compositional loss function that encodes the long range interactions in the pose. It is simple, effective, and general for both 2D and 3D pose estimation in a unified setting. Comprehensive evaluation validates the effectiveness of our approach. It significantly advances the state-of-the-art on Human3.6M and is competitive with state-of-the-art results on MPII., Comment: Accepted by International Conference on Computer Vision (ICCV) 2017
- Published
- 2017
49. Flow-Guided Feature Aggregation for Video Object Detection
- Author
-
Zhu, Xizhou, Wang, Yujie, Dai, Jifeng, Yuan, Lu, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Extending state-of-the-art object detectors from image to video is challenging. The accuracy of detection suffers from degenerated object appearances in videos, e.g., motion blur, video defocus, rare poses, etc. Existing work attempts to exploit temporal information on box level, but such methods are not trained end-to-end. We present flow-guided feature aggregation, an accurate and end-to-end learning framework for video object detection. It leverages temporal coherence on feature level instead. It improves the per-frame features by aggregation of nearby features along the motion paths, and thus improves the video recognition accuracy. Our method significantly improves upon strong single-frame baselines in ImageNet VID, especially for more challenging fast moving objects. Our framework is principled, and on par with the best engineered systems winning the ImageNet VID challenges 2016, without additional bells-and-whistles. The proposed method, together with Deep Feature Flow, powered the winning entry of ImageNet VID challenges 2017. The code is available at https://github.com/msracver/Flow-Guided-Feature-Aggregation.
- Published
- 2017
50. Deformable Convolutional Networks
- Author
-
Dai, Jifeng, Qi, Haozhi, Xiong, Yuwen, Li, Yi, Zhang, Guodong, Hu, Han, and Wei, Yichen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Convolutional neural networks (CNNs) are inherently limited to model geometric transformations due to the fixed geometric structures in its building modules. In this work, we introduce two new modules to enhance the transformation modeling capacity of CNNs, namely, deformable convolution and deformable RoI pooling. Both are based on the idea of augmenting the spatial sampling locations in the modules with additional offsets and learning the offsets from target tasks, without additional supervision. The new modules can readily replace their plain counterparts in existing CNNs and can be easily trained end-to-end by standard back-propagation, giving rise to deformable convolutional networks. Extensive experiments validate the effectiveness of our approach on sophisticated vision tasks of object detection and semantic segmentation. The code would be released.
- Published
- 2017
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.