Author: "Fang, Xin" / Publication Type: Reports - Searchworks@Jio Institute Digital Library Search Results

1. The USTC-NERCSLIP Systems for The ICMC-ASR Challenge

Author: Wu, Minghui, Xu, Luzhen, Zhang, Jie, Tang, Haitao, Yue, Yanyan, Liao, Ruizhi, Zhao, Jintao, Zhang, Zhengzhe, Wang, Yichi, Yan, Haoyin, Yu, Hongliang, Ma, Tongle, Liu, Jiachen, Wu, Chongliang, Li, Yongchao, Zhang, Yanyong, Fang, Xin, and Zhang, Yue
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Sound
Abstract: This report describes the submitted system to the In-Car Multi-Channel Automatic Speech Recognition (ICMC-ASR) challenge, which considers the ASR task with multi-speaker overlapping and Mandarin accent dynamics in the ICMC case. We implement the front-end speaker diarization using the self-supervised learning representation based multi-speaker embedding and beamforming using the speaker position, respectively. For ASR, we employ an iterative pseudo-label generation method based on fusion model to obtain text labels of unsupervised data. To mitigate the impact of accent, an Accent-ASR framework is proposed, which captures pronunciation-related accent features at a fine-grained level and linguistic information at a coarse-grained level. On the ICMC-ASR eval set, the proposed system achieves a CER of 13.16% on track 1 and a cpCER of 21.48% on track 2, which significantly outperforms the official baseline system and obtains the first rank on both tracks., Comment: Accepted at ICASSP 2024
Published: 2024

2. Equity-aware Load Shedding Optimization

Author: Fang, Xin, Wang, Wenbo, and Ding, Fei
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: Load shedding is usually the last resort to balance generation and demand to maintain stable operation of the electric grid after major disturbances. Current load-shedding optimization practices focus mainly on the physical optimality of the network power flow. This might lead to an uneven allocation of load curtailment, disadvantaging some loads more than others. Addressing this oversight, this paper introduces an innovative equity-aware load-shedding optimization model that emphasizes a fair allocation of load curtailment across the network. By proposing a novel equity indicator for load shedding and integrating it into an ACOPF-based optimization framework, we offer grid operators a more balanced and equitable load shedding strategy. Case studies highlight the importance of equity considerations in determining optimal load curtailment between buses., Comment: Contact email for corresponding and first author: allen.fangxin@gmail.com
Published: 2024

3. Exploring Audio-Visual Information Fusion for Sound Event Localization and Detection In Low-Resource Realistic Scenarios

Author: Jiang, Ya, Wang, Qing, Du, Jun, Hu, Maocheng, Hu, Pengfei, Liu, Zeyan, Cheng, Shi, Nian, Zhaoxu, Dong, Yuxuan, Cai, Mingqi, Fang, Xin, and Lee, Chin-Hui
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Electrical Engineering and Systems Science - Signal Processing
Abstract: This study presents an audio-visual information fusion approach to sound event localization and detection (SELD) in low-resource scenarios. We aim at utilizing audio and video modality information through cross-modal learning and multi-modal fusion. First, we propose a cross-modal teacher-student learning (TSL) framework to transfer information from an audio-only teacher model, trained on a rich collection of audio data with multiple data augmentation techniques, to an audio-visual student model trained with only a limited set of multi-modal data. Next, we propose a two-stage audio-visual fusion strategy, consisting of an early feature fusion and a late video-guided decision fusion to exploit synergies between audio and video modalities. Finally, we introduce an innovative video pixel swapping (VPS) technique to extend an audio channel swapping (ACS) method to an audio-visual joint augmentation. Evaluation results on the Detection and Classification of Acoustic Scenes and Events (DCASE) 2023 Challenge data set demonstrate significant improvements in SELD performances. Furthermore, our submission to the SELD task of the DCASE 2023 Challenge ranks first place by effectively integrating the proposed techniques into a model ensemble., Comment: accepted by icme2024
Published: 2024

4. Linear degenerate symplectic flag varieties: symmetric degenerations and PBW locus

Author: Boos, Magdalena, Irelli, Giovanni Cerulli, Fang, Xin, and Fourier, Ghislain
Subjects: Mathematics - Representation Theory, Mathematics - Rings and Algebras, 16Gxx, 14L35, 17B45
Abstract: We conceptualize in the paper the linear degenerate symplectic flag varieties as symmetric degenerations within the framework of type $A$ equioriented quivers. First, in the larger context of symmetric degenerations, we give a self-contained proof of the equivalence of different degeneration orders. Furthermore, we investigate the PBW locus: geometric properties of the degenerate varieties in this locus are proved by realizing them from different perspectives., Comment: 23 pages
Published: 2024

5. Dynkin abelianisations of flag varieties

Author: Enugandla, Shreepranav Varma, Fang, Xin, Fourier, Ghislain, and Steinert, Christian
Subjects: Mathematics - Representation Theory, Mathematics - Algebraic Geometry, Mathematics - Rings and Algebras, 17B10, 14D06, 14M15
Abstract: Cerulli Irelli and Lanini have shown that PBW degenerations of flag varieties in type A and C are actually Schubert varieties of higher rank. We introduce Dynkin cones to parameterise specific abelianisations of classical Lie algebras. Within this framework, we generalise their result to all degenerations of flag varieties defined by degree vectors originating from a Dynkin cone. This framework allows us to determine the extent to which a flag variety can be degenerate while still naturally being a Schubert variety of the same Lie type. Furthermore, we compute the defining relations for the corresponding degenerate simple modules in all classical types., Comment: 27 pages, this is a preliminary version
Published: 2024

6. Computing monomial bases in Lie theory using OSCAR

Author: Fang, Xin, Fourier, Ghislain, Göttgens, Lars, and Wilop, Ben
Subjects: Mathematics - Representation Theory, Mathematics - Algebraic Geometry, 17B10, 14M25, 14D06, 14M15, 14-04, 97N80
Abstract: In this survey, we present a detailed guide on using the computer algebra system OSCAR to compute monomial bases for simple, finite-dimensional modules of simple, complex Lie algebras. We will also demonstrate how to determine monomial bases for the homogeneous coordinate ring of a (partial) flag variety, depending on a chosen birational sequence and a monomial order. This survey will be updated to reflect any advancements in OSCAR's capabilities in these areas., Comment: 16 pages, Submitted as a bookchapter for the upcoming Oscar book
Published: 2024

7. Multitask frame-level learning for few-shot sound event detection

Author: Zou, Liang, Yan, Genwei, Wang, Ruoyu, Du, Jun, Lei, Meng, Gao, Tian, and Fang, Xin
Subjects: Computer Science - Sound, Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: This paper focuses on few-shot Sound Event Detection (SED), which aims to automatically recognize and classify sound events with limited samples. However, prevailing methods methods in few-shot SED predominantly rely on segment-level predictions, which often providing detailed, fine-grained predictions, particularly for events of brief duration. Although frame-level prediction strategies have been proposed to overcome these limitations, these strategies commonly face difficulties with prediction truncation caused by background noise. To alleviate this issue, we introduces an innovative multitask frame-level SED framework. In addition, we introduce TimeFilterAug, a linear timing mask for data augmentation, to increase the model's robustness and adaptability to diverse acoustic environments. The proposed method achieves a F-score of 63.8%, securing the 1st rank in the few-shot bioacoustic event detection category of the Detection and Classification of Acoustic Scenes and Events Challenge 2023., Comment: 6 pages, 4 figures, conference
Published: 2024

8. Schubert valuations on Grassmann varieties

Author: Chirivì, Rocco, Fang, Xin, and Littelmann, Peter
Subjects: Mathematics - Algebraic Geometry, Mathematics - Combinatorics
Abstract: The goal of the paper is twofold: on one side it provides an order structure on the set of all maximal chains in the Bruhat poset of Schubert varieties in a Grassmann variety; on the other hand, using this order structure, it works out explicit formulae for the valuation and the Newton-Okounkov body associated to each maximal chain appearing in the framework of Seshadri stratification., Comment: 26 pages
Published: 2024

9. Stable Relay Learning Optimization Approach for Fast Power System Production Cost Minimization Simulation

Author: Guo, Zishan, Hu, Qinran, Qian, Tao, Fang, Xin, Hu, Renjie, and Wu, Zaijun
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: Production cost minimization (PCM) simulation is commonly employed for assessing the operational efficiency, economic viability, and reliability, providing valuable insights for power system planning and operations. However, solving a PCM problem is time-consuming, consisting of numerous binary variables for simulation horizon extending over months and years. This hinders rapid assessment of modern energy systems with diverse planning requirements. Existing methods for accelerating PCM tend to sacrifice accuracy for speed. In this paper, we propose a stable relay learning optimization (s-RLO) approach within the Branch and Bound (B&B) algorithm. The proposed approach offers rapid and stable performance, and ensures optimal solutions. The two-stage s-RLO involves an imitation learning (IL) phase for accurate policy initialization and a reinforcement learning (RL) phase for time-efficient fine-tuning. When implemented on the popular SCIP solver, s-RLO returns the optimal solution up to 2 times faster than the default relpscost rule and 1.4 times faster than IL, or exhibits a smaller gap at the predefined time limit. The proposed approach shows stable performance, reducing fluctuations by approximately 50% compared with IL. The efficacy of the proposed s-RLO approach is supported by numerical results., Comment: Submitted to IEEE Transactions on Power Systems on December 15, 2023
Published: 2023

10. SegRap2023: A Benchmark of Organs-at-Risk and Gross Tumor Volume Segmentation for Radiotherapy Planning of Nasopharyngeal Carcinoma

Author: Luo, Xiangde, Fu, Jia, Zhong, Yunxin, Liu, Shuolin, Han, Bing, Astaraki, Mehdi, Bendazzoli, Simone, Toma-Dasu, Iuliana, Ye, Yiwen, Chen, Ziyang, Xia, Yong, Su, Yanzhou, Ye, Jin, He, Junjun, Xing, Zhaohu, Wang, Hongqiu, Zhu, Lei, Yang, Kaixiang, Fang, Xin, Wang, Zhiwei, Lee, Chan Woong, Park, Sang Joon, Chun, Jaehee, Ulrich, Constantin, Maier-Hein, Klaus H., Ndipenoch, Nchongmaje, Miron, Alina, Li, Yongmin, Zhang, Yimeng, Chen, Yu, Bai, Lu, Huang, Jinlong, An, Chengyang, Wang, Lisheng, Huang, Kaiwen, Gu, Yunqi, Zhou, Tao, Zhou, Mu, Zhang, Shichuan, Liao, Wenjun, Wang, Guotai, and Zhang, Shaoting
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Radiation therapy is a primary and effective NasoPharyngeal Carcinoma (NPC) treatment strategy. The precise delineation of Gross Tumor Volumes (GTVs) and Organs-At-Risk (OARs) is crucial in radiation treatment, directly impacting patient prognosis. Previously, the delineation of GTVs and OARs was performed by experienced radiation oncologists. Recently, deep learning has achieved promising results in many medical image segmentation tasks. However, for NPC OARs and GTVs segmentation, few public datasets are available for model development and evaluation. To alleviate this problem, the SegRap2023 challenge was organized in conjunction with MICCAI2023 and presented a large-scale benchmark for OAR and GTV segmentation with 400 Computed Tomography (CT) scans from 200 NPC patients, each with a pair of pre-aligned non-contrast and contrast-enhanced CT scans. The challenge's goal was to segment 45 OARs and 2 GTVs from the paired CT scans. In this paper, we detail the challenge and analyze the solutions of all participants. The average Dice similarity coefficient scores for all submissions ranged from 76.68\% to 86.70\%, and 70.42\% to 73.44\% for OARs and GTVs, respectively. We conclude that the segmentation of large-size OARs is well-addressed, and more efforts are needed for GTVs and small-size or thin-structure OARs. The benchmark will remain publicly available here: https://segrap2023.grand-challenge.org, Comment: A challenge report of SegRap2023 (organized in conjunction with MICCAI2023)
Published: 2023

11. Extriangulated ideal quotients, with applications to cluster theory and gentle algebras

Author: Fang, Xin, Gorsky, Mikhail, Palu, Yann, Plamondon, Pierre-Guy, and Pressland, Matthew
Subjects: Mathematics - Representation Theory
Abstract: We extend results of Br\"ustle-Yang on ideal quotients of 2-term subcategories of perfect derived categories of non-positive dg algebras to a relative setting. We find a new interpretation of such quotients: they appear as prototypical examples of a new construction of quotients of extriangulated categories by ideals generated by morphisms from injectives to projectives. We apply our results to Frobenius exact cluster categories and Higgs categories with suitable relative extriangulated structures, and to categories of walks related to gentle algebras. In all three cases, the extriangulated structures are well-behaved (they are 0-Auslander) and their quotients are equivalent to homotopy categories of two-term complexes of projectives over suitable finite-dimensional algebras., Comment: 47 pages, comments welcome. v2: corrected statements in Section 5.1, other minor corrections
Published: 2023

12. AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram Transformer

Author: Li, Kang, Song, Yan, Dai, Li-Rong, McLoughlin, Ian, Fang, Xin, and Liu, Lin
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Sound
Abstract: In this paper, we propose an effective sound event detection (SED) method based on the audio spectrogram transformer (AST) model, pretrained on the large-scale AudioSet for audio tagging (AT) task, termed AST-SED. Pretrained AST models have recently shown promise on DCASE2022 challenge task4 where they help mitigate a lack of sufficient real annotated data. However, mainly due to differences between the AT and SED tasks, it is suboptimal to directly utilize outputs from a pretrained AST model. Hence the proposed AST-SED adopts an encoder-decoder architecture to enable effective and efficient fine-tuning without needing to redesign or retrain the AST model. Specifically, the Frequency-wise Transformer Encoder (FTE) consists of transformers with self attention along the frequency axis to address multiple overlapped audio events issue in a single clip. The Local Gated Recurrent Units Decoder (LGD) consists of nearest-neighbor interpolation (NNI) and Bidirectional Gated Recurrent Units (Bi-GRU) to compensate for temporal resolution loss in the pretrained AST model output. Experimental results on DCASE2022 task4 development set have demonstrated the superiority of the proposed AST-SED with FTE-LGD architecture. Specifically, the Event-Based F1-score (EB-F1) of 59.60% and Polyphonic Sound detection Score scenario1 (PSDS1) score of 0.5140 significantly outperform CRNN and other pretrained AST-based systems., Comment: accepted to ICASSP 2023
Published: 2023

13. Tropical symplectic flag varieties: a Lie-theoretic approach

Author: Balla, George and Fang, Xin
Subjects: Mathematics - Algebraic Geometry, Mathematics - Representation Theory, 14M15 (Primary), 14T20 (Secondary)
Abstract: We study tropicalization of symplectic flag varieties with respect to the Pl\"ucker embedding. We identify a particular maximal prime cone in this tropicalization by explicitly giving its facets. For every interior point of this maximal cone, the corresponding Gr\"obner degeneration is the toric variety associated to the Feigin-Fourier-Littelmann-Vinberg (FFLV) polytope. Our main tool is the notion of birational sequences introduced by Fourier, Littelmann and the second author, which bridges between weighted PBW filtrations of representations of symplectic Lie algebras and degree functions on defining ideals of symplectic flag varieties., Comment: 31 pages
Published: 2023
Full Text: View/download PDF

14. Deep Virtual-to-Real Distillation for Pedestrian Crossing Prediction

Author: Bai, Jie, Fang, Xin, Fang, Jianwu, Xue, Jianru, and Yuan, Changwei
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Pedestrian crossing is one of the most typical behavior which conflicts with natural driving behavior of vehicles. Consequently, pedestrian crossing prediction is one of the primary task that influences the vehicle planning for safe driving. However, current methods that rely on the practically collected data in real driving scenes cannot depict and cover all kinds of scene condition in real traffic world. To this end, we formulate a deep virtual to real distillation framework by introducing the synthetic data that can be generated conveniently, and borrow the abundant information of pedestrian movement in synthetic videos for the pedestrian crossing prediction in real data with a simple and lightweight implementation. In order to verify this framework, we construct a benchmark with 4667 virtual videos owning about 745k frames (called Virtual-PedCross-4667), and evaluate the proposed method on two challenging datasets collected in real driving situations, i.e., JAAD and PIE datasets. State-of-the-art performance of this framework is demonstrated by exhaustive experiment analysis. The dataset and code can be downloaded from the website \url{http://www.lotvs.net/code_data/}., Comment: Accepted by ITSC 2022
Published: 2022

15. A Unified Analytical Method to Quantify Three Types of Fast Frequency Response from Inverter-based Resources

Author: Dong, Shuan, Fang, Xin, Tan, Jin, Gao, Ningchao, Cui, Xiaofan, and Hoke, Anderson
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: With more inverter-based resources (IBRs), our power systems have lower frequency nadirs following N-1 contingencies, and undesired under-frequency load shedding (UFLS) can occur. To address this challenge, IBRs can be programmed to provide at least three types of fast frequency response (FFR), e.g., step response, proportional response (P/f droop response), and derivative response (synthetic inertia). However, these heterogeneous FFR challenge the study of power system frequency dynamics. Thus, this paper develops an analytical frequency nadir prediction method that allows for the consideration of all three potential forms of FFR provided by IBRs. The proposed method provides fast and accurate frequency nadir estimation after N-1 generation tripping contingencies. Our method is grounded on the closed-form solution for the frequency nadir, which is solved from the second-order system frequency response model considering the governor dynamics and three types of FFR. The simulation results in the IEEE 39-bus system with different types of FFR demonstrate that the proposed method provides an accurate and fast prediction of the frequency nadir under various disturbances.
Published: 2022

16. Seshadri stratifications and Schubert varieties: a geometric construction of a standard monomial theory

Author: Chirivì, Rocco, Fang, Xin, and Littelmann, Peter
Subjects: Mathematics - Algebraic Geometry, Mathematics - Combinatorics, Mathematics - Representation Theory
Abstract: A standard monomial theory for Schubert varieties is constructed exploiting (1) the geometry of the Seshadri stratifications of Schubert varieties by their Schubert subvarieties and (2) the combinatorial LS-path character formula for Demazure modules. The general theory of Seshadri stratifications is improved by using arbitrary linearization of the partial order and by weakening the definition of balanced stratification.
Published: 2022

17. On normal Seshadri stratifications

Author: Chirivì, Rocco, Fang, Xin, and Littelmann, Peter
Subjects: Mathematics - Algebraic Geometry, Mathematics - Commutative Algebra
Abstract: The existence of a Seshadri stratification on an embedded projective variety provides a flat degeneration of the variety to a union of projective toric varieties, called a semi-toric variety. Such a stratification is said to be normal when each irreducible component of the semi-toric variety is a normal toric variety. In this case, we show that a Gr\"obner basis of the defining ideal of the semi-toric variety can be lifted to define the embedded projective variety. Applications to Koszul and Gorenstein properties are discussed., Comment: 21 pages
Published: 2022

18. Specialization map for quiver Grassmannians

Author: Irelli, Giovanni Cerulli, Esposito, Francesco, Fang, Xin, and Fourier, Ghislain
Subjects: Mathematics - Representation Theory, Mathematics - Algebraic Geometry
Abstract: We define a specialization map between cohomology algebras of quiver Grassmannians of Dynkin type and we prove that it is surjective in type A. This generalizes a result of Lanini and Strickland., Comment: The new version include a new definition of the specialization map for quiver Grassmannians with respect to the one given in the first version. The new definition can be applied to a large class of quiver algebras
Published: 2022

19. DLMP of Competitive Markets in Active Distribution Networks: Models, Solutions, Applications, and Visions

Author: Wang, Xiaofei, Li, Fangxing, Bai, Linquan, and Fang, Xin
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: Traditionally, the electric distribution system operates with uniform energy prices across all system nodes. However, as the adoption of distributed energy resources (DERs) propels a shift from passive to active distribution network (ADN) operation, a distribution-level electricity market has been proposed to manage new complexities efficiently. In addition, distribution locational marginal price (DLMP) has been established in the literature as the primary pricing mechanism. The DLMP inherits the LMP concept in the transmission-level wholesale market, but incorporates characteristics of the distribution system, such as high R/X ratios and power losses, system imbalance, and voltage regulation needs. The DLMP provides a solution that can be essential for competitive market operation in future distribution systems. This paper first provides an overview of the current distribution-level market architectures and their early implementations. Next, the general clearing model, model relaxations, and DLMP formulation are comprehensively reviewed. The state-of-the-art solution methods for distribution market clearing are summarized and categorized into centralized, distributed, and decentralized methods. Then, DLMP applications for the operation and planning of DERs and distribution system operators (DSOs) are discussed in detail. Finally, visions of future research directions and possible barriers and challenges are presented.
Published: 2022
Full Text: View/download PDF

20. Band degeneration and evolution in nonlinear triatomic chain superlattices

Author: Gong, Chen, Fang, Xin, and Cheng, Li
Subjects: Condensed Matter - Other Condensed Matter, Nonlinear Sciences - Adaptation and Self-Organizing Systems
Abstract: Nonlinear superlattices exhibit unique features allowing for wave manipulations. Despite the increasing attention received, the underlying physical mechanisms and the evolution process of the band structures and bandgaps in strongly nonlinear superlattices remain unclear. Here we establish and examine strongly nonlinear superlattice models (three triatomic models) to show the evolution process of typical nonlinear band structures based on analytical and numerical approaches. We find that the strongly nonlinear superlattices present particular band degeneration and bifurcation, accompanied with the vibration mode transfer in their unit cells. The evolution processes and the physical mechanisms of the band degeneration in different models are clarified with the consideration of the mode transfer. The observed degeneration may occur as the shifting, bifurcating, shortening, merging or disappearing of dispersion curves, all depending on the arrangement of the coupled nonlinear elements. Meanwhile, the dimension of the unit cell reduces, alongside changes in the frequency range and mechanisms (Bragg and local resonance) of the bandgaps. These findings answer some foundamental questions peritinent to the study of nonlinear periodic structures, nonlinear crystals and nonlinear metamaterials, which are of interest to the broad community of physics, Comment: 14 pages,11 figures
Published: 2022

21. A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition

Author: Du, Ye-Qian, Zhang, Jie, Zhu, Qiu-Shi, Dai, Li-Rong, Wu, Ming-Hui, Fang, Xin, and Yang, Zhou-Wang
Subjects: Computer Science - Sound, Computer Science - Computation and Language, Electrical Engineering and Systems Science - Audio and Speech Processing, I.2.7
Abstract: Unpaired data has shown to be beneficial for low-resource automatic speech recognition~(ASR), which can be involved in the design of hybrid models with multi-task training or language model dependent pre-training. In this work, we leverage unpaired data to train a general sequence-to-sequence model. Unpaired speech and text are used in the form of data pairs by generating the corresponding missing parts in prior to model training. Inspired by the complementarity of speech-PseudoLabel pair and SynthesizedAudio-text pair in both acoustic features and linguistic features, we propose a complementary joint training~(CJT) method that trains a model alternatively with two data pairs. Furthermore, label masking for pseudo-labels and gradient restriction for synthesized audio are proposed to further cope with the deviations from real data, termed as CJT++. Experimental results show that compared to speech-only training, the proposed basic CJT achieves great performance improvements on clean/other test sets, and the CJT++ re-training yields further performance enhancements. It is also apparent that the proposed method outperforms the wav2vec2.0 model with the same model size and beam size, particularly in extreme low-resource cases., Comment: 5 pages, 3 figures
Published: 2022

22. Seshadri stratification for Schubert varieties and Standard Monomial Theory

Author: Chirivì, Rocco, Fang, Xin, and Littelmann, Peter
Subjects: Mathematics - Algebraic Geometry, Mathematics - Combinatorics, Mathematics - Quantum Algebra, Mathematics - Representation Theory
Abstract: The theory of Seshadri stratifications has been developed by the authors with the intention to build up a new geometric approach towards a standard monomial theory for embedded projective varieties with certain nice properties. In this article, we investigate the Seshadri stratification on a Schubert variety arising from its Schubert subvarieties. We show that the standard monomial theory developed in [32] is compatible with this new strategy., Comment: 55 pages, to appear in Proceedings - Mathematical Sciences, Special Issue in Memory of Professor C S Seshadri
Published: 2022

23. LS Algebras, Valuations and Schubert Varieties

Author: Chirivì, Rocco, Fang, Xin, and Littelmann, Peter
Subjects: Mathematics - Algebraic Geometry, Mathematics - Commutative Algebra, Mathematics - Combinatorics, Mathematics - Representation Theory, 14M15, 14M25
Abstract: In this paper, we propose an algebraic approach via Lakshmibai-Seshadri (LS) algebras to establish a link between standard monomial theories, Newton-Okounkov bodies and valuations. This is applied to Schubert varieties, where this approach is compatible with the one using Seshadri stratifications by the same authors (arXiv:2112.03776), showing that LS paths encode vanishing multiplicities with respect to the web of Schubert varieties.
Published: 2022

24. Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition

Author: Zhang, Zi-Qiang, Zhang, Jie, Zhang, Jian-Shu, Wu, Ming-Hui, Fang, Xin, and Dai, Li-Rong
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: With the advance in self-supervised learning for audio and visual modalities, it has become possible to learn a robust audio-visual speech representation. This would be beneficial for improving the audio-visual speech recognition (AVSR) performance, as the multi-modal inputs contain more fruitful information in principle. In this paper, based on existing self-supervised representation learning methods for audio modality, we therefore propose an audio-visual representation learning approach. The proposed approach explores both the complementarity of audio-visual modalities and long-term context dependency using a transformer-based fusion module and a flexible masking strategy. After pre-training, the model is able to extract fused representations required by AVSR. Without loss of generality, it can be applied to single-modal tasks, e.g. audio/visual speech recognition by simply masking out one modality in the fusion module. The proposed pre-trained model is evaluated on speech recognition and lipreading tasks using one or two modalities, where the superiority is revealed., Comment: 5 pages
Published: 2022

25. A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition

Author: Zhu, Qiu-Shi, Zhang, Jie, Zhang, Zi-Qiang, Wu, Ming-Hui, Fang, Xin, and Dai, Li-Rong
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Sound
Abstract: Wav2vec2.0 is a popular self-supervised pre-training framework for learning speech representations in the context of automatic speech recognition (ASR). It was shown that wav2vec2.0 has a good robustness against the domain shift, while the noise robustness is still unclear. In this work, we therefore first analyze the noise robustness of wav2vec2.0 via experiments. We observe that wav2vec2.0 pre-trained on noisy data can obtain good representations and thus improve the ASR performance on the noisy test set, which however brings a performance degradation on the clean test set. To avoid this issue, in this work we propose an enhanced wav2vec2.0 model. Specifically, the noisy speech and the corresponding clean version are fed into the same feature encoder, where the clean speech provides training targets for the model. Experimental results reveal that the proposed method can not only improve the ASR performance on the noisy test set which surpasses the original wav2vec2.0, but also ensure a tiny performance decrease on the clean test set. In addition, the effectiveness of the proposed method is demonstrated under different types of noise conditions., Comment: Accepted by ICASSP 2022
Published: 2022
Full Text: View/download PDF

26. Seshadri stratifications and standard monomial theory

Author: Chirivì, Rocco, Fang, Xin, and Littelmann, Peter
Subjects: Mathematics - Algebraic Geometry, Mathematics - Commutative Algebra, Mathematics - Combinatorics
Abstract: We introduce the notion of a Seshadri stratification on an embedded projective variety. Such a structure enables us to construct a Newton-Okounkov simplicial complex and a flat degeneration of the projective variety into a union of toric varieties. We show that the Seshadri stratification provides a geometric setup for a standard monomial theory. In this framework, Lakshmibai-Seshadri paths for Schubert varieties get a geometric interpretation as successive vanishing orders of regular functions., Comment: 76 pages
Published: 2021
Full Text: View/download PDF

27. Chaotic time-delay signature suppression using quantum noise

Author: Guo, Yanqiang, Fang, Xin, Zhang, Haojie, Zhao, Tong, Virte, Martin, and Guo, Xiaomin
Subjects: Quantum Physics, Nonlinear Sciences - Chaotic Dynamics, Physics - Optics
Abstract: Time-delay signature (TDS) suppression of semiconductor lasers with external optical feedback is necessary to ensure the security of chaos-based secure communications. Here we numerically and experimentally demonstrate a technique to effectively suppress the TDS of chaotic lasers using quantum noise. The TDS and dynamical complexity are quantified using the autocorrelation function and normalized permutation entropy at the feedback delay time, respectively. Quantum noise from quadrature fluctuations of vacuum state is prepared through balanced homodyne measurement. The effects of strength and bandwidth of quantum noise on chaotic TDS suppression and complexity enhancement are investigated numerically and experimentally. Compared to the original dynamics, the TDS of this quantum-noise improved chaos is suppressed up to 94% and the bandwidth suppression ratio of quantum noise to chaotic laser is 1:25. The experiment agrees well with the theory. The improved chaotic laser is potentially beneficial to chaos-based random number generation and secure communication., Comment: 4 pages, 5 figures
Published: 2021
Full Text: View/download PDF

28. Impact of DER Communication Delay in AGC: Cyber-Physical Dynamic Simulation

Author: Wang, Wenbo, Fang, Xin, and Florita, Anthony
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: Distributed energy resource (DER) frequency regulations are promising technologies for future grid operation. Unlike conventional generators, DERs might require open communication networks to exchange signals with control centers, possibly through DER aggregators; therefore, the impacts of the communication variations on the system stability need to be investigated. This paper develops a cyber-physical dynamic simulation model based on the Hierarchical Engine for Large-Scale Co-Simulation (HELICS) to evaluate the impact of the communication variations, such as delays in DER frequency regulations. The feasible delay range can be obtained under different parameter settings. The results show that the risk of instability generally increases with the communication delay.
Published: 2021

29. Rethinking Annotation Granularity for Overcoming Shortcuts in Deep Learning-based Radiograph Diagnosis: A Multicenter Study

Author: Luo, Luyang, Chen, Hao, Xiao, Yongjie, Zhou, Yanning, Wang, Xi, Vardhanabhuti, Varut, Wu, Mingxiang, Han, Chu, Liu, Zaiyi, Fang, Xin Hao Benjamin, Tsougenis, Efstratios, Lin, Huangjing, and Heng, Pheng-Ann
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Two DL models were developed using radiograph-level annotations (yes or no disease) and fine-grained lesion-level annotations (lesion bounding boxes), respectively named CheXNet and CheXDet. The models' internal classification performance and lesion localization performance were compared on a testing set (n=2,922), external classification performance was compared on NIH-Google (n=4,376) and PadChest (n=24,536) datasets, and external lesion localization performance was compared on NIH-ChestX-ray14 dataset (n=880). The models were also compared to radiologists on a subset of the internal testing set (n=496). Given sufficient training data, both models performed comparably to radiologists. CheXDet achieved significant improvement for external classification, such as in classifying fracture on NIH-Google (CheXDet area under the ROC curve [AUC]: 0.67, CheXNet AUC: 0.51; p<.001) and PadChest (CheXDet AUC: 0.78, CheXNet AUC: 0.55; p<.001). CheXDet achieved higher lesion detection performance than CheXNet for most abnormalities on all datasets, such as in detecting pneumothorax on the internal set (CheXDet jacknife alternative free-response ROC-figure of merit [JAFROC-FOM]: 0.87, CheXNet JAFROC-FOM: 0.13; p<.001) and NIH-ChestX-ray14 (CheXDet JAFROC-FOM: 0.55, CheXNet JAFROC-FOM: 0.04; p<.001). To summarize, fine-grained annotations overcame shortcut learning and enabled DL models to identify correct lesion patterns, improving the models' generalizability., Comment: Radiology: Artificial Intelligence
Published: 2021

30. USTC-NELSLIP System Description for DIHARD-III Challenge

Author: Wang, Yuxuan, He, Maokui, Niu, Shutong, Sun, Lei, Gao, Tian, Fang, Xin, Pan, Jia, Du, Jun, and Lee, Chin-Hui
Subjects: Computer Science - Sound, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: This system description describes our submission system to the Third DIHARD Speech Diarization Challenge. Besides the traditional clustering based system, the innovation of our system lies in the combination of various front-end techniques to solve the diarization problem, including speech separation and target-speaker based voice activity detection (TS-VAD), combined with iterative data purification. We also adopted audio domain classification to design domain-dependent processing. Finally, we performed post processing to do system fusion and selection. Our best system achieved DERs of 11.30% in track 1 and 16.78% in track 2 on evaluation set, respectively.
Published: 2021

31. XLST: Cross-lingual Self-training to Learn Multilingual Representation for Low Resource Speech Recognition

Author: Zhang, Zi-Qiang, Song, Yan, Wu, Ming-Hui, Fang, Xin, and Dai, Li-Rong
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Computation and Language, Computer Science - Sound
Abstract: In this paper, we propose a weakly supervised multilingual representation learning framework, called cross-lingual self-training (XLST). XLST is able to utilize a small amount of annotated data from high-resource languages to improve the representation learning on multilingual un-annotated data. Specifically, XLST uses a supervised trained model to produce initial representations and another model to learn from them, by maximizing the similarity between output embeddings of these two models. Furthermore, the moving average mechanism and multi-view data augmentation are employed, which are experimentally shown to be crucial to XLST. Comprehensive experiments have been conducted on the CommonVoice corpus to evaluate the effectiveness of XLST. Results on 5 downstream low-resource ASR tasks shows that our multilingual pretrained model achieves relatively 18.6% PER reduction over the state-of-the-art self-supervised method, with leveraging additional 100 hours of annotated English data., Comment: 5 pages, 1 figure
Published: 2021

32. Bidirectional elastic diode with frequency-preserved nonreciprocity

Author: Fang, Xin, Wen, Jihong, Cheng, Li, and Li, Baowen
Subjects: Physics - Applied Physics
Abstract: The study of nonreciprocal wave propagation is of great interests for both fundamental research and engineering applications. Here we demonstrate theoretically and experimentally a bidirectional, nonreciprocal, and high-quality diode that can rectify elastic waves in both forward and backward directions in an elastic metamaterial designed to exhibit enhanced nonlinearity of resonances. This diode can preserve or vary frequency, rectify low-frequency long wave with small system size, offer high-quality insulation, can be modulated by amplitude, and break reciprocity of both the total energy and fundamental wave. We report three mechanisms to break reciprocity: the amplitude-dependent bandgap combining interface reflection, chaotic response combining linear bandgap, amplitude-dependent attenuation rate in damping diode. The bidirectional diode paves ways for mutually controlling information/energy transport between two sources, which can be used as new wave insulators., Comment: 11 pages, 13 figures
Published: 2021
Full Text: View/download PDF

33. Transmission-and-Distribution Frequency Dynamic Co-Simulation Framework for Distributed Energy Resources Frequency Response

Author: Wang, Wenbo, Fang, Xin, Cui, Hantao, and Li, Fangxing
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: The rapid deployment of distributed energy resources (DERs) in distribution networks has brought challenges to balance the system and stabilize frequency. DERs have the ability to provide frequency regulation; however, existing dynamic frequency simulation tools-which were developed mainly for the transmission system-lack the capability to simulate distribution network dynamics with high penetrations of DERs. Although electromagnetic transient (EMT) simulation tools can simulate distribution network dynamics, the computation efficiency limits their use for large-scale transmission-and-distribution (T&D) simulations. This paper presents an efficient T&D dynamic frequency co-simulation framework for DER frequency response based on the HELICS platform and existing off-the-shelf simulators. The challenge of synchronizing frequency between the transmission network and DERs hosted in the distribution network is approached by detailed modeling of DERs in frequency dynamic models while DER phasor models are also preserved in the distribution networks. Thereby, local voltage constraints can be respected when dispatching the DER power for frequency response. The DER frequency responses (primary and secondary)-are simulated in case studies to validate the proposed framework. Lastly, fault-induced delayed voltage recovery (FIDVR) event of a large system is presented to demonstrate the efficiency and effectiveness of the overall framework.
Published: 2021

34. Two-Stage Stochastic Optimization Frameworks to Aid in Decision-Making Under Uncertainty for Variable Resource Generators Participating in a Sequential Energy Market

Author: Al-Lawati, Razan A. H., Crespo-Vazquez, Jose L., Faiz, Tasnim Ibn, Fang, Xin, and Noor-E-Alam, Md.
Subjects: Mathematics - Optimization and Control
Abstract: Decisions for a variable renewable resource generators commitment in the energy market are typically made in advance when little information is obtainable about wind availability and market prices. Much research has been published recommending various frameworks for addressing this issue. However, these frameworks are limited as they do not consider all markets a producer can participate in. Moreover, current stochastic programming models do not allow for uncertainty data to be updated as more accurate information becomes available. This work proposes two decision-making frameworks for a wind energy generator participating in day-ahead, intraday, reserve, and balancing markets. The first framework is a two-stage stochastic convex optimization approach, where both scenario-independent and scenario-dependent decisions are made concurrently. The second framework is a series of four two-stage stochastic optimization models wherein the results from each model feed into each subsequent model allowing for scenarios to be updated as more information becomes available to the decision-maker. In the simulation experiments, the multi-phase framework performs better than the single-phase in every run, and results in an average profit increase of 7%. The proposed optimization frameworks aid in better decision-making while addressing uncertainty related to variable resource generators and maximize the return on investment.
Published: 2020

35. Effective Parallelism for Equation and Jacobian Evaluation in Power Flow Calculation

Author: Cui, Hantao, Li, Fangxing, and Fang, Xin
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: This letter investigates parallelism approaches for equation and Jacobian evaluations in large-scale power flow calculation. Two levels of parallelism are proposed and analyzed: inter-model parallelism, which evaluates models in parallel, and intra-model parallelism, which evaluates calculations within each model in parallel. Parallelism techniques such as multi-threading and single instruction multiple data (SIMD) vectorization are discussed, implemented, and benchmarked as six calculation workflows. Case studies on the 70,000-bus synthetic grid show that equation evaluations can be accelerated by ten times, and the overall Newton power flow advances the state of the art by 20%.
Published: 2020
Full Text: View/download PDF

36. Domain-Embeddings Based DGA Detection with Incremental Training Method

Author: Fang, Xin, Sun, Xiaoqing, Yang, Jiahai, and Liu, Xinran
Subjects: Computer Science - Cryptography and Security
Abstract: DGA-based botnet, which uses Domain Generation Algorithms (DGAs) to evade supervision, has become a part of the most destructive threats to network security. Over the past decades, a wealth of defense mechanisms focusing on domain features have emerged to address the problem. Nonetheless, DGA detection remains a daunting and challenging task due to the big data nature of Internet traffic and the potential fact that the linguistic features extracted only from the domain names are insufficient and the enemies could easily forge them to disturb detection. In this paper, we propose a novel DGA detection system which employs an incremental word-embeddings method to capture the interactions between end hosts and domains, characterize time-series patterns of DNS queries for each IP address and therefore explore temporal similarities between domains. We carefully modify the Word2Vec algorithm and leverage it to automatically learn dynamic and discriminative feature representations for over 1.9 million domains, and develop an simple classifier for distinguishing malicious domains from the benign. Given the ability to identify temporal patterns of domains and update models incrementally, the proposed scheme makes the progress towards adapting to the changing and evolving strategies of DGA domains. Our system is evaluated and compared with the state-of-art system FANCI and two deep-learning methods CNN and LSTM, with data from a large university's network named TUNET. The results suggest that our system outperforms the strong competitors by a large margin on multiple metrics and meanwhile achieves a remarkable speed-up on model updating., Comment: 6 pages, 3 figures, ISCC2020
Published: 2020

37. Exact structures and degeneration of Hall algebras

Author: Fang, Xin and Gorsky, Mikhail
Subjects: Mathematics - Representation Theory, Mathematics - Category Theory, Mathematics - Quantum Algebra, Mathematics - Rings and Algebras
Abstract: We study degenerations of the Hall algebras of exact categories induced by degree functions on the set of isomorphism classes of indecomposable objects. We prove that each such degeneration of the Hall algebra $\mathcal{H}(\mathcal{E})$ of an exact category $\mathcal{E}$ is the Hall algebra of a smaller exact structure $\mathcal{E}' < \mathcal{E}$ on the same additive category $\mathcal{A}.$ When $\mathcal{E}$ is admissible in the sense of Enomoto, for any $\mathcal{E}' < \mathcal{E}$ satisfying suitable finiteness conditions, we prove that $\mathcal{H}(\mathcal{E}')$ is a degeneration of $\mathcal{H}(\mathcal{E})$ of this kind. In the additively finite case, all such degree functions form a simplicial cone whose face lattice reflects properties of the lattice of exact structures. For the categories of representations of Dynkin quivers, we recover degenerations of the negative part of the corresponding quantum group, as well as the associated polyhedral structure studied by Fourier, Reineke and the first author. Along the way, we give minor improvements to certain results of Enomoto and Br\"ustle-Langford-Hassoun-Roy concerning the classification of exact structures on an additive category. We prove that for each idempotent complete additive category $\mathcal{A}$, there exists an abelian category whose lattice of Serre subcategories is isomorphic to the lattice of exact structures on $\mathcal{A}$. We show that every Krull-Schmidt category admits a unique maximal admissible exact structure and that the lattice of smaller exact structures of an admissible exact structure is Boolean., Comment: 32 pages, comments are welcome
Published: 2020

38. Lusztig polytopes and FFLV polytopes

Author: Fang, Xin and Koshevoy, Gleb
Subjects: Mathematics - Representation Theory, Mathematics - Combinatorics
Abstract: In this paper we prove that in type $\tt A_n$, the Feigin-Fourier-Littelmann-Vinberg (FFLV) polytope coincides with the Minkowski sum of Lusztig polytopes arising from various reduced decompositions. Using this result, we formulate a conjecture about the crystal structures on FFLV polytopes., Comment: 22 pages, 12 figures; v2: added Proposition 5.2 and 5.4
Published: 2020

39. Cones from quantum groups to tropical flag varieties

Author: Fang, Xin, Fourier, Ghislain, and Reineke, Markus
Subjects: Mathematics - Quantum Algebra, Mathematics - Algebraic Geometry, Mathematics - Rings and Algebras
Abstract: We relate quantum degree cones, parametrizing PBW degenerations of quantized enveloping algebras, to (negative tight monomial) cones introduced by Lusztig in the study of monomials in canonical bases, to K-theoretic cones for quiver representations, and to some maximal prime cones in tropical flag varieties., Comment: 15 pages, v2: statement of Corollary 1 corrected
Published: 2019

40. SIFO: Secure Computational Infrastructure using FPGA Overlays

Author: Fang, Xin, Ioannidis, Stratis, and Leeser, Miriam
Subjects: Computer Science - Cryptography and Security, Computer Science - Hardware Architecture, Security and privacy: Privacy-preserving protocols, Hardware: Reconfigurable logic applications
Abstract: Secure Function Evaluation (SFE) has received recent attention due to the massive collection and mining of personal data, but remains impractical due to its large computational cost. Garbled Circuits (GC) is a protocol for implementing SFE which can evaluate any function that can be expressed as a Boolean circuit and obtain the result while keeping each party's input private. Recent advances have led to a surge of garbled circuit implementations in software for a variety of different tasks. However, these implementations are inefficient and therefore GC is not widely used, especially for large problems. This research investigates, implements and evaluates secure computation generation using a heterogeneous computing platform featuring FPGAs. We have designed and implemented SIFO: Secure computational Infrastructure using FPGA Overlays. Unlike traditional FPGA design, a coarse grained overlay architecture is adopted which supports mapping SFE problems that are too large to map to a single FPGA. Host tools provided include SFE problem generator, parser and automatic host code generation. Our design allows re-purposing an FPGA to evaluate different SFE tasks without the need for reprogramming, and fully explores the parallelism for any GC problem. Our system demonstrates an order of magnitude speedup compared with an existing software platform., Comment: International Journal of Reconfigurable Computing, to appear
Published: 2019

41. Precise Photon Correlation Measurement of a Chaotic Laser

Author: Guo, Xiaomin, Cheng, Chen, Liu, Tong, Fang, Xin, and Guo, Yanqiang
Subjects: Quantum Physics, Nonlinear Sciences - Chaotic Dynamics, Physics - Optics
Abstract: The second order photon correlation g^(2)(tau) of a chaotic optical-feedback semiconductor laser is precisely measured using a Hanbury Brown-Twiss interferometer. The accurate g^(2)(tau) with non-zero delay time is obtained experimentally from the photon pair time interval distribution through a ninth-order self-convolution correction. The experimental results agree well with the theoretical analysis. The relative error of g^(2)(tau) is no more than 0.005 within 50 ns delay time. The bunching effect and coherence time of the chaotic laser are measured via the precise photon correlation technique. This technique provides a new tool to improve the accuracy of g^(2)(tau) measurement and boost applications of quantum statistics and correlation., Comment: 12 pages, 7 figures
Published: 2019

42. Evaluating entropy rate of laser chaos and shot noise

Author: Guo, Xiaomin, Liu, Tong, Wang, Lijing, Fang, Xin, Zhao, Tong, Virte, Martin, and Guo, Yanqiang
Subjects: Physics - Optics, Nonlinear Sciences - Chaotic Dynamics, Quantum Physics
Abstract: Evaluating entropy rate of high-dimensional chaos and shot noise from analog raw signals remains elusive and important in information security. We experimentally present an accurate assessment of entropy rate for physical process randomness. The entropy generation of optical-feedback laser chaos and physical randomness limit from shot noise are quantified and unambiguously discriminated using the growth rate of average permutation entropy value in memory time. The permutation entropy difference of filtered laser chaos with varying embedding delay time is investigated experimentally and theoretically. High resolution maps of the entropy difference is observed over the range of the injection-feedback parameter space. We also clarify an inverse relationship between the entropy rate and time delay signature of laser chaos over a wide range of parameters. Compared to the original chaos, the time delay signature is suppressed up to 95% with the minimum of 0.015 via frequency-band extractor, and the experiment agrees well with the theory. Our system provides a commendable entropy evaluation and source for physical random number generation., Comment: 7 pages, 9 figures
Published: 2019
Full Text: View/download PDF

43. Space-time Variant Self-growing Bandgap in Nonlinear Acoustic Metamaterial

Author: Fang, Xin, Wen, Jihong, and Yu, Dianlong
Subjects: Physics - Applied Physics
Abstract: Material band structure is key foundation for various modern technologies, but it was regarded as a space-time invariant feature. Acoustic metamaterials show extraordinary properties for processing elastic waves, but conventional realizations suffer from narrow bandgaps. Here we first report a nonlinear acoustic metamaterial whose band structure self-adapts to the propagation distance/time and the bandgap exhibits a self-growing behaviour stemming from giant nonlinear interaction. This space-time self-modulating characteristic highlights an unconventional understanding of the band structure, and the self-growth generates an ultralow and ultrabroad bandgap that breaks through the limitation of the mass law for linear locally resonant bandgaps. We also elucidate the self-adaptive mechanisms. This first demonstration sheds light on conceiving advanced devices and metamaterials with broadband, space-time variant bandgaps for wave self-manipulation.
Published: 2019
Full Text: View/download PDF

44. Channel adversarial training for cross-channel text-independent speaker recognition

Author: Fang, Xin, Zou, Liang, Li, Jin, Sun, Lei, and Ling, Zhen-Hua
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Sound
Abstract: The conventional speaker recognition frameworks (e.g., the i-vector and CNN-based approach) have been successfully applied to various tasks when the channel of the enrolment dataset is similar to that of the test dataset. However, in real-world applications, mismatch always exists between these two datasets, which may severely deteriorate the recognition performance. Previously, a few channel compensation algorithms have been proposed, such as Linear Discriminant Analysis (LDA) and Probabilistic LDA. However, these methods always require the collections of different channels from a specific speaker, which is unrealistic to be satisfied in real scenarios. Inspired by domain adaptation, we propose a novel deep-learning based speaker recognition framework to learn the channel-invariant and speaker-discriminative speech representations via channel adversarial training. Specifically, we first employ a gradient reversal layer to remove variations across different channels. Then, the compressed information is projected into the same subspace by adversarial training. Experiments on test datasets with 54,133 speakers demonstrate that the proposed method is not only effective at alleviating the channel mismatch problem, but also outperforms state-of-the-art speaker recognition methods. Compared with the i-vector-based method and the CNN-based method, our proposed method achieves significant relative improvement of 44.7% and 22.6% respectively in terms of the Top1 recall., Comment: 5 pages, 2 figures, 2 tabels
Published: 2019

45. Linear degenerations of flag varieties: partial flags, defining equations, and group actions

Author: Irelli, Giovanni Cerulli, Fang, Xin, Feigin, Evgeny, Fourier, Ghislain, and Reineke, Markus
Subjects: Mathematics - Algebraic Geometry, Mathematics - Combinatorics, Mathematics - Representation Theory
Abstract: We continue, generalize and expand our study of linear degenerations of flag varieties from [G. Cerulli Irelli, X. Fang, E. Feigin, G. Fourier, M. Reineke, Math. Z. 287 (2017), no. 1-2, 615-654]. We realize partial flag varieties as quiver Grassmannians for equi-oriented type A quivers and construct linear degenerations by varying the corresponding quiver representation. We prove that there exists the deepest flat degeneration and the deepest flat irreducible degeneration: the former is the partial analogue of the mf-degenerate flag variety and the latter coincides with the partial PBW-degenerate flag variety. We compute the generating function of the number of orbits in the flat irreducible locus and study the natural family of line bundles on the degenerations from the flat irreducible locus. We also describe explicitly the reduced scheme structure on these degenerations and conjecture that similar results hold for the whole flat locus. Finally, we prove an analogue of the Borel-Weil theorem for the flat irreducible locus., Comment: 22 pages. arXiv admin note: text overlap with arXiv:1603.08395
Published: 2019

46. Wave propagation in infinite nonlinear acoustic metamaterial beam by considering the third harmonic generation

Author: Fang, Xin, Wen, Jihong, Yu, Dianlong, Huang, Guoliang, and Yin, Jianfei
Subjects: Physics - Applied Physics
Abstract: Nonlinear acoustic metamaterial (NAM) initiates new fields for controlling elastic waves. In this work, the flexural wave propagation in the half-infinite NAM beam consisting of periodic Duffing resonators is reported by considering the third harmonic generation (THG). Different analytical methods are proposed to describe the wave propagation in the equivalent homogenous medium. Then their effectiveness and accuracy are demonstrated in comparison with the finite element methods. We unveil analytically and numerically extensive physical properties of the strongly nonlinear AM, including the nonlinear resonance in a cell, the effective density, nonlinear locally resonant (NLR) bandgap, propagations and couplings of the fundamental and the third harmonics. These characteristics are highly interrelated, which facilitates the prediction of functionalities. In the near field, the identical bifurcation frequency of these features acts as the start frequency of the NLR bandgap for fundamental waves, whose width is narrower for a stronger nonlinearity. While in the far field, the NLR bandgap characterizes a distance-amplitude-dependent behavior leading to a self-adaptive bandwidth. Moreover, the transmission in the passband of the infinite NAM is different from the chaotic band effect of finite NAMs, and it is influenced by the shifted NLR gap. Our work will promote future studies and constructions of NAMs with novel properties.
Published: 2018

47. Degenerate Schubert Varieties in Type A

Author: Chirivi', Rocco, Fang, Xin, and Fourier, Ghislain
Subjects: Mathematics - Representation Theory, Mathematics - Algebraic Geometry, Mathematics - Combinatorics, 17B10, 16S30, 14M15, 06A07
Abstract: We introduce rectangular elements in the symmetric group. In the framework of PBW degenerations, we show that in type A the degenerate Schubert variety associated to a rectangular element is indeed a Schubert variety in a partial flag variety of the same type with larger rank. Moreover, the degenerate Demazure module associated to a rectangular element is isomorphic to the Demazure module for this particular Schubert variety of larger rank. This generalizes previous results by Cerulli Irelli, Lanini and Littelmann for the PBW degenerate flag variety., Comment: minor improvements; 28 pages, 6 figures
Published: 2018

48. The Minkowski Property and Reflexivity of Marked Poset Polytopes

Author: Fang, Xin, Fourier, Ghislain, and Pegel, Christoph
Subjects: Mathematics - Combinatorics, 52B20 (Primary) 06A07 (Secondary)
Abstract: We provide a Minkowski sum decomposition of marked chain-order polytopes into building blocks associated to elementary markings and thus give an explicit minimal set of generators of an associated semi-group algebra. We proceed by characterizing the reflexive polytopes among marked chain-order polytopes as those with the underlying marked poset being ranked., Comment: 17 pages
Published: 2018
Full Text: View/download PDF

49. Bridging coupling bandgaps in nonlinear acoustic metamaterials

Author: Fang, Xin, Wen, Jihong, and Yu, Dianlong
Subjects: Physics - Applied Physics, Condensed Matter - Materials Science, Nonlinear Sciences - Chaotic Dynamics
Abstract: Nonlinear acoustic metamaterials (NAMs) open new freedoms in exploiting novel technologies for wave manipulations. Recently, the desired ultra-low and ultra-broad-band wave suppressions were achieved by the chaotic bands in NAMs [Nature Commun. 8, 1288 (2017)]. This work describes a remote interaction mechanism in NAMs-bridging coupling of nonlinear locally resonant bandgaps. Bridging bandgaps generate chaotic bands and share the negative mass between nonlinear resonators. The bandwidth and the efficiency for the wave reduction in chaotic bands can be manipulated effectively by modulating the frequency distance between the bridging pair. Theoretical analyses on the triatomic model containing two nonlinearly coupled resonances clarify the principle of bridging bandgaps. NAM beams are created to demonstrate this mechanism experimentally by including the bifurcations of periodic solutions. Our study extends the content of NAMs and more nonlinear effects are anticipated based on this mechanism.
Published: 2018
Full Text: View/download PDF

50. Supports for linear degenerations of flag varieties

Author: Fang, Xin and Reineke, Markus
Subjects: Mathematics - Algebraic Geometry, Mathematics - Quantum Algebra, Mathematics - Representation Theory
Abstract: We determine the set of supports for the flat family of linear degenerations of flag varieties in terms of Motzkin combinatorics., Comment: 17 pages
Published: 2018

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Publication Type

Database

82 results on '"Fang, Xin"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources