Author: "Ma Yiwei" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Ma Yiwei"' showing total 626 results

Start Over Author "Ma Yiwei"

626 results on '"Ma Yiwei"'

1. Any-to-3D Generation via Hybrid Diffusion Supervision

Author: Fan, Yijun, Ma, Yiwei, Ji, Jiayi, Sun, Xiaoshuai, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent progress in 3D object generation has been fueled by the strong priors offered by diffusion models. However, existing models are tailored to specific tasks, accommodating only one modality at a time and necessitating retraining to change modalities. Given an image-to-3D model and a text prompt, a naive approach is to convert text prompts to images and then use the image-to-3D model for generation. This approach is both time-consuming and labor-intensive, resulting in unavoidable information loss during modality conversion. To address this, we introduce XBind, a unified framework for any-to-3D generation using cross-modal pre-alignment techniques. XBind integrates an multimodal-aligned encoder with pre-trained diffusion models to generate 3D objects from any modalities, including text, images, and audio. We subsequently present a novel loss function, termed Modality Similarity (MS) Loss, which aligns the embeddings of the modality prompts and the rendered images, facilitating improved alignment of the 3D objects with multiple modalities. Additionally, Hybrid Diffusion Supervision combined with a Three-Phase Optimization process improves the quality of the generated 3D objects. Extensive experiments showcase XBind's broad generation capabilities in any-to-3D scenarios. To our knowledge, this is the first method to generate 3D objects from any modality prompts. Project page: https://zeroooooooow1440.github.io/.
Published: 2024

2. I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing

Author: Ma, Yiwei, Ji, Jiayi, Ye, Ke, Lin, Weihuang, Wang, Zhibin, Zheng, Yonghan, Zhou, Qiang, Sun, Xiaoshuai, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Significant progress has been made in the field of Instruction-based Image Editing (IIE). However, evaluating these models poses a significant challenge. A crucial requirement in this field is the establishment of a comprehensive evaluation benchmark for accurately assessing editing results and providing valuable insights for its further development. In response to this need, we propose I2EBench, a comprehensive benchmark designed to automatically evaluate the quality of edited images produced by IIE models from multiple dimensions. I2EBench consists of 2,000+ images for editing, along with 4,000+ corresponding original and diverse instructions. It offers three distinctive characteristics: 1) Comprehensive Evaluation Dimensions: I2EBench comprises 16 evaluation dimensions that cover both high-level and low-level aspects, providing a comprehensive assessment of each IIE model. 2) Human Perception Alignment: To ensure the alignment of our benchmark with human perception, we conducted an extensive user study for each evaluation dimension. 3) Valuable Research Insights: By analyzing the advantages and disadvantages of existing IIE models across the 16 dimensions, we offer valuable research insights to guide future development in the field. We will open-source I2EBench, including all instructions, input images, human annotations, edited images from all evaluated methods, and a simple script for evaluating the results from new IIE models. The code, dataset and generated images from all IIE models are provided in github: https://github.com/cocoshe/I2EBench., Comment: NeurIPS2024, 15 pages, 7 figures
Published: 2024

3. 3D-GRES: Generalized 3D Referring Expression Segmentation

Author: Wu, Changli, Liu, Yihang, Ji, Jiayi, Ma, Yiwei, Wang, Haowei, Luo, Gen, Ding, Henghui, Sun, Xiaoshuai, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: 3D Referring Expression Segmentation (3D-RES) is dedicated to segmenting a specific instance within a 3D space based on a natural language description. However, current approaches are limited to segmenting a single target, restricting the versatility of the task. To overcome this limitation, we introduce Generalized 3D Referring Expression Segmentation (3D-GRES), which extends the capability to segment any number of instances based on natural language instructions. In addressing this broader task, we propose the Multi-Query Decoupled Interaction Network (MDIN), designed to break down multi-object segmentation tasks into simpler, individual segmentations. MDIN comprises two fundamental components: Text-driven Sparse Queries (TSQ) and Multi-object Decoupling Optimization (MDO). TSQ generates sparse point cloud features distributed over key targets as the initialization for queries. Meanwhile, MDO is tasked with assigning each target in multi-object scenarios to different queries while maintaining their semantic consistency. To adapt to this new task, we build a new dataset, namely Multi3DRes. Our comprehensive evaluations on this dataset demonstrate substantial enhancements over existing models, thus charting a new path for intricate multi-object 3D scene comprehension. The benchmark and code are available at https://github.com/sosppxo/MDIN., Comment: Accepted by ACM MM 2024 (Oral), Code: https://github.com/sosppxo/MDIN
Published: 2024

4. INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model

Author: Ma, Yiwei, Wang, Zhibin, Sun, Xiaoshuai, Lin, Weihuang, Zhou, Qiang, Ji, Jiayi, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: With advancements in data availability and computing resources, Multimodal Large Language Models (MLLMs) have showcased capabilities across various fields. However, the quadratic complexity of the vision encoder in MLLMs constrains the resolution of input images. Most current approaches mitigate this issue by cropping high-resolution images into smaller sub-images, which are then processed independently by the vision encoder. Despite capturing sufficient local details, these sub-images lack global context and fail to interact with one another. To address this limitation, we propose a novel MLLM, INF-LLaVA, designed for effective high-resolution image perception. INF-LLaVA incorporates two innovative components. First, we introduce a Dual-perspective Cropping Module (DCM), which ensures that each sub-image contains continuous details from a local perspective and comprehensive information from a global perspective. Second, we introduce Dual-perspective Enhancement Module (DEM) to enable the mutual enhancement of global and local features, allowing INF-LLaVA to effectively process high-resolution images by simultaneously capturing detailed local information and comprehensive global context. Extensive ablation studies validate the effectiveness of these components, and experiments on a diverse set of benchmarks demonstrate that INF-LLaVA outperforms existing MLLMs. Code and pretrained model are available at https://github.com/WeihuangLin/INF-LLaVA.
Published: 2024

5. Multi-branch Collaborative Learning Network for 3D Visual Grounding

Author: Qian, Zhipeng, Ma, Yiwei, Lin, Zhekai, Ji, Jiayi, Zheng, Xiawu, Sun, Xiaoshuai, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: 3D referring expression comprehension (3DREC) and segmentation (3DRES) have overlapping objectives, indicating their potential for collaboration. However, existing collaborative approaches predominantly depend on the results of one task to make predictions for the other, limiting effective collaboration. We argue that employing separate branches for 3DREC and 3DRES tasks enhances the model's capacity to learn specific information for each task, enabling them to acquire complementary knowledge. Thus, we propose the MCLN framework, which includes independent branches for 3DREC and 3DRES tasks. This enables dedicated exploration of each task and effective coordination between the branches. Furthermore, to facilitate mutual reinforcement between these branches, we introduce a Relative Superpoint Aggregation (RSA) module and an Adaptive Soft Alignment (ASA) module. These modules significantly contribute to the precise alignment of prediction results from the two branches, directing the module to allocate increased attention to key positions. Comprehensive experimental evaluation demonstrates that our proposed method achieves state-of-the-art performance on both the 3DREC and 3DRES tasks, with an increase of 2.05% in Acc@0.5 for 3DREC and 3.96% in mIoU for 3DRES., Comment: ECCV2024
Published: 2024

6. Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model

Author: Yang, Danni, Dong, Ruohan, Ji, Jiayi, Ma, Yiwei, Wang, Haowei, Sun, Xiaoshuai, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Multimedia
Abstract: Recently, diffusion models have increasingly demonstrated their capabilities in vision understanding. By leveraging prompt-based learning to construct sentences, these models have shown proficiency in classification and visual grounding tasks. However, existing approaches primarily showcase their ability to perform sentence-level localization, leaving the potential for leveraging contextual information for phrase-level understanding largely unexplored. In this paper, we utilize Panoptic Narrative Grounding (PNG) as a proxy task to investigate this capability further. PNG aims to segment object instances mentioned by multiple noun phrases within a given narrative text. Specifically, we introduce the DiffPNG framework, a straightforward yet effective approach that fully capitalizes on the diffusion's architecture for segmentation by decomposing the process into a sequence of localization, segmentation, and refinement steps. The framework initially identifies anchor points using cross-attention mechanisms and subsequently performs segmentation with self-attention to achieve zero-shot PNG. Moreover, we introduce a refinement module based on SAM to enhance the quality of the segmentation masks. Our extensive experiments on the PNG dataset demonstrate that DiffPNG achieves strong performance in the zero-shot PNG task setting, conclusively proving the diffusion model's capability for context-aware, phrase-level understanding. Source code is available at \url{https://github.com/nini0919/DiffPNG}., Comment: Accepted by ECCV2024
Published: 2024

7. AnyTrans: Translate AnyText in the Image with Large Scale Models

Author: Qian, Zhipeng, Zhang, Pei, Yang, Baosong, Fan, Kai, Ma, Yiwei, Wong, Derek F., Sun, Xiaoshuai, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: This paper introduces AnyTrans, an all-encompassing framework for the task-Translate AnyText in the Image (TATI), which includes multilingual text translation and text fusion within images. Our framework leverages the strengths of large-scale models, such as Large Language Models (LLMs) and text-guided diffusion models, to incorporate contextual cues from both textual and visual elements during translation. The few-shot learning capability of LLMs allows for the translation of fragmented texts by considering the overall context. Meanwhile, the advanced inpainting and editing abilities of diffusion models make it possible to fuse translated text seamlessly into the original image while preserving its style and realism. Additionally, our framework can be constructed entirely using open-source models and requires no training, making it highly accessible and easily expandable. To encourage advancement in the TATI task, we have meticulously compiled a test dataset called MTIT6, which consists of multilingual text image translation data from six language pairs.
Published: 2024

8. Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval

Author: Ma, Yiwei, Sun, Xiaoshuai, Ji, Jiayi, Jiang, Guannan, Zhuang, Weilin, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Text-based person retrieval (TPR) is a challenging task that involves retrieving a specific individual based on a textual description. Despite considerable efforts to bridge the gap between vision and language, the significant differences between these modalities continue to pose a challenge. Previous methods have attempted to align text and image samples in a modal-shared space, but they face uncertainties in optimization directions due to the movable features of both modalities and the failure to account for one-to-many relationships of image-text pairs in TPR datasets. To address this issue, we propose an effective bi-directional one-to-many embedding paradigm that offers a clear optimization direction for each sample, thus mitigating the optimization problem. Additionally, this embedding scheme generates multiple features for each sample without introducing trainable parameters, making it easier to align with several positive samples. Based on this paradigm, we propose a novel Bi-directional one-to-many Embedding Alignment (Beat) model to address the TPR task. Our experimental results demonstrate that the proposed Beat model achieves state-of-the-art performance on three popular TPR datasets, including CUHK-PEDES (65.61 R@1), ICFG-PEDES (58.25 R@1), and RSTPReID (48.10 R@1). Furthermore, additional experiments on MS-COCO, CUB, and Flowers datasets further demonstrate the potential of Beat to be applied to other image-text retrieval tasks., Comment: ACM MM2023
Published: 2024

9. SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation

Author: Yang, Danni, Ji, Jiayi, Ma, Yiwei, Guo, Tianyu, Wang, Haowei, Sun, Xiaoshuai, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Multimedia
Abstract: In this paper, we introduce SemiRES, a semi-supervised framework that effectively leverages a combination of labeled and unlabeled data to perform RES. A significant hurdle in applying semi-supervised techniques to RES is the prevalence of noisy pseudo-labels, particularly at the boundaries of objects. SemiRES incorporates the Segment Anything Model (SAM), renowned for its precise boundary demarcation, to improve the accuracy of these pseudo-labels. Within SemiRES, we offer two alternative matching strategies: IoU-based Optimal Matching (IOM) and Composite Parts Integration (CPI). These strategies are designed to extract the most accurate masks from SAM's output, thus guiding the training of the student model with enhanced precision. In instances where a precise mask cannot be matched from the available candidates, we develop the Pixel-Wise Adjustment (PWA) strategy, guiding the student model's training directly by the pseudo-labels. Extensive experiments on three RES benchmarks--RefCOCO, RefCOCO+, and G-Ref reveal its superior performance compared to fully supervised methods. Remarkably, with only 1% labeled data, our SemiRES outperforms the supervised baseline by a large margin, e.g. +18.64% gains on RefCOCO val set. The project code is available at \url{https://github.com/nini0919/SemiRES}., Comment: Accepted by ICML2024
Published: 2024

10. Image Captioning via Dynamic Path Customization

Author: Ma, Yiwei, Ji, Jiayi, Sun, Xiaoshuai, Zhou, Yiyi, Hong, Xiaopeng, Wu, Yongjian, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper explores a novel dynamic network for vision and language tasks, where the inferring structure is customized on the fly for different inputs. Most previous state-of-the-art approaches are static and hand-crafted networks, which not only heavily rely on expert knowledge, but also ignore the semantic diversity of input samples, therefore resulting in suboptimal performance. To address these issues, we propose a novel Dynamic Transformer Network (DTNet) for image captioning, which dynamically assigns customized paths to different samples, leading to discriminative yet accurate captions. Specifically, to build a rich routing space and improve routing efficiency, we introduce five types of basic cells and group them into two separate routing spaces according to their operating domains, i.e., spatial and channel. Then, we design a Spatial-Channel Joint Router (SCJR), which endows the model with the capability of path customization based on both spatial and channel information of the input sample. To validate the effectiveness of our proposed DTNet, we conduct extensive experiments on the MS-COCO dataset and achieve new state-of-the-art performance on both the Karpathy split and the online test server., Comment: TNNLS24
Published: 2024

11. X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation

Author: Ma, Yiwei, Lin, Zhekai, Ji, Jiayi, Fan, Yijun, Sun, Xiaoshuai, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent advancements in automatic 3D avatar generation guided by text have made significant progress. However, existing methods have limitations such as oversaturation and low-quality output. To address these challenges, we propose X-Oscar, a progressive framework for generating high-quality animatable avatars from text prompts. It follows a sequential Geometry->Texture->Animation paradigm, simplifying optimization through step-by-step generation. To tackle oversaturation, we introduce Adaptive Variational Parameter (AVP), representing avatars as an adaptive distribution during training. Additionally, we present Avatar-aware Score Distillation Sampling (ASDS), a novel technique that incorporates avatar-aware noise into rendered images for improved generation quality during optimization. Extensive evaluations confirm the superiority of X-Oscar over existing text-to-3D and text-to-avatar approaches. Our anonymous project page: https://xmu-xiaoma666.github.io/Projects/X-Oscar/., Comment: ICML2024
Published: 2024

12. Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation

Author: Liu, Sihan, Ma, Yiwei, Zhang, Xiaoqing, Wang, Haowei, Ji, Jiayi, Sun, Xiaoshuai, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Referring Remote Sensing Image Segmentation (RRSIS) is a new challenge that combines computer vision and natural language processing, delineating specific regions in aerial images as described by textual queries. Traditional Referring Image Segmentation (RIS) approaches have been impeded by the complex spatial scales and orientations found in aerial imagery, leading to suboptimal segmentation results. To address these challenges, we introduce the Rotated Multi-Scale Interaction Network (RMSIN), an innovative approach designed for the unique demands of RRSIS. RMSIN incorporates an Intra-scale Interaction Module (IIM) to effectively address the fine-grained detail required at multiple scales and a Cross-scale Interaction Module (CIM) for integrating these details coherently across the network. Furthermore, RMSIN employs an Adaptive Rotated Convolution (ARC) to account for the diverse orientations of objects, a novel contribution that significantly enhances segmentation accuracy. To assess the efficacy of RMSIN, we have curated an expansive dataset comprising 17,402 image-caption-mask triplets, which is unparalleled in terms of scale and variety. This dataset not only presents the model with a wide range of spatial and rotational scenarios but also establishes a stringent benchmark for the RRSIS task, ensuring a rigorous evaluation of performance. Our experimental evaluations demonstrate the exceptional performance of RMSIN, surpassing existing state-of-the-art models by a significant margin. All datasets and code are made available at https://github.com/Lsan2401/RMSIN., Comment: Accepted by CVPR 2024
Published: 2023

13. Multi-branch Collaborative Learning Network for 3D Visual Grounding

Author: Qian, Zhipeng, Ma, Yiwei, Lin, Zhekai, Ji, Jiayi, Zheng, Xiawu, Sun, Xiaoshuai, Ji, Rongrong, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Leonardis, Aleš, editor, Ricci, Elisa, editor, Roth, Stefan, editor, Russakovsky, Olga, editor, Sattler, Torsten, editor, and Varol, Gül, editor
Published: 2025
Full Text: View/download PDF

14. X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation

Author: Ma, Yiwei, Fan, Yijun, Ji, Jiayi, Wang, Haowei, Sun, Xiaoshuai, Jiang, Guannan, Shu, Annan, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In recent times, automatic text-to-3D content creation has made significant progress, driven by the development of pretrained 2D diffusion models. Existing text-to-3D methods typically optimize the 3D representation to ensure that the rendered image aligns well with the given text, as evaluated by the pretrained 2D diffusion model. Nevertheless, a substantial domain gap exists between 2D images and 3D assets, primarily attributed to variations in camera-related attributes and the exclusive presence of foreground objects. Consequently, employing 2D diffusion models directly for optimizing 3D representations may lead to suboptimal outcomes. To address this issue, we present X-Dreamer, a novel approach for high-quality text-to-3D content creation that effectively bridges the gap between text-to-2D and text-to-3D synthesis. The key components of X-Dreamer are two innovative designs: Camera-Guided Low-Rank Adaptation (CG-LoRA) and Attention-Mask Alignment (AMA) Loss. CG-LoRA dynamically incorporates camera information into the pretrained diffusion models by employing camera-dependent generation for trainable parameters. This integration enhances the alignment between the generated 3D assets and the camera's perspective. AMA loss guides the attention map of the pretrained diffusion model using the binary mask of the 3D object, prioritizing the creation of the foreground object. This module ensures that the model focuses on generating accurate and detailed foreground objects. Extensive evaluations demonstrate the effectiveness of our proposed method compared to existing text-to-3D approaches. Our project webpage: https://xmu-xiaoma666.github.io/Projects/X-Dreamer/ ., Comment: ToMM24
Published: 2023

15. Semi-Supervised Panoptic Narrative Grounding

Author: Yang, Danni, Ji, Jiayi, Sun, Xiaoshuai, Wang, Haowei, Li, Yinan, Ma, Yiwei, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Despite considerable progress, the advancement of Panoptic Narrative Grounding (PNG) remains hindered by costly annotations. In this paper, we introduce a novel Semi-Supervised Panoptic Narrative Grounding (SS-PNG) learning scheme, capitalizing on a smaller set of labeled image-text pairs and a larger set of unlabeled pairs to achieve competitive performance. Unlike visual segmentation tasks, PNG involves one pixel belonging to multiple open-ended nouns. As a result, existing multi-class based semi-supervised segmentation frameworks cannot be directly applied to this task. To address this challenge, we first develop a novel SS-PNG Network (SS-PNG-NW) tailored to the SS-PNG setting. We thoroughly investigate strategies such as Burn-In and data augmentation to determine the optimal generic configuration for the SS-PNG-NW. Additionally, to tackle the issue of imbalanced pseudo-label quality, we propose a Quality-Based Loss Adjustment (QLA) approach to adjust the semi-supervised objective, resulting in an enhanced SS-PNG-NW+. Employing our proposed QLA, we improve BCE Loss and Dice loss at pixel and mask levels, respectively. We conduct extensive experiments on PNG datasets, with our SS-PNG-NW+ demonstrating promising results comparable to fully-supervised models across all data ratios. Remarkably, our SS-PNG-NW+ outperforms fully-supervised models with only 30% and 50% supervision data, exceeding their performance by 0.8% and 1.1% respectively. This highlights the effectiveness of our proposed SS-PNG-NW+ in overcoming the challenges posed by limited annotations and enhancing the applicability of PNG tasks. The source code is available at https://github.com/nini0919/SSPNG., Comment: ACM MM 2023
Published: 2023
Full Text: View/download PDF

16. JM3D & JM3D-LLM: Elevating 3D Understanding with Joint Multi-modal Cues

Author: Ji, Jiayi, Wang, Haowei, Wu, Changli, Ma, Yiwei, Sun, Xiaoshuai, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The rising importance of 3D understanding, pivotal in computer vision, autonomous driving, and robotics, is evident. However, a prevailing trend, which straightforwardly resorted to transferring 2D alignment strategies to the 3D domain, encounters three distinct challenges: (1) Information Degradation: This arises from the alignment of 3D data with mere single-view 2D images and generic texts, neglecting the need for multi-view images and detailed subcategory texts. (2) Insufficient Synergy: These strategies align 3D representations to image and text features individually, hampering the overall optimization for 3D models. (3) Underutilization: The fine-grained information inherent in the learned representations is often not fully exploited, indicating a potential loss in detail. To address these issues, we introduce JM3D, a comprehensive approach integrating point cloud, text, and image. Key contributions include the Structured Multimodal Organizer (SMO), enriching vision-language representation with multiple views and hierarchical text, and the Joint Multi-modal Alignment (JMA), combining language understanding with visual representation. Our advanced model, JM3D-LLM, marries 3D representation with large language models via efficient fine-tuning. Evaluations on ModelNet40 and ScanObjectNN establish JM3D's superiority. The superior performance of JM3D-LLM further underscores the effectiveness of our representation transfer approach. Our code and models are available at https://github.com/Mr-Neko/JM3D., Comment: 16 pages, 4 figures, 10 tables, 3D understanding
Published: 2023

17. Photovoltaic MPPT Algorithm Based on Hybrid Boost Converter and Variable Step Size Incremental Conductance

Author: Ma Yiwei, Wang Fuxing, Huang Zongsheng, Feng Qin, and Piao Changhao
Subjects: Environmental sciences, GE1-350
Abstract: Aiming at the problem of low voltage gain of traditional boost converter and the incompatibility of tracking speed and tracking accuracy with the traditional incremental conductance algorithm (INC), this paper uses the hybrid boost converter as the DC/DC converter of photovoltaic system, and designs the variable step size INC algorithm control strategy to achieve Maximum power point tracking (MPPT) of photovoltaic. Simulink simulation model verifies the feasibility of the proposed algorithm, which effectively improves the output voltage and power generation efficiency of the photovoltaic system.
Published: 2021
Full Text: View/download PDF

18. 3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation

Author: Wu, Changli, Ma, Yiwei, Chen, Qi, Wang, Haowei, Luo, Gen, Ji, Jiayi, and Sun, Xiaoshuai
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In 3D Referring Expression Segmentation (3D-RES), the earlier approach adopts a two-stage paradigm, extracting segmentation proposals and then matching them with referring expressions. However, this conventional paradigm encounters significant challenges, most notably in terms of the generation of lackluster initial proposals and a pronounced deceleration in inference speed. Recognizing these limitations, we introduce an innovative end-to-end Superpoint-Text Matching Network (3D-STMN) that is enriched by dependency-driven insights. One of the keystones of our model is the Superpoint-Text Matching (STM) mechanism. Unlike traditional methods that navigate through instance proposals, STM directly correlates linguistic indications with their respective superpoints, clusters of semantically related points. This architectural decision empowers our model to efficiently harness cross-modal semantic relationships, primarily leveraging densely annotated superpoint-text pairs, as opposed to the more sparse instance-text pairs. In pursuit of enhancing the role of text in guiding the segmentation process, we further incorporate the Dependency-Driven Interaction (DDI) module to deepen the network's semantic comprehension of referring expressions. Using the dependency trees as a beacon, this module discerns the intricate relationships between primary terms and their associated descriptors in expressions, thereby elevating both the localization and segmentation capacities of our model. Comprehensive experiments on the ScanRefer benchmark reveal that our model not only set new performance standards, registering an mIoU gain of 11.7 points but also achieve a staggering enhancement in inference speed, surpassing traditional methods by 95.7 times. The code and models are available at https://github.com/sosppxo/3D-STMN.
Published: 2023

19. Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation

Author: Wang, Haowei, Tang, Jiji, Ji, Jiayi, Sun, Xiaoshuai, Zhang, Rongsheng, Ma, Yiwei, Zhao, Minda, Li, Lincheng, zhao, zeng, Lv, Tangjie, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In recent years, 3D understanding has turned to 2D vision-language pre-trained models to overcome data scarcity challenges. However, existing methods simply transfer 2D alignment strategies, aligning 3D representations with single-view 2D images and coarse-grained parent category text. These approaches introduce information degradation and insufficient synergy issues, leading to performance loss. Information degradation arises from overlooking the fact that a 3D representation should be equivalent to a series of multi-view images and more fine-grained subcategory text. Insufficient synergy neglects the idea that a robust 3D representation should align with the joint vision-language space, rather than independently aligning with each modality. In this paper, we propose a multi-view joint modality modeling approach, termed JM3D, to obtain a unified representation for point cloud, text, and image. Specifically, a novel Structured Multimodal Organizer (SMO) is proposed to address the information degradation issue, which introduces contiguous multi-view images and hierarchical text to enrich the representation of vision and language modalities. A Joint Multi-modal Alignment (JMA) is designed to tackle the insufficient synergy problem, which models the joint modality by incorporating language knowledge into the visual modality. Extensive experiments on ModelNet40 and ScanObjectNN demonstrate the effectiveness of our proposed method, JM3D, which achieves state-of-the-art performance in zero-shot 3D classification. JM3D outperforms ULIP by approximately 4.3% on PointMLP and achieves an improvement of up to 6.5% accuracy on PointNet++ in top-1 accuracy for zero-shot 3D classification on ModelNet40. The source code and trained models for all our experiments are publicly available at https://github.com/Mr-Neko/JM3D., Comment: ACM MM 2023, 3D Understanding, JM3D
Published: 2023
Full Text: View/download PDF

20. X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance

Author: Ma, Yiwei, Zhang, Xiaioqing, Sun, Xiaoshuai, Ji, Jiayi, Wang, Haowei, Jiang, Guannan, Zhuang, Weilin, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Text-driven 3D stylization is a complex and crucial task in the fields of computer vision (CV) and computer graphics (CG), aimed at transforming a bare mesh to fit a target text. Prior methods adopt text-independent multilayer perceptrons (MLPs) to predict the attributes of the target mesh with the supervision of CLIP loss. However, such text-independent architecture lacks textual guidance during predicting attributes, thus leading to unsatisfactory stylization and slow convergence. To address these limitations, we present X-Mesh, an innovative text-driven 3D stylization framework that incorporates a novel Text-guided Dynamic Attention Module (TDAM). The TDAM dynamically integrates the guidance of the target text by utilizing text-relevant spatial and channel-wise attentions during vertex feature extraction, resulting in more accurate attribute prediction and faster convergence speed. Furthermore, existing works lack standard benchmarks and automated metrics for evaluation, often relying on subjective and non-reproducible user studies to assess the quality of stylized 3D assets. To overcome this limitation, we introduce a new standard text-mesh benchmark, namely MIT-30, and two automated metrics, which will enable future research to achieve fair and objective comparisons. Our extensive qualitative and quantitative experiments demonstrate that X-Mesh outperforms previous state-of-the-art methods., Comment: 12 pages, 7 figures, ICCV2023
Published: 2023

21. Optimal dispatch of hybrid energy islanded microgrid considering V2G under TOU tariffs

Author: Ma Yiwei, Chen Yuyang, Chen Xin, Deng Fuchun, and Song Xiantong
Subjects: Environmental sciences, GE1-350
Abstract: In order to achieve a prospective economic effect of renewable energy generations and vehicle to grid (V2G), this paper proposes an optimal dispatch method of wind-PV-battery microgrid considering V2G under time-of-use (TOU) tariffs for those isolated communities in remote islands and mountainous areas. A cooperative dispatch strategy and an optimal dispatch model are both presented for the total operation cost minimization and higher utilization of renewable energy generation. Finally, the simulation results show that the proposed method is effective and feasible.
Published: 2019
Full Text: View/download PDF

22. Towards Local Visual Modeling for Image Captioning

Author: Ma, Yiwei, Ji, Jiayi, Sun, Xiaoshuai, Zhou, Yiyi, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Multimedia
Abstract: In this paper, we study the local visual modeling with grid features for image captioning, which is critical for generating accurate and detailed captions. To achieve this target, we propose a Locality-Sensitive Transformer Network (LSTNet) with two novel designs, namely Locality-Sensitive Attention (LSA) and Locality-Sensitive Fusion (LSF). LSA is deployed for the intra-layer interaction in Transformer via modeling the relationship between each grid and its neighbors. It reduces the difficulty of local object recognition during captioning. LSF is used for inter-layer information fusion, which aggregates the information of different encoder layers for cross-layer semantical complementarity. With these two novel designs, the proposed LSTNet can model the local visual information of grid features to improve the captioning quality. To validate LSTNet, we conduct extensive experiments on the competitive MS-COCO benchmark. The experimental results show that LSTNet is not only capable of local visual modeling, but also outperforms a bunch of state-of-the-art captioning models on offline and online testings, i.e., 134.8 CIDEr and 136.3 CIDEr, respectively. Besides, the generalization of LSTNet is also verified on the Flickr8k and Flickr30k datasets, Comment: Preprint
Published: 2023

23. Multi-source Cooperative Scheduling Strategy for Electric Vehicles Integrated into Microgrid Under TOU

Author: Ma, Yiwei, Luo, Genhong, Huang, Botao, Chen, Changjin, Ma, Weixing, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Tan, Kay Chen, Series Editor, Yang, Qingxin, editor, Li, Zewen, editor, and Luo, An, editor
Published: 2024
Full Text: View/download PDF

24. Fuzzy Energy Management Strategy for Battery Electric Vehicles Considering Driving Style Recognition

Author: Ma, Yiwei, Huang, Botao, Piao, Changhao, Luo, Genhong, Ma, Weixing, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Tan, Kay Chen, Series Editor, Yang, Qingxin, editor, Li, Zewen, editor, and Luo, An, editor
Published: 2024
Full Text: View/download PDF

25. Continuous Production of High-Concentration Nitrated Water with Catalytic Concentrated High-Intensity Electric Field Process at Ambient Conditions

Author: Lv, Yuancai, Chen, Ling, Zhou, Nan, Dai, Leilei, Cheng, Yanling, Ma, Yiwei, Liu, Juer, Cobb, Kirk, Chen, Paul, and Ruan, Roger
Published: 2024
Full Text: View/download PDF

26. Structural optimization design of machine tools based on parallel artificial neural networks and genetic algorithms

Author: Ma, Yiwei, Tian, Yanling, and Liu, Xianping
Published: 2023
Full Text: View/download PDF

27. X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval

Author: Ma, Yiwei, Xu, Guohai, Sun, Xiaoshuai, Yan, Ming, Zhang, Ji, and Ji, Rongrong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Video-text retrieval has been a crucial and fundamental task in multi-modal research. The development of video-text retrieval has been considerably promoted by large-scale multi-modal contrastive pre-training, which primarily focuses on coarse-grained or fine-grained contrast. However, cross-grained contrast, which is the contrast between coarse-grained representations and fine-grained representations, has rarely been explored in prior research. Compared with fine-grained or coarse-grained contrasts, cross-grained contrast calculate the correlation between coarse-grained features and each fine-grained feature, and is able to filter out the unnecessary fine-grained features guided by the coarse-grained feature during similarity calculation, thus improving the accuracy of retrieval. To this end, this paper presents a novel multi-grained contrastive model, namely X-CLIP, for video-text retrieval. However, another challenge lies in the similarity aggregation problem, which aims to aggregate fine-grained and cross-grained similarity matrices to instance-level similarity. To address this challenge, we propose the Attention Over Similarity Matrix (AOSM) module to make the model focus on the contrast between essential frames and words, thus lowering the impact of unnecessary frames and words on retrieval results. With multi-grained contrast and the proposed AOSM module, X-CLIP achieves outstanding performance on five widely-used video-text retrieval datasets, including MSR-VTT (49.3 R@1), MSVD (50.4 R@1), LSMDC (26.1 R@1), DiDeMo (47.8 R@1) and ActivityNet (46.2 R@1). It outperforms the previous state-of-theart by +6.3%, +6.6%, +11.1%, +6.7%, +3.8% relative improvements on these benchmarks, demonstrating the superiority of multi-grained contrast and AOSM., Comment: 13 pages, 6 figures, ACMMM22
Published: 2022

28. Optimization of liquid film parameters of indirect evaporative cooling system based on machine vision

Author: You, Yuwen, Chen, Yan, Yang, Bin, Guo, Chunmei, Gao, Rong, and Ma, Yiwei
Published: 2024
Full Text: View/download PDF

29. All-Fiber Fabry-Perot Micro-cavity temperature-insensitive sensor for strain sensing based on T-shaped structure

Author: Su, Chunbo, Gao, Jun, You, Yuqi, Feng, Yong, Ma, Yiwei, and Geng, Tao
Published: 2024
Full Text: View/download PDF

30. Chemical composition, pharmacological effects, and parasitic mechanisms of Cistanche deserticola: An update

Author: Zhang, Shengai, Ma, Yiwei, Chen, Jia, Yu, Mingli, Zhao, Qinghua, Jing, Bo, Yang, Na, Ma, Xinyu, and Wang, Yuyan
Published: 2024
Full Text: View/download PDF

31. A temperature sensor based on multi-beam capture and interference

Author: Tian, Tian, Han, Jinyang, Liang, Ku, Li, Song, Ma, Yiwei, Geng, Tao, and Yuan, Libo
Published: 2024
Full Text: View/download PDF

32. A Fabry-Perot interferometer based on probe-embedded bubble for ultrasensitive strain measurement

Author: Tian, Tian, Liang, Ku, Ma, Yiwei, and Geng, Tao
Published: 2025
Full Text: View/download PDF

33. A SNS fiber structure with spot-coated thermoplastic microspheres for the simultaneous measurement of refractive index and temperature

Author: Tian, Tian, Ma, Yiwei, Li, Yuanyuan, Li, Min, Mu, Zonghao, and Geng, Tao
Published: 2024
Full Text: View/download PDF

34. Compact refractive index sensor based on offset splicing long-period fiber grating

Author: Tian, Tian, Li, Yuanyuan, Han, Jinyang, Ma, Yiwei, Li, Song, Sun, Weimin, and Geng, Tao
Published: 2024
Full Text: View/download PDF

35. Semi-analytical Stiffness Model of Bolted Joints in Machine Tools Considering the Coupling Effect

Author: Ma, Yiwei, Fu, Yutao, Tian, Yanling, and Liu, Xianping
Published: 2023
Full Text: View/download PDF

36. Dynamic modeling and analysis of the 3-PRS power head based on the screw theory and rigid multipoint constraints

Author: Ma, YiWei, Tian, YanLing, Liu, XianPing, and Lu, ChengHao
Published: 2023
Full Text: View/download PDF

37. Highly sensitive magnetostrictive sensor with well-sealed and sensitivity tunability

Author: Su, Chunbo, Liu, Xuanting, You, Yuqi, Ma, Yiwei, and Geng, Tao
Published: 2024
Full Text: View/download PDF

38. Research and Analysis of Learning Factors Based on the Foundation of Computer Big Data

Author: Ma, Yiwei, Striełkowski, Wadim, Editor-in-Chief, Peng, Chew Fong, editor, Asmawi, Adelina, editor, and Zhao, Chuanjun, editor
Published: 2023
Full Text: View/download PDF

39. Observation Noise Covariance Matrix Initialization-Based Objective State Estimation for Kalman Filter Using SVR

Author: Chen, Junhu, Liu, Mingjie, Ren, Fan, Yuan, Peng, Chen, Jianbin, Ma, Yiwei, Liu, Tai, Piao, Changhao, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Zhang, Junjie James, Series Editor, Park, Ji Su, editor, Yang, Laurence T., editor, Pan, Yi, editor, and Park, Jong Hyuk, editor
Published: 2023
Full Text: View/download PDF

40. A New Control Strategy of Grid Connected Inverter Based on SOGI in Asymmetric Fault

Author: Ma, Yiwei, Feng, Qin, Huang, Botao, Wang, Peng, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Hirche, Sandra, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Möller, Sebastian, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Zhang, Junjie James, Series Editor, Yang, Qingxin, editor, Li, Jian, editor, Xie, Kaigui, editor, and Hu, Jianlin, editor
Published: 2023
Full Text: View/download PDF

41. A highly sensitive torsion sensor based on upright-inverted alternating triangular prism structure on the single-multi-single mode fiber

Author: Zhu, Qianfei, Su, Chunbo, Ma, Yiwei, Mu, Zonghao, Zhu, Yongtian, Sun, Weimin, and Geng, Tao
Published: 2024
Full Text: View/download PDF

42. A highly sensitive strain fiber sensor based on waved core structure

Author: Dai, Zizhao, Mu, Zonghao, Su, Chunbo, Li, Yuanyuan, Ma, Yiwei, Geng, Tao, and Song, Li
Published: 2024
Full Text: View/download PDF

43. Rapid Predictions for Lower-Order Dynamics of Machine Tools Based on the Rigid Multipoint Constraints

Author: Ma, Yiwei, Tian, Yanling, and Liu, Xianping
Published: 2023
Full Text: View/download PDF

44. Identification and Perceptual Interaction of Characteristic Aroma Compounds in Black Tea

Author: NIU Yunwei, MA Yiwei, XIAO Zuobing, HONG Liu, ZHAO Wei, CAI Haocheng
Subjects: black tea, headspace solid-phase microextraction, solvent-assisted evaporative extraction, gas chromatography olfactometry, aroma perceptual interaction, Food processing and manufacture, TP368-456
Abstract: The characteristic aroma volatile compounds of black tea were identified, the aroma profile of black tea was dissected, and the perceptual interaction of important characteristic aroma compounds was explored. On this basis, the modified vector model and Steven’s law were applied to this system. Headspace solid-phase microextraction (HS-SPME) and solvent-assisted evaporative extraction (SAFE) combined with gas chromatography-olfactometry (GC-O) were used to evaluate aroma compounds. A total of 51 characteristic aroma compounds were obtained. The major aroma substances identified were alcohols and aldehydes. The results of aroma extract dilution analysis (AEDA) and aroma intensity (OI) record showed that linalool, phenethyl alcohol, (Z)-3-hexenol, geraniol and methyl salicylate significantly contributed to the aroma of black tea. The partial least squares method was used to analyze the correlation between compounds and aroma characteristics, and the electronic nose sensors were matched with the aroma profile using the gray correlation method for comprehensive understanding of the aroma of black tea. Feller’s additive model was used to analyze the perceptual interaction of aroma compounds. Among the 10 groups of binary compounds, eight showed a masking effect, and the remaining two showed a synergistic effect. In conclusion, the above experiments can provide a reference for further study of important flavor substances in black tea.
Published: 2023
Full Text: View/download PDF

45. Recent progress in embedded LPFGs

Author: Geng, Tao, Su, Chunbo, Zhang, Shuo, and Ma, Yiwei
Published: 2023
Full Text: View/download PDF

46. Effects of olive oil on hepatic steatosis and liver enzymes: A systematic review

Author: Ma, Yiwei, Ding, Xinyue, Gu, Jie, Zhou, Shengmin, and Jiang, Yuanrong
Published: 2023
Full Text: View/download PDF

47. Solidification microstructure characteristics and their formation mechanism of K447A nickel-based superalloy for dual-performance blisk

Author: Pan, Chonglin, Yao, Zhihao, Ma, Yiwei, Li, Dayu, Yao, Kaijun, Chen, Yang, and Dong, Jianxin
Published: 2023
Full Text: View/download PDF

48. A Mach–Zehnder interferometer based on peanut structure for temperature and refractive index measurement

Author: Yu, Xuelian, Zuo, Shanshan, Zhang, Yue, Ma, Yiwei, Wang, Ruoning, Yang, Wenlei, Tian, Ke, Geng, Tao, and Wang, Pengfei
Published: 2022
Full Text: View/download PDF

49. Core-modulated long-period fiber gratings formed by heating and stretching

Author: Ma, Yiwei, Tian, Tian, Su, Chunbo, Dai, Zizhao, li, Yuanyuan, and Geng, Tao
Published: 2023
Full Text: View/download PDF

50. A Mach-Zehnder interferometer with two V-shaped cores for refractive index sensing

Author: Ma, Yiwei, Tian, Tian, Tan, Haoyang, Geng, Tao, Jin, Xiren, Sun, Weimin, and Yuan, Libo
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

626 results on '"Ma Yiwei"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources