Author: "Wanling Gao" / Journal: benchcouncil transactions on benchmarks, standards & evaluations - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Wanling Gao"' showing total 9 results

1. Evaluatology: The science and engineering of evaluation.

Author: Jianfeng Zhan, Lei Wang, Wanling Gao, Hongxiao Li, Chenxi Wang, Yunyou Huang, Yatao Li, Zhengxin Yang, Guoxin Kang, Chunjie Luo, Hainan Ye, Shaopeng Dai, and Zhifei Zhang
Subjects: SCIENCE education, ENGINEERING education, EVALUATION, METHODOLOGY, MATHEMATICAL models
Abstract: Evaluation is a crucial aspect of human existence and plays a vital role in each field. However, it is often approached in an empirical and ad-hoc manner, lacking consensus on universal concepts, terminologies, theories, and methodologies. This lack of agreement has significant consequences. This article aims to formally introduce the discipline of evaluatology, which encompasses the science and engineering of evaluation. We propose a universal framework for evaluation, encompassing concepts, terminologies, theories, and methodologies that can be applied across various disciplines, if not all disciplines. Our research reveals that the essence of evaluation lies in conducting experiments that intentionally apply a well-defined evaluation condition to individuals or systems under scrutiny, which we refer to as the subjects. This process allows for the creation of an evaluation system or model. By measuring and/or testing this evaluation system or model, we can infer the impact of different subjects. Derived from the essence of evaluation, we propose five axioms focusing on key aspects of evaluation outcomes as the foundational evaluation theory. These axioms serve as the bedrock upon which we build universal evaluation theories and methodologies. When evaluating a single subject, it is crucial to create evaluation conditions with different levels of equivalency. By applying these conditions to diverse subjects, we can establish reference evaluation models. These models allow us to alter a single independent variable at a time while keeping all other variables as controls. When evaluating complex scenarios, the key lies in establishing a series of evaluation models that maintain transitivity. Building upon the science of evaluation, we propose a formal definition of a benchmark as a simplified and sampled evaluation condition that guarantees different levels of equivalency. This concept serves as the cornerstone for a universal benchmark-based engineering approach to evaluation across various disciplines, which we refer to as benchmarkology. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

2. AIGCBench: Comprehensive evaluation of image-to-video content generated by AI.

Author: Fanda Fan, Chunjie Luo, Wanling Gao, and Jianfeng Zhan
Subjects: ARTIFICIAL intelligence, DIGITAL video, ALGORITHMS, IMAGE quality analysis, PERFORMANCE evaluation
Abstract: The burgeoning field of Artificial Intelligence Generated Content (AIGC) is witnessing rapid advancements, particularly in video generation. This paper introduces AIGCBench, a pioneering comprehensive and scalable benchmark designed to evaluate a variety of video generation tasks, with a primary focus on Image-to-Video (I2V) generation. AIGCBench tackles the limitations of existing benchmarks, which suffer from a lack of diverse datasets, by including a varied and open-domain image-text dataset that evaluates different state-of-the-art algorithms under equivalent conditions. We employ a novel text combiner and GPT-4 to create rich text prompts, which are then used to generate images via advanced Text-to-Image models. To establish a unified evaluation framework for video generation tasks, our benchmark includes 11 metrics spanning four dimensions to assess algorithm performance. These dimensions are control-video alignment, motion effects, temporal consistency, and video quality. These metrics are both reference video-based and video-free, ensuring a comprehensive evaluation strategy. The evaluation standard proposed correlates well with human judgment, providing insights into the strengths and weaknesses of current I2V algorithms. The findings from our extensive experiments aim to stimulate further research and development in the I2V field. AIGCBench represents a significant step toward creating standardized benchmarks for the broader AIGC landscape, proposing an adaptable and equitable framework for future assessments of video generation tasks. We have open-sourced the dataset and evaluation code on the project website: https://www.benchcouncil.org/AIGCBench. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

3. Algorithmic fairness in social context.

Author: Yunyou Huang, Wenjing Liu, Wanling Gao, Xiangjiang Lu, Xiaoshuang Liang, Zhengxin Yang, Hongxiao Li, Li Ma, and Suqin Tang
Subjects: FAIRNESS, SOCIAL context, ALGORITHMS, INFRASTRUCTURE (Economics), PUBLIC transit
Abstract: Algorithmic fairness research is currently receiving significant attention, aiming to ensure that algorithms do not discriminate between different groups or individuals with similar characteristics. However, with the popularization of algorithms in all aspects of society, algorithms have changed from mere instruments to social infrastructure. For instance, facial recognition algorithms are widely used to provide user verification services and have become an indispensable part of many social infrastructures like transportation, health care, etc. As an instrument, an algorithm needs to pay attention to the fairness of its behavior. However, as a social infrastructure, it needs to pay even more attention to its impact on social fairness. Otherwise, it may exacerbate existing inequities or create new ones. For example, if an algorithm treats all passengers equally and eliminates special seats for pregnant women in the interest of fairness, it will increase the risk of pregnant women taking public transport and indirectly damage their right to fair travel. Therefore, algorithms have the responsibility to ensure social fairness, not just within their operations. It is now time to expand the concept of algorithmic fairness beyond mere behavioral equity, assessing algorithms in a broader societal context, and examining whether they uphold and promote social fairness. This article analyzes the current status and challenges of algorithmic fairness from three key perspectives: fairness definition, fairness dataset, and fairness algorithm. Furthermore, the potential directions and strategies to promote the fairness of the algorithm are proposed. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

4. SNNBench: End-to-end AI-oriented spiking neural network benchmarking.

Author: Fei Tang and Wanling Gao
Subjects: ARTIFICIAL neural networks, ARTIFICIAL intelligence, COMPUTER network architectures, INFORMATION processing, SOURCE code
Abstract: Spiking Neural Networks (SNNs) show great potential for solving Artificial Intelligence (AI) applications. At the preliminary stage of SNNs, benchmarks are essential for evaluating and optimizing SNN algorithms, software, and hardware toward AI scenarios. However, a majority of SNN benchmarks focus on evaluating SNN for brain science, which has distinct neural network architectures and targets. Even though there are several benchmarks evaluating SNN for AI, they only focus on a single stage of training and inference or a processing fragment of a whole stage without accuracy information. Thus, the existing SNN benchmarks lack an end-to-end perspective that not only covers both training and inference but also provides a whole training process to a target accuracy level. This paper presents SNNBench--the first end-to-end AI-oriented SNN benchmark covering the processing stages of training and inference and containing the accuracy information. Focusing on two typical AI applications: image classification and speech recognition, we provide nine workloads that consider the typical characteristics of SNN, i.e., the dynamics of spiking neurons, and AI, i.e., learning paradigms including supervised and unsupervised learning, learning rules like backpropagation, connection types like fully connected, and accuracy. The evaluations of SNNBench on both CPU and GPU show its effectiveness. The specifications, source code, and results will be publicly available from https://www.benchcouncil.org/SNNBench. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

5. IoTBench: A data centrical and configurable IoT benchmark suite.

Author: Simin Chen, Chunjie Luo, Wanling Gao, and Lei Wang
Subjects: INTERNET of things, MICROPROCESSORS, DATA analysis, EVALUATION, ALGORITHMS
Abstract: As the Internet of Things (IoT) industry expands, the demand for microprocessors and microcontrollers used in IoT systems has increased steadily. Benchmarks provide a valuable reference for processor evaluation. Different IoT application scenarios face different data scales, dimensions, and types. However, the current popular benchmarks only evaluate the processor's performance under fixed data formats. These benchmarks cannot adapt to the fragmented scenarios faced by processors. This paper proposes a new benchmark, namely IoTBench. The IoTBench workloads cover three types of algorithms commonly used in IoT applications: matrix processing, list operation, and convolution. Moreover, IoTBench divides the data space into different evaluation subspaces according to the data scales, data types, and data dimensions. We analyze the impact of different data types, data dimensions, and data scales on processor performance and compare ARM with RISC-V and MinorCPU with O3CPU using IoTBench. We also explored the performance of processors with different architecture configurations in different evaluation subspaces and found the optimal architecture of different evaluation subspaces. The specifications, source code, and results are publicly available from https://www.benchcouncil.org/iotbench/. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

6. Edge AIBench 2.0: A scalable autonomous vehicle benchmark for IoT--Edge--Cloud systems.

Author: Tianshu Hao, Wanling Gao, Chuanxin Lan, Fei Tang, Zihan Jiang, and Jianfeng Zhan
Subjects: AUTONOMOUS vehicles, INTERNET of things, BENCHMARKING (Management), WIRELESS hotspots, CYBER physical systems
Abstract: Many emerging IoT--Edge--Cloud computing systems are not yet implemented, are too confidential to share their code, or have execution environments that are tricky to replicate, and hence their benchmarking is challenging. This paper uses autonomous vehicles as a typical scenario to build the first benchmark for Edge--Cloud systems. We propose a set of distilling rules for replicating autonomous vehicle scenarios to extract critical tasks with intertwined interactions. The essential system-level and component-level characteristics are captured while the system complexity is reduced significantly so that users can quickly evaluate and pinpoint the system and component bottlenecks. Also, we implement a scalable architecture through which users assess the systems with different sizes of workloads. We conduct several experiments to measure the performance. After testing two thousand autonomous vehicle task requests, we identify the bottleneck modules in autonomous vehicle scenarios and analyze hotspot functions. The experiment results show that the lane-keeping task is the slowest execution module, with a 99th-percentile tail latency of 77.49 ms. We hope this scenario benchmark will be helpful for Autonomous Vehicles and even IoT--Edge--Cloud research. The open-source code is now available from the official website https://www.benchcouncil.org/scenariobench/edgeaibench.html. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

7. HPC AI500 V3.0: A scalable HPC AI benchmarking framework.

Author: Zihan Jiang, Chunjie Luo, Wanling Gao, Lei Wang, and Jianfeng Zhan
Subjects: ARTIFICIAL intelligence, HIGH performance computing, DEEP learning, BENCHMARKING (Management), SCALABILITY
Abstract: In recent years, the convergence of High Performance Computing (HPC) and artificial intelligence (AI) makes the community desperately need a benchmark to guide the design of next-generation scalable HPC AI systems. The success of the HPL benchmarks and the affiliated TOP500 ranking indicates that scalability is the fundamental requirement to evaluate HPC systems. However, being scalable in terms of these emerging AI workloads like deep learning (DL) raises nontrivial challenges. This paper formally and systematically analyzes the factor that limits scalability in DL workloads and presents HPC AI500 V3.0, a scalable HPC AI benchmarking framework. The HPC AI500 V3.0 methodology is inspired by bagging, which utilizes the collective wisdom of an ensemble of base models and enables the benchmarks to be adaptively scalable to different scales of HPC systems. We implement HPC AI500 V3.0 in a highly customizable manner, maintaining the space of various optimization from both system and algorithm levels. By reusing the representative workloads in HPC AI500 V2.0, we evaluate HPC AI500 V3.0 on typical HPC systems, and the results show it has near-linear scalability. Furthermore, based on the customizable design, we present a case study to perform a trade-off between AI model quality and its training speed. The source code of HPC AI500 V3.0 is publicly available from the HPC AI500 project homepage https://www.benchcouncil.org/aibench/hpcai500/. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

8. High fusion computers: The IoTs, edges, data centers, and humans-in-the-loop as a computer.

Author: Wanling Gao, Lei Wang, Mingyu Chen, Jin Xiong, Chunjie Luo, Wenli Zhang, Yunyou Huang, Weiping Li, Guoxin Kang, Chen Zheng, Biwei Xie, Shaopeng Dai, Qian He, Hainan Ye, Yungang Bao, and Jianfeng Zhan
Subjects: COMPUTERS, DATA libraries, INTERNET of things, ACCURACY, SOURCE code
Abstract: Emerging and future applications rely heavily upon systems consisting of Internet of Things (IoT), edges, data centers, and humans-in-the-loop. Significantly different from warehouse-scale computers that serve independent concurrent user requests, this new class of computer systems directly interacts with the physical world, considering humans an essential part and performing safety-critical and mission-critical operations; their computations have intertwined dependencies between not only adjacent execution loops but also actions or decisions triggered by IoTs, edge, datacenters, or humans-in-the-loop; the systems must first satisfy the accuracy metric in predicting, interpreting, or taking action before meeting the performance goal under different cases. This article argues we need a paradigm shift to reconstruct the IoTs, edges, data centers, and humans-in-the-loop as a computer rather than a distributed system. We coin a new term, high fusion computers (HFCs), to describe this class of systems. The fusion in the term has two implications: fusing IoTs, edges, data centers, and humans-in-the-loop as a computer, fusing the physical and digital worlds through HFC systems. HFC is a pivotal case of the open-source computer systems initiative. We laid out the challenges, plan, and call for uniting our community's wisdom and actions to address the HFC challenges. Everything, including the source code, will be publicly available from the project homepage: https://www.computercouncil.org/HFC/. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

9. 2022 BenchCouncil International Symposium on benchmarking, measuring and optimizing (Bench 2022) call for papers.

Author: Chunjie Luo and Wanling Gao
Subjects: BENCHMARKING (Management), DATA management, HARDWARE, COMPUTER software, DATA
Published: 2022
Full Text: View/download PDF
