32,659 results on '"Stoica, A."'
Search Results
52. Barriers to the Development of Integrated Risk Management Within Health Insurance Entities in Central and Eastern Europe
- Author
-
Stoica, Bogdan-Stefan, Ciocoiu, Carmen Nadia, Nicolescu, Ovidiu, editor, Oprean, Constantin, editor, Titu, Aurel Mihail, editor, and Vaduva, Sebastian, editor
- Published
- 2025
- Full Text
- View/download PDF
53. From Fault Likelihood to Fault Networks: Stochastic Seismic Interpretation Through a Marked Point Process with Interactions
- Author
-
Taty Moukati, Fabrice, Stoica, Radu Stefan, Bonneau, François, Wu, Xinming, and Caumon, Guillaume
- Published
- 2024
- Full Text
- View/download PDF
54. Fairness in Serving Large Language Models
- Author
-
Sheng, Ying, Cao, Shiyi, Li, Dacheng, Zhu, Banghua, Li, Zhuohan, Zhuo, Danyang, Gonzalez, Joseph E., and Stoica, Ion
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Machine Learning ,Computer Science - Performance - Abstract
High-demand LLM inference services (e.g., ChatGPT and BARD) support a wide range of requests from short chat conversations to long document reading. To ensure that all client requests are processed fairly, most major LLM inference services have request rate limits, to ensure that no client can dominate the request queue. However, this rudimentary notion of fairness also results in under-utilization of the resources and poor client experience when there is spare capacity. While there is a rich literature on fair scheduling, serving LLMs presents new challenges due to their unpredictable request lengths and their unique batching characteristics on parallel accelerators. This paper introduces the definition of LLM serving fairness based on a cost function that accounts for the number of input and output tokens processed. To achieve fairness in serving, we propose a novel scheduling algorithm, the Virtual Token Counter (VTC), a fair scheduler based on the continuous batching mechanism. We prove a 2x tight upper bound on the service difference between two backlogged clients, adhering to the requirement of work-conserving. Through extensive experiments, we demonstrate the superior performance of VTC in ensuring fairness, especially in contrast to other baseline methods, which exhibit shortcomings under various conditions. The reproducible code is available at https://github.com/Ying1123/VTC-artifact
- Published
- 2023
55. SuperServe: Fine-Grained Inference Serving for Unpredictable Workloads
- Author
-
Khare, Alind, Garg, Dhruv, Kalra, Sukrit, Grandhi, Snigdha, Stoica, Ion, and Tumanov, Alexey
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Machine Learning - Abstract
The increasing deployment of ML models on the critical path of production applications in both datacenter and the edge requires ML inference serving systems to serve these models under unpredictable and bursty request arrival rates. Serving models under such conditions requires these systems to strike a careful balance between the latency and accuracy requirements of the application and the overall efficiency of utilization of scarce resources. State-of-the-art systems resolve this tension by either choosing a static point in the latency-accuracy tradeoff space to serve all requests or load specific models on the critical path of request serving. In this work, we instead resolve this tension by simultaneously serving the entire-range of models spanning the latency-accuracy tradeoff space. Our novel mechanism, SubNetAct, achieves this by carefully inserting specialized operators in weight-shared SuperNetworks. These operators enable SubNetAct to dynamically route requests through the network to meet a latency and accuracy target. SubNetAct requires upto 2.6x lower memory to serve a vastly-higher number of models than prior state-of-the-art. In addition, SubNetAct's near-instantaneous actuation of models unlocks the design space of fine-grained, reactive scheduling policies. We explore the design of one such extremely effective policy, SlackFit and instantiate both SubNetAct and SlackFit in a real system, SuperServe. SuperServe achieves 4.67% higher accuracy for the same SLO attainment and 2.85x higher SLO attainment for the same accuracy on a trace derived from the real-world Microsoft Azure Functions workload and yields the best trade-offs on a wide range of extremely-bursty synthetic traces automatically.
- Published
- 2023
56. CodeScholar: Growing Idiomatic Code Examples
- Author
-
Shetty, Manish, Sen, Koushik, and Stoica, Ion
- Subjects
Computer Science - Software Engineering ,Computer Science - Machine Learning ,Computer Science - Programming Languages - Abstract
Programmers often search for usage examples for API methods. A tool that could generate realistic, idiomatic, and contextual usage examples for one or more APIs would be immensely beneficial to developers. Such a tool would relieve the need for a deep understanding of the API landscape, augment existing documentation, and help discover interactions among APIs. We present CodeScholar, a tool that generates idiomatic code examples demonstrating the common usage of API methods. It includes a novel neural-guided search technique over graphs that grows the query APIs into idiomatic code examples. Our user study demonstrates that in 70% of cases, developers prefer CodeScholar generated examples over state-of-the-art large language models (LLM) like GPT3.5. We quantitatively evaluate 60 single and 25 multi-API queries from 6 popular Python libraries and show that across-the-board CodeScholar generates more realistic, diverse, and concise examples. In addition, we show that CodeScholar not only helps developers but also LLM-powered programming assistants generate correct code in a program synthesis setting.
- Published
- 2023
57. SGLang: Efficient Execution of Structured Language Model Programs
- Author
-
Zheng, Lianmin, Yin, Liangsheng, Xie, Zhiqiang, Sun, Chuyue, Huang, Jeff, Yu, Cody Hao, Cao, Shiyi, Kozyrakis, Christos, Stoica, Ion, Gonzalez, Joseph E., Barrett, Clark, and Sheng, Ying
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Programming Languages - Abstract
Large language models (LLMs) are increasingly used for complex tasks that require multiple generation calls, advanced prompting techniques, control flow, and structured inputs/outputs. However, efficient systems are lacking for programming and executing these applications. We introduce SGLang, a system for efficient execution of complex language model programs. SGLang consists of a frontend language and a runtime. The frontend simplifies programming with primitives for generation and parallelism control. The runtime accelerates execution with novel optimizations like RadixAttention for KV cache reuse and compressed finite state machines for faster structured output decoding. Experiments show that SGLang achieves up to 6.4x higher throughput compared to state-of-the-art inference systems on various large language and multi-modal models on tasks including agent control, logical reasoning, few-shot learning benchmarks, JSON decoding, retrieval-augmented generation pipelines, and multi-turn chat. The code is publicly available at https://github.com/sgl-project/sglang
- Published
- 2023
58. The Riemann zeta function and exact exponential sum identities of divisor functions
- Author
-
Nastasescu, Maria, Robles, Nicolas, Stoica, Bogdan, and Zaharescu, Alexandru
- Subjects
Mathematics - Number Theory - Abstract
We prove an explicit integral formula for computing the product of two shifted Riemann zeta functions everywhere in the complex plane. We show that this formula implies the existence of infinite families of exact exponential sum identities involving the divisor functions, and we provide examples of these identities. We conjecturally propose a method to compute divisor functions by matrix inversion, without employing arithmetic techniques., Comment: 24 pages, 2 figures
- Published
- 2023
59. A visual perspective on the Birch and Swinnerton-Dyer conjecture through a family of approximations of $L$-functions
- Author
-
Nastasescu, Maria, Stoica, Bogdan, and Zaharescu, Alexandru
- Subjects
Mathematics - Number Theory - Abstract
We investigate the properties of a family of approximations of the Hasse-Weil $L$-function associated to an elliptic curve $E$ over $\mathbb{Q}$. We give a precise expression for the error of the approximations, and provide a visual interpretation of the analytic rank $m$ of $E$ as a sequence of near regular polygons around the center of the critical strip, each with vertices at the zeros of the approximations., Comment: 26 pages, 4 figures
- Published
- 2023
60. Rethinking Benchmark and Contamination for Language Models with Rephrased Samples
- Author
-
Yang, Shuo, Chiang, Wei-Lin, Zheng, Lianmin, Gonzalez, Joseph E., and Stoica, Ion
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Large language models are increasingly trained on all the data ever produced by humans. Many have raised concerns about the trustworthiness of public benchmarks due to potential contamination in pre-training or fine-tuning datasets. While most data decontamination efforts apply string matching (e.g., n-gram overlap) to remove benchmark data, we show that these methods are insufficient, and simple variations of test data (e.g., paraphrasing, translation) can easily bypass these decontamination measures. Furthermore, we demonstrate that if such variation of test data is not eliminated, a 13B model can easily overfit a test benchmark and achieve drastically high performance, on par with GPT-4. We validate such observations in widely used benchmarks such as MMLU, GSK8k, and HumanEval. To address this growing risk, we propose a stronger LLM-based decontamination method and apply it to widely used pre-training and fine-tuning datasets, revealing significant previously unknown test overlap. For example, in pre-training sets such as RedPajama-Data-1T and StarCoder-Data, we identified that 8-18\% of the HumanEval benchmark overlaps. Interestingly, we also find such contamination in synthetic dataset generated by GPT-3.5/4, suggesting a potential risk of unintentional contamination. We urge the community to adopt stronger decontamination approaches when using public benchmarks. Moreover, we call for the community to actively develop fresh one-time exams to evaluate models accurately. Our decontamination tool is publicly available at https://github.com/lm-sys/llm-decontaminator.
- Published
- 2023
61. S-LoRA: Serving Thousands of Concurrent LoRA Adapters
- Author
-
Sheng, Ying, Cao, Shiyi, Li, Dacheng, Hooper, Coleman, Lee, Nicholas, Yang, Shuo, Chou, Christopher, Zhu, Banghua, Zheng, Lianmin, Keutzer, Kurt, Gonzalez, Joseph E., and Stoica, Ion
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
The "pretrain-then-finetune" paradigm is commonly adopted in the deployment of large language models. Low-Rank Adaptation (LoRA), a parameter-efficient fine-tuning method, is often employed to adapt a base model to a multitude of tasks, resulting in a substantial collection of LoRA adapters derived from one base model. We observe that this paradigm presents significant opportunities for batched inference during serving. To capitalize on these opportunities, we present S-LoRA, a system designed for the scalable serving of many LoRA adapters. S-LoRA stores all adapters in the main memory and fetches the adapters used by the currently running queries to the GPU memory. To efficiently use the GPU memory and reduce fragmentation, S-LoRA proposes Unified Paging. Unified Paging uses a unified memory pool to manage dynamic adapter weights with different ranks and KV cache tensors with varying sequence lengths. Additionally, S-LoRA employs a novel tensor parallelism strategy and highly optimized custom CUDA kernels for heterogeneous batching of LoRA computation. Collectively, these features enable S-LoRA to serve thousands of LoRA adapters on a single GPU or across multiple GPUs with a small overhead. Compared to state-of-the-art libraries such as HuggingFace PEFT and vLLM (with naive support of LoRA serving), S-LoRA can improve the throughput by up to 4 times and increase the number of served adapters by several orders of magnitude. As a result, S-LoRA enables scalable serving of many task-specific fine-tuned models and offers the potential for large-scale customized fine-tuning services. The code is available at https://github.com/S-LoRA/S-LoRA
- Published
- 2023
62. The prince and the pauper. A quantum paradox of Hilbert-space fundamentalism
- Author
-
Stoica, Ovidiu Cristinel
- Subjects
Quantum Physics ,Physics - History and Philosophy of Physics - Abstract
The quantum world is described by a unit vector in the Hilbert space and the Hamiltonian. Do they, as abstract basis-independent objects, give a complete description of the physical world, or should we include observables like positions and momenta and the decomposition into subsystems? According to "Hilbert-space fundamentalism" they give a complete description, and all other features of the physical world emerge from them (Carroll, arXiv:2103.09780). This thesis was previously refuted in (arXiv:2102.08620) in full generality. But being an abstract non-uniqueness proof, and not a constructive one, it may not be convincing enough to the busy reader who wants to avoid mathematical details. Here I give a simpler, intuitive and constructive refutation, by showing that concrete physically distinct worlds can be described by the same unit vector and evolve according to the same law., Comment: 4 pages. Comments welcomed!
- Published
- 2023
63. MemGPT: Towards LLMs as Operating Systems
- Author
-
Packer, Charles, Wooders, Sarah, Lin, Kevin, Fang, Vivian, Patil, Shishir G., Stoica, Ion, and Gonzalez, Joseph E.
- Subjects
Computer Science - Artificial Intelligence - Abstract
Large language models (LLMs) have revolutionized AI, but are constrained by limited context windows, hindering their utility in tasks like extended conversations and document analysis. To enable using context beyond limited context windows, we propose virtual context management, a technique drawing inspiration from hierarchical memory systems in traditional operating systems that provide the appearance of large memory resources through data movement between fast and slow memory. Using this technique, we introduce MemGPT (Memory-GPT), a system that intelligently manages different memory tiers in order to effectively provide extended context within the LLM's limited context window, and utilizes interrupts to manage control flow between itself and the user. We evaluate our OS-inspired design in two domains where the limited context windows of modern LLMs severely handicaps their performance: document analysis, where MemGPT is able to analyze large documents that far exceed the underlying LLM's context window, and multi-session chat, where MemGPT can create conversational agents that remember, reflect, and evolve dynamically through long-term interactions with their users. We release MemGPT code and data for our experiments at https://memgpt.ai., Comment: Code and data available at https://research.memgpt.ai
- Published
- 2023
64. Online Speculative Decoding
- Author
-
Liu, Xiaoxuan, Hu, Lanxiang, Bailis, Peter, Cheung, Alvin, Deng, Zhijie, Stoica, Ion, and Zhang, Hao
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Speculative decoding is a pivotal technique to accelerate the inference of large language models (LLMs) by employing a smaller draft model to predict the target model's outputs. However, its efficacy can be limited due to the low predictive accuracy of the draft model, particularly when faced with diverse text inputs and a significant capability gap between the draft and target models. We introduce online speculative decoding to address this challenge. The main idea is to continuously update the (multiple) draft model(s) on observed user query data. Adapting to query distribution mitigates the shifts between the training distribution of the draft model and the query distribution, enabling the draft model to more accurately predict the target model's outputs. We develop a prototype of online speculative decoding based on knowledge distillation and evaluate it using both synthetic and real query data. The results show a substantial increase in the token acceptance rate by 0.1 to 0.65, bringing 1.42x to 2.17x latency reduction. Our code is available at https://github.com/LiuXiaoxuanPKU/OSD.
- Published
- 2023
65. DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training
- Author
-
Li, Dacheng, Shao, Rulin, Xie, Anze, Xing, Eric P., Ma, Xuezhe, Stoica, Ion, Gonzalez, Joseph E., and Zhang, Hao
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
FlashAttention (Dao, 2023) effectively reduces the quadratic peak memory usage to linear in training transformer-based large language models (LLMs) on a single GPU. In this paper, we introduce DISTFLASHATTN, a distributed memory-efficient attention mechanism optimized for long-context LLMs training. We propose three key techniques: token-level workload balancing, overlapping key-value communication, and a rematerialization-aware gradient checkpointing algorithm. We evaluate DISTFLASHATTN on Llama-7B and variants with sequence lengths from 32K to 512K. DISTFLASHATTN achieves 8x longer sequences, 4.45 - 5.64x speedup compared to Ring Self-Attention, 2 - 8x longer sequences, 1.24 - 2.01x speedup compared to Megatron-LM with FlashAttention. It achieves 1.67x and 1.26 - 1.88x speedup compared to recent Ring Attention and DeepSpeed-Ulysses. Code is available at https://github.com/RulinShao/LightSeq.
- Published
- 2023
66. Functional imaging guided stereotactic ablative body radiotherapy (SABR) with focal dose escalation and bladder trigone sparing for intermediate and high-risk prostate cancer: study protocol for phase II safo trial
- Author
-
Zapatero, Almudena, Castro, Pablo, Roch, María, Carnero, Pablo Rodríguez, Carroceda, Sara, Rosciupchin, Alexandra Elena Stoica, Hernández, Sergio Honorato, Cogorno, Leopoldo, Iturriaga, Alfonso Gómez, and García, David Büchser
- Published
- 2024
- Full Text
- View/download PDF
67. Interaction of high-fat diet and brain trauma alters adipose tissue macrophages and brain microglia associated with exacerbated cognitive dysfunction
- Author
-
Henry, Rebecca J., Barrett, James P., Vaida, Maria, Khan, Niaz Z., Makarevich, Oleg, Ritzel, Rodney M., Faden, Alan I., and Stoica, Bogdan A.
- Published
- 2024
- Full Text
- View/download PDF
68. Enhancing SiGeSn nanocrystals SWIR photosensing by high passivation in nanocrystalline HfO2 matrix
- Author
-
Dascalescu, Ioana, Palade, Catalin, Slav, Adrian, Stavarache, Ionel, Cojocaru, Ovidiu, Teodorescu, Valentin Serban, Maraloiu, Valentin-Adrian, Lepadatu, Ana-Maria, Ciurea, Magdalena Lidia, and Stoica, Toma
- Published
- 2024
- Full Text
- View/download PDF
69. Freedom in the Many-Worlds Interpretation
- Author
-
Stoica, Ovidiu Cristinel
- Published
- 2024
- Full Text
- View/download PDF
70. ℒ1 adaptive controller of VEGA Launcher subject to flexible modes
- Author
-
Naji Anees Muqdad NAJI and Adrian-Mihail STOICA
- Subjects
ℒ1 adaptive controller ,flexible launcher model ,robust stability ,aerodynamic load vs. mach number ,Motor vehicles. Aeronautics. Astronautics ,TL1-4050 - Abstract
Flexible launch vehicle modes contribute to the deterioration of its stability performance. For this reason, their influence must be considered when designing and validating appropriate solutions for its automatic control system. The first three bending modes are the most relevant for control system design and analysis. Next, considerations regarding the elastic modes of the Vega actuator, used in the simulations performed throughout this study, are presented. The IMU (Inertial Measurement Unit) measurements providing attitude angles and angular rates are strongly corrupted by the local elastic deformations resulting from structural flexibility. These measurements are fed back to the TVC (Thrust Vector Control) system which controls the pitch and the yaw motion of the vehicle. The design methodology for this control system is based on an ℒ1 adaptive technique incorporating a Butterworth filter guarantees good stability performances for wide variations of the launcher’s dynamics parameters.
- Published
- 2024
- Full Text
- View/download PDF
71. Distributed H∞ State Feedback Control for Multi-Agent Systems with Imperfect Communication Networks
- Author
-
Serena Cristiana STOICU and Adrian-Mihail STOICA
- Subjects
multi-agent systems ,distributed control ,h∞ design ,imperfect communication networks ,Motor vehicles. Aeronautics. Astronautics ,TL1-4050 - Abstract
The main objective of this paper consists in the analysis and design of distributed control systems for multi-agent systems simultaneous with various types of disturbances attenuation. The distributed structures refer to those structures for which the control laws depend only on neighbouring agents states, more precisely, communication between certain agents is missing. The design approach is based on the definition of a H∞ type cost function for the entire system, a frequently used method, especially for aerospace applications with a single agent. As a case study, a flight formation consisting of four agents with identical dynamics is used, whose time evolutions are analysed according to the effects of different communication networks failures.
- Published
- 2024
- Full Text
- View/download PDF
72. Exploitation of red beet peel powder as a natural food ingredient in whey-fruit based beverage
- Author
-
Florina Stoica, Roxana Nicoleta Rațu, Florin Daniel Lipșa, Iuliana Motrescu, Irina Gabriela Cara, Gabriela Rapeanu, Iuliana Aprodu, Denis Țopa, and Gerard Jităreanu
- Subjects
Beetroot peel ,betalains ,antioxidant activity ,natural ingredients ,value-added products ,Nutrition. Foods and food supply ,TX341-641 ,Food processing and manufacture ,TP368-456 - Abstract
Scientists and food producers are studying the potential of utilizing different by-products as highly nutritious food components to meet consumers’ growing demand for healthier and natural goods. Beetroot belongs to the Amaranthaceae family and is highly rich in antioxidants, specifically phenolics and betalains with excellent health properties. The peel of red beetroot, which is generally discarded, has a high concentration of bioactive phytochemicals (betalains, dietary fibers). These compounds are well-known for their strong antioxidant effects, improving cardiovascular health, reducing oxidative stress, and enhancing immunological function. Incorporating red beetroot peel (RBP) powder into whey-fruit-based beverages is a promising method to improve nutritional content and sensory attractiveness. This study investigated the influence of adding RBP to whey-fruit-based beverages, specifically examining its effects on physicochemical and phytochemical characteristics, antioxidant capacity, color, microbiological, and rheological attributes, and consumer preference. The RBP extract exhibited high amounts of total polyphenols (1239.35 ± 0.51 mg GAE/100 g dw) and antioxidant activity (90.02 ± 0.22%). The findings indicated that RBP had a notable impact on the antioxidant capacity (16.38 ± 0.37 μmol TE/g dw for BRBP1, 26.69 ± 0.10 μmol TE/g dw for BRBP3, 36.75 ± 0.31 μmol TE/g dw for BRBP6) of the beverages and enhanced their color without negatively impacting their sensory characteristics. The dynamic oscillatory rheological measurements revealed that beverage supplementation with increasing amounts of RBP obtained stronger networks with solid-like viscoelastic behavior. Using RBP powder as a natural food ingredient in whey-fruit-based beverages enhances their nutritional value and promotes sustainable food processing by making use of by-products from the food industry into innovative food options.
- Published
- 2024
- Full Text
- View/download PDF
73. Images on journalism and the power of media to turn the page in the history of crises: a Malian case
- Author
-
Diana S. Stoica
- Subjects
media reality ,radio journalists ,freedom of speech ,conflict zones ,vulnerability of journalists ,deontology ,Literature (General) ,PN1-6790 - Abstract
A few images on journalism and its actors in the Sahel, and specifically Mali, that are shaped by the media itself, are presented. These images, identified in articles that capitalize on the challenges of political, social, and cultural crisis, as well as the risks that a journalist takes when reporting and writing on the realities and events he observes in Sahel, are supposed to shape the power of the media to bring change at an epistemological level in the society it unpacks and depicts. This epistemological metamorphosis is seen as a turning of a page in the history of crises and the proposal herein is to see what type of change this turning refers to, re-launching critical perspectives on the new powers or non-powers of journalists and journalism to inform, create and maintain a critical resistance meant to leave a relevant sign in the history, through deconstruction and management of crises communication, that would finally assure the durable control of crises, exercised by an informed and empowered society with the scope to overcome crises at a pragmatic and epistemic level. The analysis is qualitative and intends to invite the reader to more reflection on the interdependencies between the reality, the journalistic reality, and the journalist’s power or non-power to coin the two. Crises whose management from the point of view of public awareness and truth knowledge is compromised by a second-level crisis affecting the journalists, the leaders of the civil society, and broadly the media, who have concomitant powers and non-powers to change the perception of the public on the first level crises to such a point that, in the holistic interpretation of the term, this management of crisis through deflection should be considered a new page turned in the history of crises.
- Published
- 2024
- Full Text
- View/download PDF
74. Weight loss in subjects with type 2 diabetes before and after SARS- CoV2 infection - A retrospective observational study
- Author
-
Roxana Adriana Stoica and Florentina Gherghiceanu
- Subjects
malnutrition ,weight loss ,visceral fat ,sars-cov2 infection ,type 2 diabetes mellitus ,Medicine (General) ,R5-920 - Abstract
Objectives. As weight modification during the COVID-19 pandemic was reported in several circumstances, we aimed to assess the body composition changes using bio impedance in patients with type 2 diabetes mellitus (T2DM) during this period. Materials and Methods. We conducted an observational, retrospective study, from January 2021- June 2021, in two outpatient clinics, enrolling all patients with T2DM and SARS-CoV2 infection that presented for evaluation after the infection. Blood tests (serum creatinine, urea, blood glucose, lipid profile, transaminases, HbA1c) were available before the onset of infection as well as at an interval of 1-3 months post-infection. Results. From a total of 118 patients, 101 subjects were eligible, 50.5% males. 68.6% had a mild form of SARS-CoV2 infection. There is a significant decrease in mean weight (91.9 ± 26.00 kg before and 90.00 ± 23.00 kg after infection vs. control, p
- Published
- 2024
75. Obesity in children: systematic review over a 6-year period, including the Covid-19 pandemic
- Author
-
Cecilia Curis, Valeriu Ardeleanu, Lavinia Alexandra Moroianu, Corina Manole, Roxana Adriana Stoica, Florentina Gherghiceanu, and Anca Pantea Stoian
- Subjects
obesity ,diet ,children ,pandemic ,sars-cov2 infection ,Medicine (General) ,R5-920 - Abstract
Although obesity is a frequently formulated diagnosis at all ages, due to the long-term projection of the consequences of this condition in children it is a real public health problem. The etiology of obesity is multiple and its complexity requires a multidisciplinary medical approach from which the psychological component cannot be omitted. Thus, diseases such as diabetes, dyslipidemias, cardiovascular diseases, sleep apnea syndrome, non-alcoholic fatty liver or neoplasia are encountered with a higher incidence in this category of individuals. During the COVID-19 pandemic, the isolation and, consequently, the reduction of access to ways of performing physical exercise increased the balance between caloric intake and caloric consumption resulting in the accumulation of surplus calories in the form of adipose tissue. The purpose of the present work is to emphasize the interest manifested by the medical scientific world regarding obesity in the pediatric population, in the pre-pandemic period, during the pandemic period and one year after its’ end (2018-2023). We performed systematic review of clinical studies on obesity in the pediatric population, including 98 articles published in the PubMed database. The number of studies published during the pandemic period (53) vs the number of studies published ex-pandemic (45), corresponds to a ratio of 1.17:1 in favor of the first. Obesity remains a research topic of major interest in early life, regardless of the coexistence of COVID-19.
- Published
- 2024
- Full Text
- View/download PDF
76. A Qualitative Approach Regarding the Impact of Digitalization and Automation on the Accounting and Auditing Profession
- Author
-
Oana-Cristina STOICA and Liliana IONESCU-FELEAGA
- Subjects
digitalization ,automation ,accounting profession ,auditing ,impact ,Accounting. Bookkeeping ,HF5601-5689 ,Finance ,HG1-9999 - Abstract
In recent years, companies worldwide have faced a rapid pace of digitization and automation, which has led to change and adaptation of business models. From this point of view, new technologies have revolutionized the field of accounting and auditing, having both positive and negative effects on companies and employees. This paper highlights how changes brought about by technological progress influence the accounting and auditing profession and the role of other factors in this direction, using a qualitative method based on semi-structured interviews. The study results show that the benefits are visible at the company level. However, certain obstacles still exist, such as employees' resistance to change, the size of the initial costs or the systems used. On the other hand, professionals expect some entry-level jobs to disappear. Instead, other opportunities will be available for practitioners in the field. In this sense, universities will have a unique role in training the new generations by developing skills for the digital age. The present study may be of interest to researchers examining related issues. From a practical point of view, this paper could be helpful to professionals as it highlights several current needs of the business environment due to the impact of technological innovations.
- Published
- 2024
- Full Text
- View/download PDF
77. Behavioral Models that Influence Consumers Purchase Decisions
- Author
-
Dimitrie Stoica, Cezar Ionut Bichescu, and Maricica Stoica
- Subjects
consumer behavior ,traditional models ,modern models ,decision-making process ,marketing strategies ,Electronic computers. Computer science ,QA75.5-76.95 ,Economic theory. Demography ,HB1-3840 ,Economics as a science ,HB71-74 - Abstract
This article delves into the complexities of consumer behavior, offering an analysis of traditional and modern behavioral models to understand purchasing decisions. It begins with presentation of the Marshallian, Pavlovian, Freudian, and Veblenian models, which focus on economic, psychological, and social influences. Subsequently, the study explores contemporary frameworks like the Nicosia, Engel-Kollat-Blackwell, and Bettman models, highlighting their contributions to comprehending the consumer decision-making process in a dynamic market environment. By integrating insights from various models, the article underscores the significance of understanding consumer psychology, external stimuli, and individual roles in purchase activities. This review aims to equip marketers with the knowledge to devise effective strategies that address current market challenges and enhance consumer satisfaction and organizational profitability.
- Published
- 2024
- Full Text
- View/download PDF
78. Endocarditis with 'Streptococcus pseudoporcinus' associated with Mastocytosis and spondylodiscitis-a coincidental association?: A case report
- Author
-
Birlutiu, Victoria, Birlutiu, Rares-Mircea, Teodoru, Minodora, Catana, Alina Camelia, and Stoica, Cristian Ioan
- Published
- 2023
79. LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
- Author
-
Zheng, Lianmin, Chiang, Wei-Lin, Sheng, Ying, Li, Tianle, Zhuang, Siyuan, Wu, Zhanghao, Zhuang, Yonghao, Li, Zhuohan, Lin, Zi, Xing, Eric P., Gonzalez, Joseph E., Stoica, Ion, and Zhang, Hao
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Studying how people interact with large language models (LLMs) in real-world scenarios is increasingly important due to their widespread use in various applications. In this paper, we introduce LMSYS-Chat-1M, a large-scale dataset containing one million real-world conversations with 25 state-of-the-art LLMs. This dataset is collected from 210K unique IP addresses in the wild on our Vicuna demo and Chatbot Arena website. We offer an overview of the dataset's content, including its curation process, basic statistics, and topic distribution, highlighting its diversity, originality, and scale. We demonstrate its versatility through four use cases: developing content moderation models that perform similarly to GPT-4, building a safety benchmark, training instruction-following models that perform similarly to Vicuna, and creating challenging benchmark questions. We believe that this dataset will serve as a valuable resource for understanding and advancing LLM capabilities. The dataset is publicly available at https://huggingface.co/datasets/lmsys/lmsys-chat-1m.
- Published
- 2023
80. Efficient Memory Management for Large Language Model Serving with PagedAttention
- Author
-
Kwon, Woosuk, Li, Zhuohan, Zhuang, Siyuan, Sheng, Ying, Zheng, Lianmin, Yu, Cody Hao, Gonzalez, Joseph E., Zhang, Hao, and Stoica, Ion
- Subjects
Computer Science - Machine Learning ,Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
High throughput serving of large language models (LLMs) requires batching sufficiently many requests at a time. However, existing systems struggle because the key-value cache (KV cache) memory for each request is huge and grows and shrinks dynamically. When managed inefficiently, this memory can be significantly wasted by fragmentation and redundant duplication, limiting the batch size. To address this problem, we propose PagedAttention, an attention algorithm inspired by the classical virtual memory and paging techniques in operating systems. On top of it, we build vLLM, an LLM serving system that achieves (1) near-zero waste in KV cache memory and (2) flexible sharing of KV cache within and across requests to further reduce memory usage. Our evaluations show that vLLM improves the throughput of popular LLMs by 2-4$\times$ with the same level of latency compared to the state-of-the-art systems, such as FasterTransformer and Orca. The improvement is more pronounced with longer sequences, larger models, and more complex decoding algorithms. vLLM's source code is publicly available at https://github.com/vllm-project/vllm, Comment: SOSP 2023
- Published
- 2023
81. Statistically bias-minimized peculiar velocity catalogs from Gibbs point processes and Bayesian inference
- Author
-
Sorce, Jenny G., Stoica, Radu S., and Tempel, Elmo
- Subjects
Astrophysics - Cosmology and Nongalactic Astrophysics - Abstract
Galaxy peculiar velocities are excellent cosmological probes provided that biases inherent to their measurements are contained before any study. This paper proposes a new algorithm based on an object point process model whose probability density is built to statistically reduce the effects of Malmquist biases and uncertainties due to lognormal errors in radial peculiar velocity catalogs. More precisely, a simulated annealing algorithm permits maximizing the probability density describing the point process model. The resulting configurations are bias-minimized catalogs. Tests are conducted on synthetic catalogs mimicking the second and third distance modulus catalogs of the Cosmicflows project from which peculiar velocity catalogs are derived. By reducing the local peculiar velocity variance in catalogs by an order of magnitude, the algorithm permits recovering the expected one while preserving the small-scale velocity correlation. It also permits retrieving the expected clustering. The algorithm is then applied to the observational catalogs. The large-scale structure reconstructed with the Wiener-filter technique applied to the bias-minimized observational catalogs matches with great success the local cosmic web as depicted by redshift surveys of local galaxies. These new bias-minimized versions of peculiar velocity catalogs can be used as a starting point for several studies from possibly estimating the most probable Hubble constant, H0, value to the production of simulations constrained to reproduce the local Universe., Comment: Accepted for publication in A&A, 26 pages, 22 figures, 3 tables
- Published
- 2023
- Full Text
- View/download PDF
82. Analyzing domain shift when using additional data for the MICCAI KiTS23 Challenge
- Author
-
Stoica, George, Breaban, Mihaela, and Barbu, Vlad
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Using additional training data is known to improve the results, especially for medical image 3D segmentation where there is a lack of training material and the model needs to generalize well from few available data. However, the new data could have been acquired using other instruments and preprocessed such its distribution is significantly different from the original training data. Therefore, we study techniques which ameliorate domain shift during training so that the additional data becomes better usable for preprocessing and training together with the original data. Our results show that transforming the additional data using histogram matching has better results than using simple normalization., Comment: This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this contribution is published in https://link.springer.com/book/10.1007/978-3-031-54806-2, and is available online at https://doi.org/10.1007/978-3-031-54806-2_4
- Published
- 2023
- Full Text
- View/download PDF
83. Leveraging Cloud Computing to Make Autonomous Vehicles Safer
- Author
-
Schafhalter, Peter, Kalra, Sukrit, Xu, Le, Gonzalez, Joseph E., and Stoica, Ion
- Subjects
Computer Science - Robotics - Abstract
The safety of autonomous vehicles (AVs) depends on their ability to perform complex computations on high-volume sensor data in a timely manner. Their ability to run these computations with state-of-the-art models is limited by the processing power and slow update cycles of their onboard hardware. In contrast, cloud computing offers the ability to burst computation to vast amounts of the latest generation of hardware. However, accessing these cloud resources requires traversing wireless networks that are often considered to be too unreliable for real-time AV driving applications. Our work seeks to harness this unreliable cloud to enhance the accuracy of an AV's decisions, while ensuring that it can always fall back to its on-board computational capabilities. We identify three mechanisms that can be used by AVs to safely leverage the cloud for accuracy enhancements, and elaborate why current execution systems fail to enable these mechanisms. To address these limitations, we provide a system design based on the speculative execution of an AV's pipeline in the cloud, and show the efficacy of this approach in simulations of complex real-world scenarios that apply these mechanisms., Comment: IROS 2023 (to appear); 8 pages, 7 figures, 2 tables
- Published
- 2023
84. Hug model: parameter estimation via the ABC Shadow algorithm
- Author
-
Reype, Christophe, Stoica, Radu S., Gemmerlé, Didier, Richard, Antonin, and Deaconu, Madalina
- Subjects
Physics - Geophysics - Abstract
Studying geological fluids mixing systems allows to understand interaction among water sources. The Hug model is an interaction point process model that can be used to estimate the number and the chemical composition of the water sources involved in a geological fluids mixing system from the chemical composition of samples Reype (2022); Reype et al. (2020, 2022). The source detection using the Hug model needs prior knowledge for the model parameters. The present work shows how the parameter estimation method known as the ABC Shadow algorithm Stoica et al. (2021, 2017) can be used in order to construct priors for the parameters of the Hug model. The long term perspective of this work is to integrate geological expertise within fully unsupervised models., Comment: RING Meeting, Ecole Nationale Sup{\'e}rieure G{\'e}ologie Nancy, Sep 2023, Nancy, France
- Published
- 2023
85. Are observers reducible to structures?
- Author
-
Stoica, Ovidiu Cristinel
- Subjects
Physics - History and Philosophy of Physics ,Quantum Physics - Abstract
Physical systems are characterized by their structure and dynamics. But the physical laws only express relations, and their symmetries allow any possible relational structure to be also possible in a different parametrization or basis of the state space. If observers were reducible to their structure, observer-like structures from different parametrizations would identify differently the observables with physical properties. They would perceive the same system as being in a different state. This leads to the question: is there a unique correspondence between observables and physical properties, or this correspondence is relative to the parametrization in which the observer-like structure making the observation exists? I show that, if observer-like structures from all parametrizations were observers, their memory of the external world would have no correspondence with the facts, it would be no better than random guess. Since our experience shows that this is not the case, there must be more to the observers than their structure. This implies that the correspondence between observables and physical properties is unique, and it becomes manifest through the observers. This result is independent of the measurement problem, applying to both quantum and classical physics. It has implications for structural realism, philosophy of mind, the foundations of quantum and classical physics, and quantum-first approaches., Comment: 9 pages. Comments welcome!
- Published
- 2023
86. AI and the EU Digital Markets Act: Addressing the Risks of Bigness in Generative AI
- Author
-
Yasar, Ayse Gizem, Chong, Andrew, Dong, Evan, Gilbert, Thomas Krendl, Hladikova, Sarah, Maio, Roland, Mougan, Carlos, Shen, Xudong, Singh, Shubham, Stoica, Ana-Andreea, Thais, Savannah, and Zilka, Miri
- Subjects
Computer Science - Computers and Society ,Computer Science - Artificial Intelligence - Abstract
As AI technology advances rapidly, concerns over the risks of bigness in digital markets are also growing. The EU's Digital Markets Act (DMA) aims to address these risks. Still, the current framework may not adequately cover generative AI systems that could become gateways for AI-based services. This paper argues for integrating certain AI software as core platform services and classifying certain developers as gatekeepers under the DMA. We also propose an assessment of gatekeeper obligations to ensure they cover generative AI services. As the EU considers generative AI-specific rules and possible DMA amendments, this paper provides insights towards diversity and openness in generative AI services., Comment: ICML'23 Workshop Generative AI + Law (GenLaw)
- Published
- 2023
87. AutoML Insights: Gaining Confidence to Operationalize Predictive Models
- Author
-
Stoica, Florin, primary and Florentina Stoica, Laura, additional
- Published
- 2024
- Full Text
- View/download PDF
88. Apixaban for Prevention of Thromboembolism in Pediatric Heart Disease
- Author
-
Payne, R Mark, Burns, Kristin M, Glatz, Andrew C, Male, Christoph, Donti, Andrea, Brandão, Leonardo R, Balling, Gunter, VanderPluym, Christina J, Bu'Lock, Frances, Kochilas, Lazaros K, Stiller, Brigitte, Cnota, James F, Rahkonen, Otto, Khan, Asra, Adorisio, Rachele, Stoica, Serban, May, Lindsay, Burns, Jane C, Saraiva, Jose Francisco K, McHugh, Kimberly E, Kim, John S, Rubio, Agustin, Chía-Vazquez, Nadia G, Meador, Marcie R, Dyme, Joshua L, Reedy, Alison M, Ajavon-Hartmann, Toni, Jarugula, Praneeth, Carlson-Taneja, Lauren E, Mills, Donna, Wheaton, Olivia, and Monagle, Paul
- Subjects
Biomedical and Clinical Sciences ,Cardiovascular Medicine and Haematology ,Clinical Sciences ,Orphan Drug ,Heart Disease ,Hematology ,Pediatric ,Clinical Research ,Women's Health ,Rare Diseases ,Prevention ,Patient Safety ,Cardiovascular ,Clinical Trials and Supportive Activities ,6.1 Pharmaceuticals ,Child ,Humans ,Infant ,Newborn ,Anticoagulants ,Fibrinolytic Agents ,Heart Diseases ,Hemorrhage ,Heparin ,Low-Molecular-Weight ,Pyridones ,Quality of Life ,Venous Thromboembolism ,Vitamin K ,anticoagulation ,apixaban ,congenital ,heart ,pediatric ,thromboembolism ,Cardiorespiratory Medicine and Haematology ,Public Health and Health Services ,Cardiovascular System & Hematology ,Cardiovascular medicine and haematology - Abstract
BackgroundChildren with heart disease frequently require anticoagulation for thromboprophylaxis. Current standard of care (SOC), vitamin K antagonists or low-molecular-weight heparin, has significant disadvantages.ObjectivesThe authors sought to describe safety, pharmacokinetics (PK), pharmacodynamics, and efficacy of apixaban, an oral, direct factor Xa inhibitor, for prevention of thromboembolism in children with congenital or acquired heart disease.MethodsPhase 2, open-label trial in children (ages, 28 days to
- Published
- 2023
89. Single-cell DNA methylome and 3D multi-omic atlas of the adult mouse brain.
- Author
-
Liu, Hanqing, Zeng, Qiurui, Zhou, Jingtian, Bartlett, Anna, Wang, Bang-An, Tian, Wei, Kenworthy, Mia, Altshul, Jordan, Nery, Joseph, Chen, Huaming, Castanon, Rosa, Zu, Songpeng, Li, Yang, Lucero, Jacinta, Osteen, Julia, Pinto-Duarte, Antonio, Lee, Jasper, Rink, Jon, Cho, Silvia, Emerson, Nora, Nunn, Michael, OConnor, Carolyn, Wu, Zhanghao, Stoica, Ion, Yao, Zizhen, Smith, Kimberly, Tasic, Bosiljka, Luo, Chongyuan, Dixon, Jesse, Zeng, Hongkui, Ren, Bing, Behrens, M, Ecker, Joseph, and Berube, Peter
- Subjects
Animals ,Mice ,Brain ,Chromatin ,Cytosine ,Datasets as Topic ,DNA Methylation ,Epigenome ,Multiomics ,Single-Cell Analysis ,Transcription Factors ,Transcription ,Genetic - Abstract
Cytosine DNA methylation is essential in brain development and is implicated in various neurological disorders. Understanding DNA methylation diversity across the entire brain in a spatial context is fundamental for a complete molecular atlas of brain cell types and their gene regulatory landscapes. Here we used single-nucleus methylome sequencing (snmC-seq3) and multi-omic sequencing (snm3C-seq)1 technologies to generate 301,626 methylomes and 176,003 chromatin conformation-methylome joint profiles from 117 dissected regions throughout the adult mouse brain. Using iterative clustering and integrating with companion whole-brain transcriptome and chromatin accessibility datasets, we constructed a methylation-based cell taxonomy with 4,673 cell groups and 274 cross-modality-annotated subclasses. We identified 2.6 million differentially methylated regions across the genome that represent potential gene regulation elements. Notably, we observed spatial cytosine methylation patterns on both genes and regulatory elements in cell types within and across brain regions. Brain-wide spatial transcriptomics data validated the association of spatial epigenetic diversity with transcription and improved the anatomical mapping of our epigenetic datasets. Furthermore, chromatin conformation diversities occurred in important neuronal genes and were highly associated with DNA methylation and transcription changes. Brain-wide cell-type comparisons enabled the construction of regulatory networks that incorporate transcription factors, regulatory elements and their potential downstream gene targets. Finally, intragenic DNA methylation and chromatin conformation patterns predicted alternative gene isoform expression observed in a whole-brain SMART-seq2 dataset. Our study establishes a brain-wide, single-cell DNA methylome and 3D multi-omic atlas and provides a valuable resource for comprehending the cellular-spatial and regulatory genome diversity of the mouse brain.
- Published
- 2023
90. Biopolymeric Matrices for Food Packaging
- Author
-
Stoica, Maricica, Abd-Elsalam, Kamel A., Bichescu, Cezar Ionuț, Ivan, Angela Stela, Moraru, Dana Iulia, Săracu, Alina Florentina, Șavga, Larisa, Stoica, Dimitrie, Prasad, Ram, Series Editor, Abd-Elsalam, Kamel A., editor, Hashim, Ayat F., editor, Ahmed, Farah K., editor, and Thomas, Sabu, editor
- Published
- 2024
- Full Text
- View/download PDF
91. Gen Z buying patterns: comparing the influence of professional advising and social media engagement
- Author
-
Stoica, Michael and Hickman, Thomas M.
- Published
- 2024
- Full Text
- View/download PDF
92. Speaking Well and Feeling Good: Age-Related Differences in the Affective Language of Resting State Thought
- Author
-
Stoica, Teodora, Andrews, Eric S., Deffner, Austin M., Griffith, Christopher, Grilli, Matthew D., and Andrews-Hanna, Jessica R.
- Published
- 2024
- Full Text
- View/download PDF
93. Statistical inference for random T-tessellations models. Application to agricultural landscape modeling
- Author
-
Adamczyk-Chauvat, Katarzyna, Kassa, Mouna, Papaïx, Julien, Kiêu, Kiên, and Stoica, Radu S.
- Published
- 2024
- Full Text
- View/download PDF
94. Picosecond volume expansion drives a later-time insulator–metal transition in a nano-textured Mott insulator
- Author
-
Verma, Anita, Golež, Denis, Gorobtsov, Oleg Yu., Kaj, Kelson, Russell, Ryan, Kaaret, Jeffrey Z., Lamb, Erik, Khalsa, Guru, Nair, Hari P., Sun, Yifei, Bouck, Ryan, Schreiber, Nathaniel, Ruf, Jacob P., Ramaprasad, Varun, Kubota, Yuya, Togashi, Tadashi, Stoica, Vladimir A., Padmanabhan, Hari, Freeland, John W., Benedek, Nicole A., Shpyrko, Oleg G., Harter, John W., Averitt, Richard D., Schlom, Darrell G., Shen, Kyle M., Millis, Andrew J., and Singer, Andrej
- Published
- 2024
- Full Text
- View/download PDF
95. The Relation between Wavefunction and 3D Space Implies Many Worlds with Local Beables and Probabilities
- Author
-
Stoica, Ovidiu Cristinel
- Subjects
Quantum Physics ,Physics - History and Philosophy of Physics - Abstract
We show that the quantum wavefunctional can be seen as a set of classical fields on the 3D space aggregated by a measure. We obtain a complete description of the wavefunctional in terms of classical local beables. With this correspondence, classical explanations of the macro level and of probabilities transfer almost directly to the quantum. A key difference is that, in quantum theory, the classical states coexist in parallel, so the probabilities come from self-location uncertainty. We show that these states are distributed according to the Born rule. The coexistence of classical states implies that there are many worlds, even if we assume the collapse postulate. This leads automatically to a new version of the many-worlds interpretation in which the major objections are addressed naturally. We show that background-free quantum gravity provides additional support for this proposal and suggests why branching happens toward the future., Comment: 10 pages. Presented at the MWI Workshop, Tel-Aviv University, 18-24 October 2022
- Published
- 2023
- Full Text
- View/download PDF
96. Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
- Author
-
Zheng, Lianmin, Chiang, Wei-Lin, Sheng, Ying, Zhuang, Siyuan, Wu, Zhanghao, Zhuang, Yonghao, Lin, Zi, Li, Zhuohan, Li, Dacheng, Xing, Eric P., Zhang, Hao, Gonzalez, Joseph E., and Stoica, Ion
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Evaluating large language model (LLM) based chat assistants is challenging due to their broad capabilities and the inadequacy of existing benchmarks in measuring human preferences. To address this, we explore using strong LLMs as judges to evaluate these models on more open-ended questions. We examine the usage and limitations of LLM-as-a-judge, including position, verbosity, and self-enhancement biases, as well as limited reasoning ability, and propose solutions to mitigate some of them. We then verify the agreement between LLM judges and human preferences by introducing two benchmarks: MT-bench, a multi-turn question set; and Chatbot Arena, a crowdsourced battle platform. Our results reveal that strong LLM judges like GPT-4 can match both controlled and crowdsourced human preferences well, achieving over 80% agreement, the same level of agreement between humans. Hence, LLM-as-a-judge is a scalable and explainable way to approximate human preferences, which are otherwise very expensive to obtain. Additionally, we show our benchmark and traditional benchmarks complement each other by evaluating several variants of LLaMA and Vicuna. The MT-bench questions, 3K expert votes, and 30K conversations with human preferences are publicly available at https://github.com/lm-sys/FastChat/tree/main/fastchat/llm_judge., Comment: NeurIPS 2023 Datasets and Benchmarks Track
- Published
- 2023
97. Modelling the Wall Pressure Fluctuations on the VEGA-C Launcher in Supersonic Conditions
- Author
-
Camussi, R., Di Marco, Alessandro, De Paola, Elisa, Stoica, Gerorgiana Luana, Stoica, Cornelius, Paglia, Fabio, Romano, Luca, and Barbagallo, Daniele
- Published
- 2024
- Full Text
- View/download PDF
98. Two new algorithms for maximum likelihood estimation of sparse covariance matrices with applications to graphical modeling
- Author
-
Fatima, Ghania, Babu, Prabhu, and Stoica, Petre
- Subjects
Statistics - Methodology ,Electrical Engineering and Systems Science - Signal Processing - Abstract
In this paper, we propose two new algorithms for maximum-likelihood estimation (MLE) of high dimensional sparse covariance matrices. Unlike most of the state of-the-art methods, which either use regularization techniques or penalize the likelihood to impose sparsity, we solve the MLE problem based on an estimated covariance graph. More specifically, we propose a two-stage procedure: in the first stage, we determine the sparsity pattern of the target covariance matrix (in other words the marginal independence in the covariance graph under a Gaussian graphical model) using the multiple hypothesis testing method of false discovery rate (FDR), and in the second stage we use either a block coordinate descent approach to estimate the non-zero values or a proximal distance approach that penalizes the distance between the estimated covariance graph and the target covariance matrix. Doing so gives rise to two different methods, each with its own advantage: the coordinate descent approach does not require tuning of any hyper-parameters, whereas the proximal distance approach is computationally fast but requires a careful tuning of the penalty parameter. Both methods are effective even in cases where the number of observed samples is less than the dimension of the data. For performance evaluation, we test the proposed methods on both simulated and real-world data and show that they provide more accurate estimates of the sparse covariance matrix than two state-of-the-art methods.
- Published
- 2023
99. Pearson-Matthews correlation coefficients for binary and multinary classification and hypothesis testing
- Author
-
Stoica, Petre and Babu, Prabhu
- Subjects
Electrical Engineering and Systems Science - Signal Processing ,Statistics - Machine Learning - Abstract
The Pearson-Matthews correlation coefficient (usually abbreviated MCC) is considered to be one of the most useful metrics for the performance of a binary classification or hypothesis testing method (for the sake of conciseness we will use the classification terminology throughout, but the concepts and methods discussed in the paper apply verbatim to hypothesis testing as well). For multinary classification tasks (with more than two classes) the existing extension of MCC, commonly called the $\text{R}_{\text{K}}$ metric, has also been successfully used in many applications. The present paper begins with an introductory discussion on certain aspects of MCC. Then we go on to discuss the topic of multinary classification that is the main focus of this paper and which, despite its practical and theoretical importance, appears to be less developed than the topic of binary classification. Our discussion of the $\text{R}_{\text{K}}$ is followed by the introduction of two other metrics for multinary classification derived from the multivariate Pearson correlation (MPC) coefficients. We show that both $\text{R}_{\text{K}}$ and the MPC metrics suffer from the problem of not decisively indicating poor classification results when they should, and introduce three new enhanced metrics that do not suffer from this problem. We also present an additional new metric for multinary classification which can be viewed as a direct extension of MCC.
- Published
- 2023
100. Fair principal component analysis (PCA): minorization-maximization algorithms for Fair PCA, Fair Robust PCA and Fair Sparse PCA
- Author
-
Babu, Prabhu and Stoica, Petre
- Subjects
Statistics - Machine Learning ,Electrical Engineering and Systems Science - Signal Processing - Abstract
In this paper we propose a new iterative algorithm to solve the fair PCA (FPCA) problem. We start with the max-min fair PCA formulation originally proposed in [1] and derive a simple and efficient iterative algorithm which is based on the minorization-maximization (MM) approach. The proposed algorithm relies on the relaxation of a semi-orthogonality constraint which is proved to be tight at every iteration of the algorithm. The vanilla version of the proposed algorithm requires solving a semi-definite program (SDP) at every iteration, which can be further simplified to a quadratic program by formulating the dual of the surrogate maximization problem. We also propose two important reformulations of the fair PCA problem: a) fair robust PCA -- which can handle outliers in the data, and b) fair sparse PCA -- which can enforce sparsity on the estimated fair principal components. The proposed algorithms are computationally efficient and monotonically increase their respective design objectives at every iteration. An added feature of the proposed algorithms is that they do not require the selection of any hyperparameter (except for the fair sparse PCA case where a penalty parameter that controls the sparsity has to be chosen by the user). We numerically compare the performance of the proposed methods with two of the state-of-the-art approaches on synthetic data sets and a real-life data set.
- Published
- 2023
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.