24,202 results on '"Danial, A."'
Search Results
2. MIRI Deep Imaging Survey (MIDIS) of the Hubble Ultra Deep Field
- Author
-
Östlin, Göran, Pérez-González, Pablo G., Melinder, Jens, Gillman, Steven, Iani, Edoardo, Costantin, Luca, Boogaard, Leindert A., Rinaldi, Pierluigi, Colina, Luis, Nørgaard-Nielsen, Hans Ulrik, Dicken, Daniel, Greve, Thomas R., Wright, Gillian, Alonso-Herrero, Almudena, Alvarez-Marquez, Javier, Annunziatella, Marianna, Bik, Arjan, Bosman, Sarah E. I., Caputi, Karina I., Gomez, Alejandro Crespo, Eckart, Andreas, Garcia-Marin, Macarena, Hjorth, Jens, Ilbert, Olivier, Jermann, Iris, Kendrew, Sarah, Labiano, Alvaro, Langeroodi, Danial, Fevre, Olivier Le, Libralato, Mattia, Meyer, Romain A., Moutard, Thibaud, Peissker, Florian, Pye, John P., Tikkanen, Tuomo V., Topinka, Martin, Walter, Fabian, Ward, Martin, van der Werf, Paul, van Dishoeck, Ewine F., Henning, Manuel Güdel Thomas, Lagage, Pierre-Olivier, Ray, Tom P., and Vandenbussche, Bart
- Subjects
Astrophysics - Astrophysics of Galaxies ,Astrophysics - Cosmology and Nongalactic Astrophysics - Abstract
The recently launched James Webb Space Telescope (JWST) is opening new observing windows on the distant universe. Among JWST's instruments, the Mid Infrared Instrument (MIRI) offers the unique capability of imaging observations at wavelengths $\lambda > 5\mu$m. This enables unique access to the rest frame near infra-red (NIR, $\lambda \ge 1$\mum) emission from galaxies at redshifts $z>4$ and the visual ($\lambda \gtrsim 5000$\AA) rest frame for $z>9$. We here report on the guaranteed time observations (GTO) from the MIRI European Consortium, of the Hubble Ultra Deep Field (HUDF), forming the MIRI Deep Imaging Survey (MIDIS), consisting of an on source integration time of $\sim41$ hours in the MIRI/F560W (5.6 $\mu$m) filter. To our knowledge, this constitutes the longest single filter exposure obtained with JWST of an extragalactic field as yet., Comment: submitted to A&A on July 30, 2024
- Published
- 2024
3. High-Performances AlGaN-based DUV-LED via Under-Level Multiple Quantum Well Configuration
- Author
-
Aman, Mohammad Amirul Hairol, Azrisham, Nurul Fathinah, Noorden, Ahmad Fakhrurrazi Ahmad, Danial, Wan Hazman, and Kadir, Muhamad Zamzuri Abdul
- Subjects
Physics - Optics ,Physics - Applied Physics ,Physics - Computational Physics - Abstract
Low internal and external quantum efficiencies in high Aluminium content AlGaN-based deep-ultraviolet light-emitting diode (DUV-LED) occurred due to strong polarization effects, spontaneous and piezoelectric polarization, at the interface between two materials. It also leads to a low carrier confinement and Quantum Confined Stark Effect (QCSE), contributing to the efficiency droop of the DUV-LED. This work demonstrates an under-level MQW configuration implemented in a DUV-LED with a 257 nm emission wavelength. Three DUV-LED structures, above-, same- and under-level MQW were investigated, covering important optoelectronics properties such as energy band diagram, carrier concentrations, radiative recombination rates and electric field distribution. It is found that the quantum efficiencies, luminescence intensity and light output power of the under-level configuration has been enhanced by nine-, ten- and five-folds, relative to the above-level MQW configuration., Comment: 22 Pages, 6 Figures, 1 Table
- Published
- 2024
4. NBMLSS: probabilistic forecasting of electricity prices via Neural Basis Models for Location Scale and Shape
- Author
-
Brusaferri, Alessandro, Ramin, Danial, and Ballarino, Andrea
- Subjects
Computer Science - Machine Learning - Abstract
Forecasters using flexible neural networks (NN) in multi-horizon distributional regression setups often struggle to gain detailed insights into the underlying mechanisms that lead to the predicted feature-conditioned distribution parameters. In this work, we deploy a Neural Basis Model for Location, Scale and Shape, that blends the principled interpretability of GAMLSS with a computationally scalable shared basis decomposition, combined by linear projections supporting dedicated stepwise and parameter-wise feature shape functions aggregations. Experiments have been conducted on multiple market regions, achieving probabilistic forecasting performance comparable to that of distributional neural networks, while providing more insights into the model behavior through the learned nonlinear feature level maps to the distribution parameters across the prediction steps., Comment: 23 pages
- Published
- 2024
5. Exploring the Impact of Quizzes Interleaved with Write-Code Tasks in Elementary-Level Visual Programming
- Author
-
Ghosh, Ahana, Malva, Liina, Gotovos, Alkis, Hooshyar, Danial, and Singla, Adish
- Subjects
Computer Science - Computers and Society - Abstract
We explore the role of quizzes in elementary visual programming domains popularly used for K-8 computing education. Prior work has studied various quiz types, such as fill-in-the-gap write-code questions. However, the overall impact of these quizzes is unclear: studies often show utility in the learning phase when enhanced with quizzes, though limited transfer of utility in the post-learning phase. In this paper, we aim to better understand the impact of different quiz types and whether quizzes focusing on diverse skills (e.g., code debugging and task design) would have higher utility. We design a study with Hour of Code: Maze Challenge by code.org as the base curriculum, interleaved with different quiz types. Specifically, we examine two learning groups: (i) HoC-ACE with diverse quizzes including solution tracing, code debugging, code equivalence, and task design; (ii) HoC-Fill with simple quizzes on solution finding. We conducted a large-scale study with 405 students in grades 6--7. Our results highlight that the curriculum enhanced with richer quizzes led to higher utility during the post-learning phase., Comment: Preprint of the SIGCSE'25 paper
- Published
- 2024
6. Quantum Algorithm for Vibronic Dynamics: Case Study on Singlet Fission Solar Cell Design
- Author
-
Motlagh, Danial, Lang, Robert A., Campos-Gonzalez-Angulo, Jorge A., Zeng, Tao, Aspuru-Guzik, Alan, and Arrazola, Juan Miguel
- Subjects
Quantum Physics - Abstract
Vibronic interactions between nuclear motion and electronic states are critical for the accurate modeling of photochemistry. However, accurate simulations of fully quantum non-adiabatic dynamics are often prohibitively expensive for classical methods beyond small systems. In this work, we present a quantum algorithm based on product formulas for simulating time evolution under a general vibronic Hamiltonian in real space, capable of handling an arbitrary number of electronic states and vibrational modes. We develop the first trotterization scheme for vibronic Hamiltonians beyond two electronic states and introduce an array of optimization techniques for the exponentiation of each fragment in the product formula, resulting in a remarkably low cost of implementation. To demonstrate practical relevance, we outline a proof-of-principle integration of our algorithm into a materials discovery pipeline for designing more efficient singlet fission-based organic solar cells. Based on commutator bounds, we estimate that a $100$ femtosecond evolution using a second-order Trotter product formula of a $4$-state model of an anthracene-fullerene interface requires $117$ qubits and $1.5 \times 10^7$ Toffoli gates in a reduced dimensionality of $11$ modes. In its full dimensionality of $246$ modes, it requires $1065$ qubits and $2.7 \times 10^9$ Toffoli gates.
- Published
- 2024
7. Effective Predictive Modeling for Emergency Department Visits and Evaluating Exogenous Variables Impact: Using Explainable Meta-learning Gradient Boosting
- Author
-
Neshat, Mehdi, Phipps, Michael, Jha, Nikhil, Khojasteh, Danial, Tong, Michael, and Gandomi, Amir
- Subjects
Computer Science - Machine Learning ,Computer Science - Neural and Evolutionary Computing - Abstract
Over an extensive duration, administrators and clinicians have endeavoured to predict Emergency Department (ED) visits with precision, aiming to optimise resource distribution. Despite the proliferation of diverse AI-driven models tailored for precise prognostication, this task persists as a formidable challenge, besieged by constraints such as restrained generalisability, susceptibility to overfitting and underfitting, scalability issues, and complex fine-tuning hyper-parameters. In this study, we introduce a novel Meta-learning Gradient Booster (Meta-ED) approach for precisely forecasting daily ED visits and leveraging a comprehensive dataset of exogenous variables, including socio-demographic characteristics, healthcare service use, chronic diseases, diagnosis, and climate parameters spanning 23 years from Canberra Hospital in ACT, Australia. The proposed Meta-ED consists of four foundational learners-Catboost, Random Forest, Extra Tree, and lightGBoost-alongside a dependable top-level learner, Multi-Layer Perceptron (MLP), by combining the unique capabilities of varied base models (sub-learners). Our study assesses the efficacy of the Meta-ED model through an extensive comparative analysis involving 23 models. The evaluation outcomes reveal a notable superiority of Meta-ED over the other models in accuracy at 85.7% (95% CI ;85.4%, 86.0%) and across a spectrum of 10 evaluation metrics. Notably, when compared with prominent techniques, XGBoost, Random Forest (RF), AdaBoost, LightGBoost, and Extra Tree (ExT), Meta-ED showcases substantial accuracy enhancements of 58.6%, 106.3%, 22.3%, 7.0%, and 15.7%, respectively. Furthermore, incorporating weather-related features demonstrates a 3.25% improvement in the prediction accuracy of visitors' numbers. The encouraging outcomes of our study underscore Meta-ED as a foundation model for the precise prediction of daily ED visitors.
- Published
- 2024
8. LLM-assisted Physical Invariant Extraction for Cyber-Physical Systems Anomaly Detection
- Author
-
Abshari, Danial, Fu, Chenglong, and Sridhar, Meera
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Artificial Intelligence - Abstract
Modern industrial infrastructures rely heavily on Cyber-Physical Systems (CPS), but these are vulnerable to cyber-attacks with potentially catastrophic effects. To reduce these risks, anomaly detection methods based on physical invariants have been developed. However, these methods often require domain-specific expertise to manually define invariants, making them costly and difficult to scale. To address this limitation, we propose a novel approach to extract physical invariants from CPS testbeds for anomaly detection. Our insight is that CPS design documentation often contains semantically rich descriptions of physical procedures, which can profile inter-correlated dynamics among system components. Leveraging the built-in physics and engineering knowledge of recent generative AI models, we aim to automate this traditionally manual process, improving scalability and reducing costs. This work focuses on designing and optimizing a Retrieval-Augmented-Generation (RAG) workflow with a customized prompting system tailored for CPS documentation, enabling accurate extraction of semantic information and inference of physical invariants from complex, multimodal content. Then, rather than directly applying the inferred invariants for anomaly detection, we introduce an innovative statistics-based learning approach that integrates these invariants into the training dataset. This method addresses limitations such as hallucination and concept drift, enhancing the reliability of the model. We evaluate our approach on real-world public CPS security dataset which contains 86 data points and 58 attacking cases. The results show that our approach achieves a high precision of 0.923, accurately detecting anomalies while minimizing false alarms.
- Published
- 2024
9. Floquet Topological Dissipative Kerr Solitons and Incommensurate Frequency Combs
- Author
-
Hashemi, Seyed Danial and Mittal, Sunil
- Subjects
Physics - Optics - Abstract
Generating coherent optical frequency combs in micro-ring resonators with Kerr nonlinearity has remarkably advanced the fundamental understanding and applications of temporal dissipative solitons. However, the spectrum of such soliton combs is restricted to the conventional definition of combs as phase-locked, equidistant lines in frequency. Here, we introduce a new class of floquet topological soliton combs that emerge in two-dimensional arrays of strongly coupled resonators engineered using floquet topology. Specifically, we demonstrate novel incommensurate combs where the comb lines are not equidistant but remain phase-locked. These incommensurate combs are generated by self-organized, phase-locked floquet topological soliton molecules that circulate the edge of the array. We show that these floquet topological solitons are robust and they navigate around defects, allowing for agile tunability of the comb line spacing. Our results introduce a new paradigm in using floquet engineering to generate unconventional frequency combs beyond those achievable with single or weakly coupled resonators.
- Published
- 2024
- Full Text
- View/download PDF
10. UQ of 2D Slab Burner DNS: Surrogates, Uncertainty Propagation, and Parameter Calibration
- Author
-
Georgalis, Georgios, Becerra, Alejandro, Budzinski, Kenneth, McGurn, Matthew, Faghihi, Danial, DesJardin, Paul E., and Patra, Abani
- Subjects
Physics - Computational Physics ,Computer Science - Machine Learning - Abstract
The goal of this paper is to demonstrate and address challenges related to all aspects of performing a complete uncertainty quantification (UQ) analysis of a complicated physics-based simulation like a 2D slab burner direct numerical simulation (DNS). The UQ framework includes the development of data-driven surrogate models, propagation of parametric uncertainties to the fuel regression rate--the primary quantity of interest--and Bayesian calibration of critical parameters influencing the regression rate using experimental data. Specifically, the parameters calibrated include the latent heat of sublimation and a chemical reaction temperature exponent. Two surrogate models, a Gaussian Process (GP) and a Hierarchical Multiscale Surrogate (HMS) were constructed using an ensemble of 64 simulations generated via Latin Hypercube sampling. Both models exhibited comparable performance during cross-validation. However, the HMS was more stable due to its ability to handle multiscale effects, in contrast with the GP which was very sensitive to kernel choice. Analysis revealed that the surrogates do not accurately predict all spatial locations of the slab burner as-is. Subsequent Bayesian calibration of the physical parameters against experimental observations resulted in regression rate predictions that closer align with experimental observation in specific regions. This study highlights the importance of surrogate model selection and parameter calibration in quantifying uncertainty in predictions of fuel regression rates in complex combustion systems.
- Published
- 2024
11. Automated Defect Detection and Grading of Piarom Dates Using Deep Learning
- Author
-
Azimi, Nasrin and Rezaei, Danial Mohammad
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
Grading and quality control of Piarom dates, a premium and high-value variety cultivated predominantly in Iran, present significant challenges due to the complexity and variability of defects, as well as the absence of specialized automated systems tailored to this fruit. Traditional manual inspection methods are labor intensive, time consuming, and prone to human error, while existing AI-based sorting solutions are insufficient for addressing the nuanced characteristics of Piarom dates. In this study, we propose an innovative deep learning framework designed specifically for the real-time detection, classification, and grading of Piarom dates. Leveraging a custom dataset comprising over 9,900 high-resolution images annotated across 11 distinct defect categories, our framework integrates state-of-the-art object detection algorithms and Convolutional Neural Networks (CNNs) to achieve high precision in defect identification. Furthermore, we employ advanced segmentation techniques to estimate the area and weight of each date, thereby optimizing the grading process according to industry standards. Experimental results demonstrate that our system significantly outperforms existing methods in terms of accuracy and computational efficiency, making it highly suitable for industrial applications requiring real-time processing. This work not only provides a robust and scalable solution for automating quality control in the Piarom date industry but also contributes to the broader field of AI-driven food inspection technologies, with potential applications across various agricultural products.
- Published
- 2024
12. Comparative Study of Multilingual Idioms and Similes in Large Language Models
- Author
-
Khoshtab, Paria, Namazifard, Danial, Masoudi, Mostafa, Akhgary, Ali, Sani, Samin Mahdizadeh, and Yaghoobzadeh, Yadollah
- Subjects
Computer Science - Computation and Language - Abstract
This study addresses the gap in the literature concerning the comparative performance of LLMs in interpreting different types of figurative language across multiple languages. By evaluating LLMs using two multilingual datasets on simile and idiom interpretation, we explore the effectiveness of various prompt engineering strategies, including chain-of-thought, few-shot, and English translation prompts. We extend the language of these datasets to Persian as well by building two new evaluation sets. Our comprehensive assessment involves both closed-source (GPT-3.5, GPT-4o mini, Gemini 1.5), and open-source models (Llama 3.1, Qwen2), highlighting significant differences in performance across languages and figurative types. Our findings reveal that while prompt engineering methods are generally effective, their success varies by figurative type, language, and model. We also observe that open-source models struggle particularly with low-resource languages in similes. Additionally, idiom interpretation is nearing saturation for many languages, necessitating more challenging evaluations., Comment: 22 pages, 4 figures
- Published
- 2024
13. Rapid Dust Formation in the Early Universe
- Author
-
Langeroodi, Danial, Hjorth, Jens, Ferrara, Andrea, and Gall, Christa
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
Interstellar dust links the formation of the first stars to the rocky planet we inhabit by playing a pivotal role in the cooling and fragmentation of molecular clouds, and catalyzing the formation of water and organic molecules. Despite its central role, the origin of dust and its formation timescale remain unknown. Some models favor rapid production in supernova ejecta as the primary origin of dust, while others invoke slower production by evolved asymptotic giant branch stars or grain growth in the interstellar medium (ISM). The dust content of young early-universe galaxies is highly sensitive to the dust formation timescales. Here, we evaluate the dust content of 631 galaxies at $3 < z_{\rm spec} < 14$ based on rest-UV to optical spectroscopy obtained with JWST NIRSpec. We find that dust appears rapidly. Attenuation immediately follows star formation on timescales shorter than $\sim 30$ Myr, favoring dust production by supernovae. The degree of attenuation is $\sim 30$ times lower than expected if the entire supernova dust yield were preserved in the ISM, and had Milky Way-like grain properties. This can be reconciled if the early-universe dust is composed mostly of silicate or grains much larger than those in the Milky Way, and if significant dust destruction or ejection by outflows takes place., Comment: Submitted!
- Published
- 2024
14. Hierarchical Gaussian Process-Based Bayesian Optimization for Materials Discovery in High Entropy Alloy Spaces
- Author
-
Alvi, Sk Md Ahnaf Akif, Janssen, Jan, Khatamsaz, Danial, Perez, Danny, Allaire, Douglas, and Arroyave, Raymundo
- Subjects
Condensed Matter - Materials Science - Abstract
Bayesian optimization (BO) is a powerful and data-efficient method for iterative materials discovery and design, particularly valuable when prior knowledge is limited, underlying functional relationships are complex or unknown, and the cost of querying the materials space is significant. Traditional BO methodologies typically utilize conventional Gaussian Processes (cGPs) to model the relationships between material inputs and properties, as well as correlations within the input space. However, cGP-BO approaches often fall short in multi-objective optimization scenarios, where they are unable to fully exploit correlations between distinct material properties. Leveraging these correlations can significantly enhance the discovery process, as information about one property can inform and improve predictions about others. This study addresses this limitation by employing advanced kernel structures to capture and model multi-dimensional property correlations through multi-task (MTGPs) or deep Gaussian Processes (DGPs), thus accelerating the discovery process. We demonstrate the effectiveness of MTGP-BO and DGP-BO in rapidly and robustly solving complex materials design challenges that occur within the context of complex multi-objective optimization -- carried out by leveraging the pyiron workflow manager over FCC FeCrNiCoCu high entropy alloy (HEA) spaces, where traditional cGP-BO approaches fail. Furthermore, we highlight how the differential costs associated with querying various material properties can be strategically leveraged to make the materials discovery process more cost-efficient.
- Published
- 2024
15. Partial-to-Full Registration based on Gradient-SDF for Computer-Assisted Orthopedic Surgery
- Author
-
Li, Tiancheng, Walker, Peter, Hammoud, Danial, Zhao, Liang, and Huang, Shoudong
- Subjects
Computer Science - Robotics - Abstract
In computer-assisted orthopedic surgery (CAOS), accurate pre-operative to intra-operative bone registration is an essential and critical requirement for providing navigational guidance. This registration process is challenging since the intra-operative 3D points are sparse, only partially overlapped with the pre-operative model, and disturbed by noise and outliers. The commonly used method in current state-of-the-art orthopedic robotic system is bony landmarks based registration, but it is very time-consuming for the surgeons. To address these issues, we propose a novel partial-to-full registration framework based on gradient-SDF for CAOS. The simulation experiments using bone models from publicly available datasets and the phantom experiments performed under both optical tracking and electromagnetic tracking systems demonstrate that the proposed method can provide more accurate results than standard benchmarks and be robust to 90% outliers. Importantly, our method achieves convergence in less than 1 second in real scenarios and mean target registration error values as low as 2.198 mm for the entire bone model. Finally, it only requires random acquisition of points for registration by moving a surgical probe over the bone surface without correspondence with any specific bony landmarks, thus showing significant potential clinical value.
- Published
- 2024
16. Early Prediction of Mid-Term and Final Scores Using Deep Learning Models
- Author
-
Danial Hooshyar, Nour El Mawas, and Yeongwook Yang
- Abstract
The use of learner modelling approaches is critical for providing adaptive support in educational computer games, with predictive learner modelling being among the key approaches. While adaptive supports have been shown to improve the effectiveness of educational games, improperly customized support can have negative effects on learning outcomes. To tackle these challenges, we present a novel approach, called DeepLM, that considers a series of time windows representing both sequences of learners' actions during gameplay and estimation of their current competencies (using stealth assessment) to model learners and accordingly predict their future performance. The approach employs a variant of deep neural networks to early predict learners' midterm and final scores simultaneously. The results show that using 20-50% of learners' action sequences can early predict their final scores, with a cross-validated convolutional neural network (CNN) achieving an area under the curve (AUC) and accuracy of 0.879 and 85.3%, respectively. The same model can also achieve high accuracy in predicting midterm and final scores at the same time, with an AUC and accuracy of 0.848 and 77.9%. Overall, the CNN model outperforms recurrent neural network, long short-term memory, and baseline multilayer perceptron (MLP) models in predicting learners' final performance and performs better than the baseline MLP model in predicting learners' midterm and final performance using both cross-validation and independent datasets. These findings show the potential of the proposed approach in accurately early predicting learners' performance, allowing educators and game designers to tailor interventions and support mechanisms that could lead to optimized learning outcomes.
- Published
- 2024
17. Iranian High School EFL Students' Attitude towards Distance Learning throughout the COVID-19 Pandemic
- Author
-
Nourollah Gharanjik, Behrooz Ghoorchaei, Nematullah Shomoossi, and Danial Babajani Azizi
- Abstract
As in other contexts, Iranian students encountered challenges in the closedown of schools due to the outbreak of COVID-19 pandemic. The present study investigated the attitudes of 108 high school EFL students, using an online questionnaire to find out whether students preferred either to receive distance e-learning or the traditional face-to-face methods of instruction at schools. To this end, the participants completed a 20-item questionnaire. Also, the study attempted to explore EFL students' perceptions of the use of distance learning using individual semi-structured interviews. Thematic analysis was used to analyze the qualitative data and the major emergent themes were discussed. The results revealed that most of the students were not ready to adopt the ongoing distance e-learning system. They complained that the pandemic had negatively impacted their learning and found the distance e-learning system ineffective. The results suggested the need for further studies on empowering the learners and teachers to improve the effectiveness of instruction in critical situations.
- Published
- 2024
18. Analyzing Linguistic Disparities in Telehealth Care Outcomes at a Multidisciplinary Craniofacial Center.
- Author
-
Danial, Elizabeth, Rios, Jennifer, Badiee, Ryan, Rosenbluth, Glenn, and Pomerantz, Jason
- Subjects
ethics/health policies ,pediatrics ,social support ,Female ,Humans ,Male ,Communication Barriers ,Health Services Accessibility ,Healthcare Disparities ,Hispanic or Latino ,Language ,Patient Satisfaction ,Retrospective Studies ,Surveys and Questionnaires ,Telemedicine ,United States - Abstract
OBJECTIVE: To examine linguistic disparities between English- and Spanish-speaking patients in access to care, satisfaction, and telehealth appointment attendance. DESIGN: Retrospective study recording demographics for non-attendance analysis and conducting phone surveys assessing satisfaction with telehealth. SETTING: Data was collected between March and December 2020 at the UCSF Craniofacial Center (CFC), a multidisciplinary pediatric clinic. Patients: English- and Spanish-speaking patients with a telehealth appointment. Interventions: The CFC offered language-concordant outreach, assistance with the telehealth platform, and interpreters at all telehealth appointments. MAIN OUTCOME MEASURES: Demographics and patient-reported satisfaction with telehealth, barriers, and instruction clarity. RESULTS: Medicaid insurance was the only predictor of non-attendance. Surveys revealed that Spanish-speakers had 12.4 times the odds of lacking access to telehealth technology and 10.7 times the odds of needing help with logging on compared to English-speakers. There were no significant differences in satisfaction outcomes. CONCLUSIONS: We attribute this equity in satisfaction to our language-concordant outreach efforts.
- Published
- 2024
19. Predicting the 21-cm field with a hybrid effective field theory approach
- Author
-
Baradaran, Danial, Hadzhiyska, Boryana, White, Martin, and Sailer, Noah
- Subjects
Astronomical Sciences ,Physical Sciences - Published
- 2024
20. A graphical framework for global optimization of mixed-integer nonlinear programs
- Author
-
Davarnia, Danial and Kiaghadi, Mohammadreza
- Subjects
Mathematics - Optimization and Control - Abstract
While mixed-integer linear programming and convex programming solvers have advanced significantly over the past several decades, solution technologies for general mixed-integer nonlinear programs (MINLPs) have yet to reach the same level of maturity. Various problem structures across different application domains remain challenging to model and solve using modern global solvers, primarily due to the lack of efficient parsers and convexification routines for their complex algebraic representations. In this paper, we introduce a novel graphical framework for globally solving MINLPs based on decision diagrams (DDs), which enable the modeling of complex problem structures that are intractable for conventional solution techniques. We describe the core components of this framework, including a graphical reformulation of MINLP constraints, convexification techniques derived from the constructed graphs, efficient cutting plane methods to generate linear outer approximations, and a spatial branch-and-bound scheme with convergence guarantees. In addition to providing a global solution method for tackling challenging MINLPs, our framework addresses a longstanding gap in the DD literature by developing a general-purpose DD-based approach for solving general MINLPs. To demonstrate its capabilities, we apply our framework to solve instances from one of the most difficult classes of unsolved test problems in the MINLP Library, which are otherwise inadmissible for state-of-the-art global solvers.
- Published
- 2024
21. Functional Classification of Spiking Signal Data Using Artificial Intelligence Techniques: A Review
- Author
-
Sharifrazi, Danial, Javed, Nouman, Joloudari, Javad Hassannataj, Alizadehsani, Roohallah, Paradkar, Prasad N., Tan, Ru-San, Acharya, U. Rajendra, and Bhatti, Asim
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Machine Learning ,Quantitative Biology - Neurons and Cognition - Abstract
Human brain neuron activities are incredibly significant nowadays. Neuronal behavior is assessed by analyzing signal data such as electroencephalography (EEG), which can offer scientists valuable information about diseases and human-computer interaction. One of the difficulties researchers confront while evaluating these signals is the existence of large volumes of spike data. Spikes are some considerable parts of signal data that can happen as a consequence of vital biomarkers or physical issues such as electrode movements. Hence, distinguishing types of spikes is important. From this spot, the spike classification concept commences. Previously, researchers classified spikes manually. The manual classification was not precise enough as it involves extensive analysis. Consequently, Artificial Intelligence (AI) was introduced into neuroscience to assist clinicians in classifying spikes correctly. This review discusses the importance and use of AI in spike classification, focusing on the recognition of neural activity noises. The task is divided into three main components: preprocessing, classification, and evaluation. Existing methods are introduced and their importance is determined. The review also highlights the need for more efficient algorithms. The primary goal is to provide a perspective on spike classification for future research and provide a comprehensive understanding of the methodologies and issues involved. The review organizes materials in the spike classification field for future studies. In this work, numerous studies were extracted from different databases. The PRISMA-related research guidelines were then used to choose papers. Then, research studies based on spike classification using machine learning and deep learning approaches with effective preprocessing were selected., Comment: 8 figures, 32 pages
- Published
- 2024
22. Supply Risk-Aware Alloy Discovery and Design
- Author
-
Mulukutla, Mrinalini, Robinson, Robert, Khatamsaz, Danial, Vela, Brent, Vu, Nhu, and Arróyave, Raymundo
- Subjects
Computer Science - Machine Learning ,Condensed Matter - Materials Science - Abstract
Materials design is a critical driver of innovation, yet overlooking the technological, economic, and environmental risks inherent in materials and their supply chains can lead to unsustainable and risk-prone solutions. To address this, we present a novel risk-aware design approach that integrates Supply-Chain Aware Design Strategies into the materials development process. This approach leverages existing language models and text analysis to develop a specialized model for predicting materials feedstock supply risk indices. To efficiently navigate the multi-objective, multi-constraint design space, we employ Batch Bayesian Optimization (BBO), enabling the identification of Pareto-optimal high entropy alloys (HEAs) that balance performance objectives with minimized supply risk. A case study using the MoNbTiVW system demonstrates the efficacy of our approach in four scenarios, highlighting the significant impact of incorporating supply risk into the design process. By optimizing for both performance and supply risk, we ensure that the developed alloys are not only high-performing but also sustainable and economically viable. This integrated approach represents a critical step towards a future where materials discovery and design seamlessly consider sustainability, supply chain dynamics, and comprehensive life cycle analysis., Comment: 26 pages, 11 figures, submitted to Materialia
- Published
- 2024
23. Genesis-Metallicity: Universal Non-Parametric Gas-Phase Metallicity Estimation
- Author
-
Langeroodi, Danial and Hjorth, Jens
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
We introduce genesis-metallicity, a gas-phase metallicity measurement python software employing the direct and strong-line methods depending on the available oxygen lines. The non-parametric strong-line estimator is calibrated based on a kernel density estimate in the 4-dimensional space of O2 = [O II]$\lambda\lambda 3727,29$/H$\beta$; O3 = [O III]$\lambda 5007$/H$\beta$; H$\beta$ equivalent width EW(H$\beta$); and gas-phase metallicity $12 + \log$(O/H). We use a calibration sample of 1551 galaxies at $0 < z < 10$, with direct-method metallicity measurements compiled from the JWST/NIRSpec and ground-based observations. In particular, we report 145 new NIRSpec direct-method metallicity measurements at $z > 1$. We show that the O2, O3, and EW(H$\beta$) measurements are sufficient for a gas-phase metallicity estimate that is more accurate than 0.09 dex. Our calibration is universal, meaning that its accuracy does not depend on the target redshift. Furthermore, the direct-method module employs a non-parametric $t_e$(O II) electron temperature estimator based on a kernel density estimate in the 5-dimensional space of O2, O3, EW(H$\beta$), $t_e$(O II), and $t_e$(O III). This $t_e$(O II) estimator is calibrated based on 1001 spectra with [O III]$\lambda 4363$ and [O II]$\lambda\lambda 7320,30$ detections, notably reporting 30 new NIRSpec detections of the [O II]$\lambda\lambda 7320,30$ doublet. We make genesis-metallicity and its calibration data publicly available and commit to keeping both up-to-date in light of the incoming data., Comment: code available at https://github.com/langeroodi/genesis_metallicity
- Published
- 2024
24. Bio-Eng-LMM AI Assist chatbot: A Comprehensive Tool for Research and Education
- Author
-
Forootani, Ali, Aliabadi, Danial Esmaeili, and Thraen, Daniela
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
This article introduces Bio-Eng-LMM AI chatbot, a versatile platform designed to enhance user interaction for educational and research purposes. Leveraging cutting-edge open-source Large Language Models (LLMs), Bio-Eng-LMM operates as a sophisticated AI assistant, exploiting the capabilities of traditional models like ChatGPT. Central to Bio-Eng-LMM is its implementation of Retrieval Augmented Generation (RAG) through three primary methods: integration of preprocessed documents, real-time processing of user-uploaded files, and information retrieval from any specified website. Additionally, the chatbot incorporates image generation via a Stable Diffusion Model (SDM), image understanding and response generation through LLAVA, and search functionality on the internet powered by secure search engine such as DuckDuckGo. To provide comprehensive support, Bio-Eng-LMM offers text summarization, website content summarization, and both text and voice interaction. The chatbot maintains session memory to ensure contextually relevant and coherent responses. This integrated platform builds upon the strengths of RAG-GPT and Web-Based RAG Query (WBRQ) where the system fetches relevant information directly from the web to enhance the LLMs response generation.
- Published
- 2024
25. Learning Task-Based Trainable Neuromorphic ADCs via Power-Aware Distillation
- Author
-
Vol, Tal, Danial, Loai, and Shlezinger, Nir
- Subjects
Computer Science - Neural and Evolutionary Computing - Abstract
The ability to process signals in digital form depends on analog-to-digital converters (ADCs). Traditionally, ADCs are designed to ensure that the digital representation closely matches the analog signal. However, recent studies have shown that significant power and memory savings can be achieved through task-based acquisition, where the acquisition process is tailored to the downstream processing task. An emerging technology for task-based acquisition involves the use of memristors, which are considered key enablers for neuromorphic computing. Memristors can implement ADCs with tunable mappings, allowing adaptation to specific system tasks or power constraints. In this work, we study task-based acquisition for a generic classification task using memristive ADCs. We consider the unique characteristics of this such neuromorphic ADCs, including their power consumption and noisy read-write behavior, and propose a physically compliant model based on resistive successive approximation register ADCs integrated with memristor components, enabling the adjustment of quantization regions. To optimize performance, we introduce a data-driven algorithm that jointly tunes task-based memristive ADCs alongside both digital and analog processing. Our design addresses the inherent stochasticity of memristors through power-aware distillation, complemented by a specialized learning algorithm that adapts to their unique analog-to-digital mapping. The proposed approach is shown to enhance accuracy by up to 27% and reduce power consumption by up to 66% compared to uniform ADCs. Even under noisy conditions, our method achieves substantial gains, with accuracy improvements of up to 19% and power reductions of up to 57%. These results highlight the effectiveness of our power-aware neuromorphic ADCs in improving system performance across diverse tasks., Comment: Under review in the IEEE
- Published
- 2024
26. Large Language Models versus Classical Machine Learning: Performance in COVID-19 Mortality Prediction Using High-Dimensional Tabular Data
- Author
-
Ghaffarzadeh-Esfahani, Mohammadreza, Ghaffarzadeh-Esfahani, Mahdi, Salahi-Niri, Arian, Toreyhi, Hossein, Atf, Zahra, Mohsenzadeh-Kermani, Amirali, Sarikhani, Mahshad, Tajabadi, Zohreh, Shojaeian, Fatemeh, Bagheri, Mohammad Hassan, Feyzi, Aydin, Tarighatpayma, Mohammadamin, Gazmeh, Narges, Heydari, Fateme, Afshar, Hossein, Allahgholipour, Amirreza, Alimardani, Farid, Salehi, Ameneh, Asadimanesh, Naghmeh, Khalafi, Mohammad Amin, Shabanipour, Hadis, Moradi, Ali, Zadeh, Sajjad Hossein, Yazdani, Omid, Esbati, Romina, Maleki, Moozhan, Nasr, Danial Samiei, Soheili, Amirali, Majlesi, Hossein, Shahsavan, Saba, Soheilipour, Alireza, Goudarzi, Nooshin, Taherifard, Erfan, Hatamabadi, Hamidreza, Samaan, Jamil S, Savage, Thomas, Sakhuja, Ankit, Soroush, Ali, Nadkarni, Girish, Darazam, Ilad Alavi, Pourhoseingholi, Mohamad Amin, and Safavi-Naini, Seyed Amir Ahmad
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,92C50, 68T50 ,J.3 - Abstract
Background: This study aimed to evaluate and compare the performance of classical machine learning models (CMLs) and large language models (LLMs) in predicting mortality associated with COVID-19 by utilizing a high-dimensional tabular dataset. Materials and Methods: We analyzed data from 9,134 COVID-19 patients collected across four hospitals. Seven CML models, including XGBoost and random forest (RF), were trained and evaluated. The structured data was converted into text for zero-shot classification by eight LLMs, including GPT-4 and Mistral-7b. Additionally, Mistral-7b was fine-tuned using the QLoRA approach to enhance its predictive capabilities. Results: Among the CML models, XGBoost and RF achieved the highest accuracy, with F1 scores of 0.87 for internal validation and 0.83 for external validation. In the LLM category, GPT-4 was the top performer with an F1 score of 0.43. Fine-tuning Mistral-7b significantly improved its recall from 1% to 79%, resulting in an F1 score of 0.74, which was stable during external validation. Conclusion: While LLMs show moderate performance in zero-shot classification, fine-tuning can significantly enhance their effectiveness, potentially aligning them closer to CML models. However, CMLs still outperform LLMs in high-dimensional tabular data tasks., Comment: Code is available at: https://github.com/mohammad-gh009/Large-Language-Models-vs-Classical-Machine-learning and https://github.com/Sdamirsa/Tehran_COVID_Cohort. The datasets are available from the corresponding author on reasonable request (sdamirsa@ymail.com)
- Published
- 2024
27. Constraining Dust Formation in the Superluminous Supernova 2017gci with JWST Observations
- Author
-
Gomez, Sebastian, Temim, Tea, Fox, Ori, Villar, V. Ashley, Shahbandeh, Melissa, Ashall, Chris, Jencson, Jacob E., Langeroodi, Danial, De Looze, Ilse, Milisavljevic, Dan, Pierel, Justin, Rest, Armin, Szalai, Tamás, and Tinyanont, Samaporn
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
We present JWST/MIRI observations of the Type I superluminous supernova (SLSN) 2017gci taken over 2000 rest-frame days after the supernova (SN) exploded, which represent the latest phase images taken of any known SLSN. We find that archival \WISE detections of SN\,2017gci taken 70 to 200 days after explosion are most likely explained by an IR dust echo from a $\sim 3 \times 10^{-4}$ M$_\odot$ shell of pre-existing dust, as opposed to freshly-formed dust. New JWST observations reveal IR emission in the field of SN\,2017gci, which we determine is most likely dominated by the host galaxy of the SN, based on the expected flux of the galaxy and the measurable separation between said emission and the location of the SN. Based on models for IR emission of carbonate dust, we place a $3\sigma$ upper limit of $0.83$ M$_\odot$ of dust formed in SN\,2017gci, with a lowest $1\sigma$ limit of $0.44$ M$_\odot$. Infrared (IR) detections of other SLSNe have suggested that SLSNe could be among the most efficient dust producers in the universe. Our results suggest that SLSNe do not necessarily form more dust than other types of SNe, but instead might have a more accelerated dust formation process. More IR observations of a larger sample of SLSNe will be required to determine how efficient dust production is in SLSNe., Comment: 11 pages, 5 figures, submitted to ApJ
- Published
- 2024
28. Snuffy: Efficient Whole Slide Image Classifier
- Author
-
Jafarinia, Hossein, Alipanah, Alireza, Hamdi, Danial, Razavi, Saeed, Mirzaie, Nahal, and Rohban, Mohammad Hossein
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning ,Computer Science - Neural and Evolutionary Computing ,Electrical Engineering and Systems Science - Image and Video Processing - Abstract
Whole Slide Image (WSI) classification with multiple instance learning (MIL) in digital pathology faces significant computational challenges. Current methods mostly rely on extensive self-supervised learning (SSL) for satisfactory performance, requiring long training periods and considerable computational resources. At the same time, no pre-training affects performance due to domain shifts from natural images to WSIs. We introduce Snuffy architecture, a novel MIL-pooling method based on sparse transformers that mitigates performance loss with limited pre-training and enables continual few-shot pre-training as a competitive option. Our sparsity pattern is tailored for pathology and is theoretically proven to be a universal approximator with the tightest probabilistic sharp bound on the number of layers for sparse transformers, to date. We demonstrate Snuffy's effectiveness on CAMELYON16 and TCGA Lung cancer datasets, achieving superior WSI and patch-level accuracies. The code is available on https://github.com/jafarinia/snuffy., Comment: Accepted for ECCV 2024
- Published
- 2024
29. A Fully Open-Source End-to-End Private 5G Network over Unlicensed Frequency Bands
- Author
-
Bouhafs, Faycal, Hoseini, Sayed Amir, Shah, Syed Danial Ali, and Hartog, Frank den
- Subjects
Computer Science - Networking and Internet Architecture - Abstract
The fifth generation of mobile networks (5G) represents the latest development in mobile communications. It has been designed to support several types of data traffic and to meet more performance requirements than ever before. These characteristics make 5G very attractive for current but also novel public and private industries and services. However, because of coverage, regulatory, business, and security reasons, many of these novel applications can only be deployed as part of a private network. The cost of licensed frequencies makes such approach prohibitive for many stakeholders, and therefore unlicensed frequency bands represent a more affordable option. Even so, private 5G networks for use in globally unlicensed frequency bands do not yet exist. In this paper we present the first end-to-end private 5G network operating in a globally unlicensed frequency band, using general purpose computers, open-source software and software-defined radio. We evidence its working and show that the choice of the hardware can significantly affect the performance of the network., Comment: arXiv admin note: This version has been removed by arXiv administrators as the submitter did not have the right to agree to the license at the time of submission
- Published
- 2024
30. Leveraging GPT for the Generation of Multi-Platform Social Media Datasets for Research
- Author
-
Tari, Henry, Khan, Danial, Rutten, Justus, Othman, Darian, Kaushal, Rishabh, Bertaglia, Thales, and Iamnitchi, Adriana
- Subjects
Computer Science - Computers and Society - Abstract
Social media datasets are essential for research on disinformation, influence operations, social sensing, hate speech detection, cyberbullying, and other significant topics. However, access to these datasets is often restricted due to costs and platform regulations. As such, acquiring datasets that span multiple platforms which are crucial for a comprehensive understanding of the digital ecosystem is particularly challenging. This paper explores the potential of large language models to create lexically and semantically relevant social media datasets across multiple platforms, aiming to match the quality of real datasets. We employ ChatGPT to generate synthetic data from two real datasets, each consisting of posts from three different social media platforms. We assess the lexical and semantic properties of the synthetic data and compare them with those of the real data. Our empirical findings suggest that using large language models to generate synthetic multi-platform social media data is promising. However, further enhancements are necessary to improve the fidelity of the outputs.
- Published
- 2024
31. Cavity QED in a High NA Resonator
- Author
-
Shadmany, Danial, Kumar, Aishwarya, Soper, Anna, Palm, Lukas, Yin, Chuan, Ando, Henry, Li, Bowen, Taneja, Lavanya, Jaffe, Matt, Schuster, David, and Simon, Jon
- Subjects
Physics - Atomic Physics ,Condensed Matter - Quantum Gases ,Quantum Physics - Abstract
From fundamental studies of light-matter interaction to applications in quantum networking and sensing, cavity quantum electrodynamics (QED) provides a platform-crossing toolbox to control interactions between atoms and photons. The coherence of such interactions is determined by the product of the single-pass atomic absorption and the number of photon round-trips. Reducing the cavity loss has enabled resonators supporting nearly 1-million optical roundtrips at the expense of severely limited optical material choices and increased alignment sensitivity. The single-pass absorption probability can be increased through the use of near-concentric, fiber or nanophotonic cavities, which reduce the mode waists at the expense of constrained optical access and exposure to surface fields. Here we present a new high numerical-aperture, lens-based resonator that pushes the single-atom-single-photon absorption probability per round trip close to its fundamental limit by reducing the mode size at the atom below a micron while keeping the atom mm-to-cm away from all optics. This resonator provides strong light-matter coupling in a cavity where the light circulates only ~ 10 times. We load a single 87Rb atom into such a cavity, observe strong coupling, demonstrate cavity-enhanced atom detection with imaging fidelity of 99.55(6) percent and survival probability of 99.89(4) percent in 130 microseconds, and leverage this new platform for a time-resolved exploration of cavity cooling. The resonator's loss-resilience paves the way to coupling of atoms to nonlinear and adaptive optical elements and provides a minimally invasive route to readout of defect centers. Introduction of intra-cavity imaging systems will enable the creation of cavity arrays compatible with Rydberg atom array computing technologies, vastly expanding the applicability of the cavity QED toolbox.
- Published
- 2024
32. A Novel Portable and Wearable Broadband Near-Infrared Spectroscopy Device for In-Vivo Oxygenation and Metabolism Measurements
- Author
-
Talati, Musa, Lange, Frederic, Airantzis, Dimitrios, Chitnis, Danial, Illukwe, Temisan, Gopal, Darshana, Pinti, Paola, Ranaei-Zamani, Niccole, Kowobari, Olayinka, Hillman, Sara, Siassakos, Dimitrios, David, Anna, Mitra, Subhabrata, and Tachtsidis, Ilias
- Subjects
Physics - Medical Physics - Abstract
Broadband NIRS (bNIRS) is an extension of fNIRS that provides the same assessment of oxygenation biomarkers along with a valuable marker for oxygen metabolism at a cellular level, the oxidation state of cytochrome-c-oxidase (oxCCO). bNIRS implements many (100s) NIR wavelengths in the full NIR spectrum to address this and provide insight to tissue energetics. To supply these many wavelengths of light, broadband sources are required, and spectrometers are employed to distinguish power per wavelength. Current multi-channel bNIRS instruments are bulky and only semi-portable due to technological limitations. We propose a design for a bNIRS device that has been miniaturized to allow for portable use. This design leverages the innovations in photonic devices that have created a new line of microspectrometers and broadband NIR high-power LEDs; the Hamamatsu SMD-type spectrometer C14384MA and the Ushio SMBBIR45-1100 LED. This first-of-itskind device, referred to as microCYRIL (after its two predecessors CYRIL and miniCYRIL), has been developed for oxygenation and metabolism measurements with dual channel operation. To verify functionality, concentration changes in oxygenated (HbO2) and deoxygenated (HHb) haemoglobin and oxCCO were successfully tracked during a cuff-induced venous and arterial occlusion., Comment: To be published in Advances in Experimental Medicine and Biology
- Published
- 2024
33. CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes
- Author
-
Qashqai, Danial, Mousavian, Emad, Shokouhi, Shahriar Baradaran, and Mirzakuchaki, Sattar
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Semantic segmentation, as a crucial component of complex visual interpretation, plays a fundamental role in autonomous vehicle vision systems. Recent studies have significantly improved the accuracy of semantic segmentation by exploiting complementary information and developing multimodal methods. Despite the gains in accuracy, multimodal semantic segmentation methods suffer from high computational complexity and low inference speed. Therefore, it is a challenging task to implement multimodal methods in driving applications. To address this problem, we propose the Cosine Similarity Fusion Network (CSFNet) as a real-time RGB-X semantic segmentation model. Specifically, we design a Cosine Similarity Attention Fusion Module (CS-AFM) that effectively rectifies and fuses features of two modalities. The CS-AFM module leverages cross-modal similarity to achieve high generalization ability. By enhancing the fusion of cross-modal features at lower levels, CS-AFM paves the way for the use of a single-branch network at higher levels. Therefore, we use dual and single-branch architectures in an encoder, along with an efficient context module and a lightweight decoder for fast and accurate predictions. To verify the effectiveness of CSFNet, we use the Cityscapes, MFNet, and ZJU datasets for the RGB-D/T/P semantic segmentation. According to the results, CSFNet has competitive accuracy with state-of-the-art methods while being state-of-the-art in terms of speed among multimodal semantic segmentation models. It also achieves high efficiency due to its low parameter count and computational complexity. The source code for CSFNet will be available at https://github.com/Danial-Qashqai/CSFNet.
- Published
- 2024
34. MIDIS: MIRI uncovers Virgil, an extended source at $z\simeq 6.6$ with the photometric properties of Little Red Dots
- Author
-
Iani, Edoardo, Rinaldi, Pierluigi, Caputi, Karina I., Annunziatella, Marianna, Langeroodi, Danial, Melinder, Jens, Pérez-González, Pablo G., Álvarez-Márquez, Javier, Boogaard, Leindert A., Bosman, Sarah E. I., Costantin, Luca, Moutard, Thibaud, Colina, Luis, Östlin, Göran, Greve, Thomas R., Wright, Gillian, Alonso-Herrero, Almudena, Bik, Arjan, Gillman, Steven, Gómez, Alejandro Crespo, Hjorth, Jens, Labiano, Alvaro, Pye, John P., Tikkanen, Tuomo V., and van der Werf, Paul P.
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
We present Virgil, a MIRI extremely red object (MERO) detected with the F1000W filter as part of the MIRI Deep Imaging Survey (MIDIS) observations of the Hubble Ultra Deep Field (HUDF). Virgil is a Lyman-$\alpha$ emitter (LAE) at $z_{spec} = 6.6312\pm 0.0019$ (from VLT/MUSE) with a rest-frame UV-to-optical spectral energy distribution (SED) typical of LAEs at similar redshifts. However, MIRI observations reveal an unexpected extremely red color at rest-frame near-infrared wavelengths, $\rm F444W - F1000W = 2.33 \pm 0.06$. Such steep rise in the near-infrared, completely missed without MIRI imaging, is poorly reproduced by models including only stellar populations and hints towards the presence of an Active Galactic Nucleus (AGN). Interestingly, the overall SED shape of Virgil resembles that of the recently discovered population of Little Red Dots (LRDs) but does not meet their compactness criterion: at rest-frame UV-optical wavelengths Virgil's morphology follows a 2D-S\'ersic profile with average index $n = 0.93^{+0.85}_{-0.31}$ and $r_e = 0.43$~pkpc. Only at MIRI wavelengths Virgil is unresolved due to the coarser PSF. We also estimate a bolometric luminosity $L_{\rm bol} = (8.4-11.1)\times 10^{44}\rm~erg~s^{-1}$ and a supermassive black hole mass $M_{\rm BH} = (4-7)\times 10^7\rm ~ M_\odot$ in agreement with recently reported values for LRDs. This discovery demonstrates the crucial importance of deep MIRI surveys to find AGN amongst high-$z$ galaxies that otherwise would be completely missed and raises the question of how common Virgil-like objects could be in the early Universe., Comment: 17 pages, 10 figures, 3 tables. Submitted to ApJ
- Published
- 2024
35. Are Logistic Models Really Interpretable?
- Author
-
Dervovic, Danial, Lécué, Freddy, Marchesotti, Nicolás, and Magazzeni, Daniele
- Subjects
Computer Science - Machine Learning - Abstract
The demand for open and trustworthy AI models points towards widespread publishing of model weights. Consumers of these model weights must be able to act accordingly with the information provided. That said, one of the simplest AI classification models, Logistic Regression (LR), has an unwieldy interpretation of its model weights, with greater difficulties when extending LR to generalised additive models. In this work, we show via a User Study that skilled participants are unable to reliably reproduce the action of small LR models given the trained parameters. As an antidote to this, we define Linearised Additive Models (LAMs), an optimal piecewise linear approximation that augments any trained additive model equipped with a sigmoid link function, requiring no retraining. We argue that LAMs are more interpretable than logistic models -- survey participants are shown to solve model reasoning tasks with LAMs much more accurately than with LR given the same information. Furthermore, we show that LAMs do not suffer from large performance penalties in terms of ROC-AUC and calibration with respect to their logistic counterparts on a broad suite of public financial modelling data., Comment: 36 pages, 5 Figures. Extended version of paper accepted to IJCAI 2024. arXiv admin note: substantial text overlap with arXiv:2211.06360
- Published
- 2024
36. Scalable Expressiveness through Preprocessed Graph Perturbations
- Author
-
Saber, Danial and Salehi-Abari, Amirali
- Subjects
Computer Science - Machine Learning - Abstract
Graph Neural Networks (GNNs) have emerged as the predominant method for analyzing graph-structured data. However, canonical GNNs have limited expressive power and generalization capability, thus triggering the development of more expressive yet computationally intensive methods. One such approach is to create a series of perturbed versions of input graphs and then repeatedly conduct multiple message-passing operations on all variations during training. Despite their expressive power, this approach does not scale well on larger graphs. To address this scalability issue, we introduce Scalable Expressiveness through Preprocessed Graph Perturbation (SE2P). This model offers a flexible, configurable balance between scalability and generalizability with four distinct configuration classes. At one extreme, the configuration prioritizes scalability through minimal learnable feature extraction and extensive preprocessing; at the other extreme, it enhances generalizability with more learnable feature extractions, though this increases scalability costs. We conduct extensive experiments on real-world datasets to evaluate the generalizability and scalability of SE2P variants compared to various state-of-the-art benchmarks. Our results indicate that, depending on the chosen SE2P configuration, the model can enhance generalizability compared to benchmarks while achieving significant speed improvements of up to 8-fold., Comment: 14 pages, 3 figures
- Published
- 2024
37. A Parameterized Nonlinear Magnetic Equivalent Circuit for Design and Fast Analysis of Radial Flux Magnetic Gears
- Author
-
Kazemikia, Danial and Gardner, Matthew
- Subjects
Electrical Engineering and Systems Science - Systems and Control ,Computer Science - Computational Engineering, Finance, and Science - Abstract
Magnetic gears offer advantages over mechanical gears, including contactless power transfer, but require robust analysis tools for optimization and commercialization. This study proposes a rapid and accurate 2D nonlinear magnetic equivalent circuit (MEC) model for radial flux magnetic gears (RFMG). The model, featuring a parameterized gear geometry and adjustable flux tube distribution, accommodates nonlinear effects like magnetic saturation while maintaining quick simulation times. Comparison with a nonlinear finite element analysis (FEA) model demonstrates the MEC's accuracy in torque and flux density predictions across diverse designs. Additionally, a parametric optimization study of 140,000 designs confirms the MEC's high accuracy, achieving close agreement with FEA torque predictions, with simulations running up to 100 times faster. Finally, the MEC shows good agreement with 2D FEA for a prototype RFMG.
- Published
- 2024
38. A Pipelined Memristive Neural Network Analog-to-Digital Converter
- Author
-
Danial, Loai, Sharma, Kanishka, and Kvatinsky, Shahar
- Subjects
Electrical Engineering and Systems Science - Systems and Control ,Computer Science - Neural and Evolutionary Computing - Abstract
With the advent of high-speed, high-precision, and low-power mixed-signal systems, there is an ever-growing demand for accurate, fast, and energy-efficient analog-to-digital (ADCs) and digital-to-analog converters (DACs). Unfortunately, with the downscaling of CMOS technology, modern ADCs trade off speed, power and accuracy. Recently, memristive neuromorphic architectures of four-bit ADC/DAC have been proposed. Such converters can be trained in real-time using machine learning algorithms, to break through the speedpower-accuracy trade-off while optimizing the conversion performance for different applications. However, scaling such architectures above four bits is challenging. This paper proposes a scalable and modular neural network ADC architecture based on a pipeline of four-bit converters, preserving their inherent advantages in application reconfiguration, mismatch selfcalibration, noise tolerance, and power optimization, while approaching higher resolution and throughput in penalty of latency. SPICE evaluation shows that an 8-bit pipelined ADC achieves 0.18 LSB INL, 0.20 LSB DNL, 7.6 ENOB, and 0.97 fJ/conv FOM. This work presents a significant step towards the realization of large-scale neuromorphic data converters.
- Published
- 2024
39. Cross-Domain Graph Data Scaling: A Showcase with Diffusion Models
- Author
-
Tang, Wenzhuo, Mao, Haitao, Dervovic, Danial, Brugere, Ivan, Mishra, Saumitra, Xie, Yuying, and Tang, Jiliang
- Subjects
Computer Science - Machine Learning - Abstract
Models for natural language and images benefit from data scaling behavior: the more data fed into the model, the better they perform. This 'better with more' phenomenon enables the effectiveness of large-scale pre-training on vast amounts of data. However, current graph pre-training methods struggle to scale up data due to heterogeneity across graphs. To achieve effective data scaling, we aim to develop a general model that is able to capture diverse data patterns of graphs and can be utilized to adaptively help the downstream tasks. To this end, we propose UniAug, a universal graph structure augmentor built on a diffusion model. We first pre-train a discrete diffusion model on thousands of graphs across domains to learn the graph structural patterns. In the downstream phase, we provide adaptive enhancement by conducting graph structure augmentation with the help of the pre-trained diffusion model via guided generation. By leveraging the pre-trained diffusion model for structure augmentation, we consistently achieve performance improvements across various downstream tasks in a plug-and-play manner. To the best of our knowledge, this study represents the first demonstration of a data-scaling graph structure augmentor on graphs across domains.
- Published
- 2024
40. Double-sided van der Waals epitaxy of topological insulators across an atomically thin membrane
- Author
-
Park, Joon Young, Shin, Young Jae, Shin, Jeacheol, Kim, Jehyun, Jo, Janghyun, Yoo, Hyobin, Haei, Danial, Hyun, Chohee, Yun, Jiyoung, Huber, Robert M., Gupta, Arijit, Watanabe, Kenji, Taniguchi, Takashi, Park, Wan Kyu, Shin, Hyeon Suk, Kim, Miyoung, Kim, Dohun, Yi, Gyu-Chul, and Kim, Philip
- Subjects
Condensed Matter - Materials Science ,Condensed Matter - Mesoscale and Nanoscale Physics - Abstract
Atomically thin van der Waals (vdW) films provide a novel material platform for epitaxial growth of quantum heterostructures. However, unlike the remote epitaxial growth of three-dimensional bulk crystals, the growth of two-dimensional (2D) material heterostructures across atomic layers has been limited due to the weak vdW interaction. Here, we report the double-sided epitaxy of vdW layered materials through atomic membranes. We grow vdW topological insulators (TIs) Sb$_2$Te$_3$ and Bi$_2$Se$_3$ by molecular beam epitaxy on both surfaces of atomically thin graphene or hBN, which serve as suspended 2D vdW "$\textit{substrate}$" layers. Both homo- and hetero- double-sided vdW TI tunnel junctions are fabricated, with the atomically thin hBN acting as a crystal-momentum-conserving tunnelling barrier with abrupt and epitaxial interface. By performing field-angle dependent magneto-tunnelling spectroscopy on these devices, we reveal the energy-momentum-spin resonant tunnelling of massless Dirac electrons between helical Landau levels developed in the topological surface states at the interface., Comment: 24 pages, 4 main figures, 7 extended data figures
- Published
- 2024
41. Nonlinear Spectroscopy via Generalized Quantum Phase Estimation
- Author
-
Loaiza, Ignacio, Motlagh, Danial, Hejazi, Kasra, Zini, Modjtaba Shokrian, Delgado, Alain, and Arrazola, Juan Miguel
- Subjects
Quantum Physics - Abstract
Response theory has a successful history of connecting experimental observations with theoretical predictions. Of particular interest is the optical response of matter, from which spectroscopy experiments can be modelled. However, the calculation of response properties for quantum systems is often prohibitively expensive, especially for nonlinear spectroscopy, as it requires access to either the time evolution of the system or to excited states. In this work, we introduce a generalized quantum phase estimation framework designed for multi-variate phase estimation. This allows the treatment of general correlation functions enabling the recovery of response properties of arbitrary orders. The generalized quantum phase estimation circuit has an intuitive construction that is linked with a physical process of interest, and can directly sample frequencies from the distribution that would be obtained experimentally. In addition, we provide a single-ancilla modification of the new framework for early fault-tolerant quantum computers. Overall, our framework enables the efficient simulation of spectroscopy experiments beyond the linear regime, such as Raman spectroscopy. This opens up an exciting new field of applications for quantum computers with potential technological impact., Comment: 16 pages, 3 figures, 1 table
- Published
- 2024
42. Lusifer: LLM-based User SImulated Feedback Environment for online Recommender systems
- Author
-
Ebrat, Danial, Paradalis, Eli, and Rueda, Luis
- Subjects
Computer Science - Information Retrieval ,Computer Science - Machine Learning - Abstract
Training reinforcement learning-based recommender systems is often hindered by the lack of dynamic and realistic user interactions. To address this limitation, we introduce Lusifer, a novel environment leveraging Large Language Models (LLMs) to generate simulated user feedback. Lusifer synthesizes user profiles and interaction histories to simulate responses and behaviors toward recommended items, with profiles updated after each rating to reflect evolving user characteristics. Utilizing the MovieLens dataset as a proof of concept, we limited our implementation to the last 40 interactions for each user, representing approximately 39% and 22% of the training sets, to focus on recent user behavior. For consistency and to gain insights into the performance of traditional methods with limited data, we implemented baseline approaches using the same data subset. Our results demonstrate that Lusifer accurately emulates user behavior and preferences, even with reduced training data having an RMSE of 1.3 across various test sets. This paper presents Lusifer's operational pipeline, including prompt generation and iterative user profile updates, and compares its performance against baseline methods. The findings validate Lusifer's ability to produce realistic dynamic feedback and suggest that it offers a scalable and adjustable framework for user simulation in online reinforcement learning recommender systems for future studies, particularly when training data is limited.
- Published
- 2024
43. Illustrating an Effective Workflow for Accelerated Materials Discovery
- Author
-
Mulukutla, Mrinalini, Person, A. Nicole, Voigt, Sven, Kuettner, Lindsey, Kappes, Branden, Khatamsaz, Danial, Robinson, Robert, Salas, Daniel, Xu, Wenle, Lewis, Daniel, Eoh, Hongkyu, Xiao, Kailu, Wang, Haoren, Saini, Jaskaran Singh, Mahat, Raj, Hastings, Trevor, Skokan, Matthew, Attari, Vahid, Elverud, Michael, Paramore, James D., Butler, Brady, Vecchio, Kenneth, Kalidindi, Surya R., Allaire, Douglas, Karaman, Ibrahim, Thomas, Edwin L., Pharr, George, Srivastava, Ankit, and Arróyave, Raymundo
- Subjects
Condensed Matter - Materials Science - Abstract
Algorithmic materials discovery is a multi-disciplinary domain that integrates insights from specialists in alloy design, synthesis, characterization, experimental methodologies, computational modeling, and optimization. Central to this effort is a robust data management system paired with an interactive work platform. This platform should empower users to not only access others data but also integrate their analyses, paving the way for sophisticated data pipelines. To realize this vision, there is a need for an integrative collaboration platform, streamlined data sharing and analysis tools, and efficient communication channels. Such a collaborative mechanism should transcend geographical barriers, facilitating remote interaction and fostering a challenge-response dynamic. In this paper, we present our ongoing efforts in addressing the critical challenges related to an accelerated Materials Discovery Framework as a part of the High-Throughput Materials Discovery for Extreme Conditions Initiative. Our BIRDSHOT Center has successfully harnessed various tools and strategies, including the utilization of cloud-based storage, a standardized sample naming convention, a structured file system, the implementation of sample travelers, a robust sample tracking method, and the incorporation of knowledge graphs for efficient data management. Additionally, we present the development of a data collection platform, reinforcing seamless collaboration among our team members. In summary, this paper provides an illustration and insight into the various elements of an efficient and effective workflow within an accelerated materials discovery framework while highlighting the dynamic and adaptable nature of the data management tools and sharing platforms., Comment: 28 pages, 9 figures, 2 tables, with appendix that has 8 pages, accepted for publication at IMMI
- Published
- 2024
- Full Text
- View/download PDF
44. An Interoperable Multi Objective Batch Bayesian Optimization Framework for High Throughput Materials Discovery
- Author
-
Hastings, Trevor, Mulukutla, Mrinalini, Khatamsaz, Danial, Salas, Daniel, Xu, Wenle, Lewis, Daniel, Person, Nicole, Skokan, Matthew, Miller, Braden, Paramore, James, Butler, Brady, Allaire, Douglas, Karaman, Ibrahim, Pharr, George, Srivastava, Ankit, and Arroyave, Raymundo
- Subjects
Condensed Matter - Materials Science - Abstract
In this study, we introduce a groundbreaking framework for materials discovery, we efficiently navigate a vast phase space of material compositions by leveraging Batch Bayesian statistics in order to achieve specific performance objectives. This approach addresses the challenge of identifying optimal materials from an untenably large array of possibilities in a reasonable timeframe with high confidence. Crucially, our batchwise methods align seamlessly with existing material processing infrastructure for synthesizing and characterizing materials. By applying this framework to a specific high entropy alloy system, we demonstrate its versatility and robustness in optimizing properties like strain hardening, hardness, and strain rate sensitivity. The fact that the Bayesian model is adept in refining and expanding the property Pareto front highlights its broad applicability across various materials, including steels, shape memory alloys, ceramics, and composites. This study advances the field of materials science and sets a new benchmark for material discovery methodologies. By proving the effectiveness of Bayesian optimization, we showcase its potential to redefine the landscape of materials discovery., Comment: 12 pages, 6 figures, with Supplementary Appendix that has 17 pages, 9 figures
- Published
- 2024
45. Beyond Deepfake Images: Detecting AI-Generated Videos
- Author
-
Vahdati, Danial Samadi, Nguyen, Tai D., Azizpour, Aref, and Stamm, Matthew C.
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Recent advances in generative AI have led to the development of techniques to generate visually realistic synthetic video. While a number of techniques have been developed to detect AI-generated synthetic images, in this paper we show that synthetic image detectors are unable to detect synthetic videos. We demonstrate that this is because synthetic video generators introduce substantially different traces than those left by image generators. Despite this, we show that synthetic video traces can be learned, and used to perform reliable synthetic video detection or generator source attribution even after H.264 re-compression. Furthermore, we demonstrate that while detecting videos from new generators through zero-shot transferability is challenging, accurate detection of videos from a new generator can be achieved through few-shot learning., Comment: To be published in CVPRW24
- Published
- 2024
46. NIRSpec View of the Appearance and Evolution of Balmer Breaks and the Transition from Bursty to Smooth Star Formation Histories from Deep Within the Epoch of Reionization to Cosmic Noon
- Author
-
Langeroodi, Danial and Hjorth, Jens
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
Theoretical models and observational evidence suggest that high-redshift galaxies grow under the bursty mode of star formation, with large temporal star formation rate (SFR) fluctuations around some mean value. From an observational perspective, it has not been clear at which redshift and stellar population characteristics the transition from bursty to smooth star formation occurs. Here, we investigate these using a uniformly reduced sample of NIRSpec prism spectra of 631 galaxies at $3 < z_{\rm spec} < 14$, stacked in 8 redshift and 8 UV slope bins. We evaluate the burstiness of star formation histories using the Balmer break strengths as well as the ratios of SFRs as measured from the emission lines to those measured from the UV continua. The break strength increases monotonically from $z = 10$ to $z = 3$, and from $\beta_{\rm UV} = -3.0$ to $\beta_{\rm UV} = 0.0$. The break strength is tightly anti-correlated with specific SFR (sSFR), and in dusty galaxies, strongly correlated with dust attenuation. Based on the SFR ratios, we find that bursty star formation thrives in the highest redshift, bluest, and lowest stellar mass galaxies, which exhibit the highest sSFRs. The burstiness appears to plateau at $z > 6$, suggesting that we might be observing the peak of star formation burstiness at these redshifts. The $z < 4$ galaxies do not appear particularly bursty, suggesting that the smooth mode of star formation starts taking over right before cosmic noon. As galaxies mature and develop redder UV colors and more pronounced Balmer breaks, their ability to sustain star formation over longer timescales increases, signalling their transition from bursty to smooth star formation., Comment: Comments Welcome! Submitted to ApJL
- Published
- 2024
47. Characterizing Multimodal Long-form Summarization: A Case Study on Financial Reports
- Author
-
Cao, Tianyu, Raman, Natraj, Dervovic, Danial, and Tan, Chenhao
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
As large language models (LLMs) expand the power of natural language processing to handle long inputs, rigorous and systematic analyses are necessary to understand their abilities and behavior. A salient application is summarization, due to its ubiquity and controversy (e.g., researchers have declared the death of summarization). In this paper, we use financial report summarization as a case study because financial reports are not only long but also use numbers and tables extensively. We propose a computational framework for characterizing multimodal long-form summarization and investigate the behavior of Claude 2.0/2.1, GPT-4/3.5, and Cohere. We find that GPT-3.5 and Cohere fail to perform this summarization task meaningfully. For Claude 2 and GPT-4, we analyze the extractiveness of the summary and identify a position bias in LLMs. This position bias disappears after shuffling the input for Claude, which suggests that Claude seems to recognize important information. We also conduct a comprehensive investigation on the use of numeric data in LLM-generated summaries and offer a taxonomy of numeric hallucination. We employ prompt engineering to improve GPT-4's use of numbers with limited success. Overall, our analyses highlight the strong capability of Claude 2 in handling long multimodal inputs compared to GPT-4. The generated summaries and evaluation code are available at https://github.com/ChicagoHAI/characterizing-multimodal-long-form-summarization.
- Published
- 2024
48. Ground State Preparation via Dynamical Cooling
- Author
-
Motlagh, Danial, Zini, Modjtaba Shokrian, Arrazola, Juan Miguel, and Wiebe, Nathan
- Subjects
Quantum Physics - Abstract
Quantum algorithms for probing ground-state properties of quantum systems require good initial states. Projection-based methods such as eigenvalue filtering rely on inputs that have a significant overlap with the low-energy subspace, which can be challenging for large, strongly-correlated systems. This issue has motivated the study of physically-inspired dynamical approaches such as thermodynamic cooling. In this work, we introduce a ground-state preparation algorithm based on the simulation of quantum dynamics. Our main insight is to transform the Hamiltonian by a shifted sign function via quantum signal processing, effectively mapping eigenvalues into positive and negative subspaces separated by a large gap. This automatically ensures that all states within each subspace conserve energy with respect to the transformed Hamiltonian. Subsequent time-evolution with a perturbed Hamiltonian induces transitions to lower-energy states while preventing unwanted jumps to higher energy states. The approach does not rely on a priori knowledge of energy gaps and requires no additional qubits to model a bath. Furthermore, it makes $\tilde{\mathcal{O}}(d^{\,3/2}/\epsilon)$ queries to the time-evolution operator of the system and $\tilde{\mathcal{O}}(d^{\,3/2})$ queries to a block-encoding of the perturbation, for $d$ cooling steps and an $\epsilon$-accurate energy resolution. Our results provide a framework for combining quantum signal processing and Hamiltonian simulation to design heuristic quantum algorithms for ground-state preparation.
- Published
- 2024
49. Pixel-wise RL on Diffusion Models: Reinforcement Learning from Rich Feedback
- Author
-
Kordzanganeh, Mo, Keshvary, Danial, and Arian, Nariman
- Subjects
Computer Science - Machine Learning ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Latent diffusion models are the state-of-the-art for synthetic image generation. To align these models with human preferences, training the models using reinforcement learning on human feedback is crucial. Black et. al 2024 introduced denoising diffusion policy optimisation (DDPO), which accounts for the iterative denoising nature of the generation by modelling it as a Markov chain with a final reward. As the reward is a single value that determines the model's performance on the entire image, the model has to navigate a very sparse reward landscape and so requires a large sample count. In this work, we extend the DDPO by presenting the Pixel-wise Policy Optimisation (PXPO) algorithm, which can take feedback for each pixel, providing a more nuanced reward to the model., Comment: 6 pages, 7 figures
- Published
- 2024
50. Benchmarking Large Language Models for Persian: A Preliminary Study Focusing on ChatGPT
- Author
-
Abaskohi, Amirhossein, Baruni, Sara, Masoudi, Mostafa, Abbasi, Nesa, Babalou, Mohammad Hadi, Edalat, Ali, Kamahi, Sepehr, Sani, Samin Mahdizadeh, Naghavian, Nikoo, Namazifard, Danial, Sadeghi, Pouya, and Yaghoobzadeh, Yadollah
- Subjects
Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
This paper explores the efficacy of large language models (LLMs) for Persian. While ChatGPT and consequent LLMs have shown remarkable performance in English, their efficiency for more low-resource languages remains an open question. We present the first comprehensive benchmarking study of LLMs across diverse Persian language tasks. Our primary focus is on GPT-3.5-turbo, but we also include GPT-4 and OpenChat-3.5 to provide a more holistic evaluation. Our assessment encompasses a diverse set of tasks categorized into classic, reasoning, and knowledge-based domains. To enable a thorough comparison, we evaluate LLMs against existing task-specific fine-tuned models. Given the limited availability of Persian datasets for reasoning tasks, we introduce two new benchmarks: one based on elementary school math questions and another derived from the entrance exams for 7th and 10th grades. Our findings reveal that while LLMs, especially GPT-4, excel in tasks requiring reasoning abilities and a broad understanding of general knowledge, they often lag behind smaller pre-trained models fine-tuned specifically for particular tasks. Additionally, we observe improved performance when test sets are translated to English before inputting them into GPT-3.5. These results highlight the significant potential for enhancing LLM performance in the Persian language. This is particularly noteworthy due to the unique attributes of Persian, including its distinct alphabet and writing styles., Comment: 14 pages, 1 figure, 6 tables, Proceeding of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING)
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.