Author: "Youssef, P" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Youssef, P"' showing total 19,722 results

1. Finite-Sample Analysis of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning

Author: Chen, Suei-Wen, Ross, Keith, and Youssef, Pierre
Subjects: Computer Science - Machine Learning
Abstract: Monte Carlo Exploring Starts (MCES), which aims to learn the optimal policy using only sample returns, is a simple and natural algorithm in reinforcement learning which has been shown to converge under various conditions. However, the convergence rate analysis for MCES-style algorithms in the form of sample complexity has received very little attention. In this paper we develop a finite sample bound for a modified MCES algorithm which solves the stochastic shortest path problem. To this end, we prove a novel result on the convergence rate of the policy iteration algorithm. This result implies that with probability at least $1-\delta$, the algorithm returns an optimal policy after $\tilde{O}(SAK^3\log^3\frac{1}{\delta})$ sampled episodes, where $S$ and $A$ denote the number of states and actions respectively, $K$ is a proxy for episode length, and $\tilde{O}$ hides logarithmic factors and constants depending on the rewards of the environment that are assumed to be known., Comment: 13 pages
Published: 2024

2. AirTags for Human Localization, Not Just Objects

Author: Hany, Mohamed I., Rizk, Hamada, and Youssef, Moustafa
Subjects: Computer Science - Networking and Internet Architecture
Abstract: Indoor localization has become increasingly important due to its wide-ranging applications in indoor navigation, emergency services, the Internet of Things (IoT), and accessibility for individuals with special needs. Traditional localization systems often require extensive calibration to achieve high accuracy. We introduce UbiLoc, an innovative, calibration-free indoor localization system that leverages Apple AirTags in a novel way to localize users instead of tracking objects. By utilizing the ubiquitous presence of AirTags and their Ultra-Wideband (UWB) technology, UbiLoc achieves centimeter-level accuracy, surpassing traditional WiFi and Bluetooth Low Energy (BLE) systems. UbiLoc addresses key challenges, including ranging errors caused by multipath and noise, through a novel AirTag selection technique. The system operates without the need for manual calibration, ensuring robustness and self-maintenance. Deployed on various Apple devices and tested in real-world environments, UbiLoc achieved median localization errors as low as 26 cm in a campus building and 31.5 cm in an apartment setting. These results demonstrate that UbiLoc is the first system to offer reliable, cm-level accuracy using widely available technology without requiring calibration, making it a promising solution for next-generation indoor localization systems., Comment: Accepted for publication in 2nd ACM SIGSPATIAL International Workshop on Geo-Privacy and Data Utility for Smart Societies: 7 pages, 9 figures
Published: 2024

3. An Efficient Scaled spectral preconditioner for sequences of symmetric positive definite linear systems

Author: Diouane, Youssef, Gürol, Selime, Mouhtal, Oussama, and Orban, Dominique
Subjects: Mathematics - Numerical Analysis, Mathematics - Optimization and Control, 68Q25, 68R10, 68U05
Abstract: We explore a scaled spectral preconditioner for the efficient solution of sequences of symmetric and positive-definite linear systems. We design the scaled preconditioner not only as an approximation of the inverse of the linear system but also with consideration of its use within the conjugate gradient (CG) method. We propose three different strategies for selecting a scaling parameter, which aims to position the eigenvalues of the preconditioned matrix in a way that reduces the energy norm of the error, the quantity that CG monotonically decreases at each iteration. Our focus is on accelerating convergence especially in the early iterations, which is particularly important when CG is truncated due to computational cost constraints. Numerical experiments provided in data assimilation confirm that the scaled spectral preconditioner can significantly improve early CG convergence with negligible computational cost.
Published: 2024

4. A nonsmooth exact penalty method for equality-constrained optimization: complexity and implementation

Author: Diouane, Youssef, Gollier, Maxence, and Orban, Dominique
Subjects: Mathematics - Optimization and Control, 90C06, 90C30, 90C53
Abstract: Penalty methods are a well known class of algorithms for constrained optimization. They transform a constrained problem into a sequence of unconstrained penalized problems in the hope that approximate solutions of the latter converge to a solution of the former. If Lagrange multipliers exist, exact penalty methods ensure that the penalty parameter only need increase a finite number of times, but are typically scorned in smooth optimization for the penalized problems are not smooth. This led researchers to consider the implementation of exact penalty methods inconvenient. Recently, advances in proximal methods have led to increasingly efficient solvers for nonsmooth optimization. We show that the exact $\ell_2$-penalty method for equality-constrained optimization can in fact be implemented efficiently by solving the penalized problem with a proximal-type algorithm. We study the convergence of our algorithm and establish a worst-case complexity bound of $O(\epsilon^{-2})$ to bring a stationarity measure below $\epsilon > 0$ under the Mangasarian-Fromovitz constraint qualification and Lipschitz continuity of the objective gradient and constraint Jacobian. In a degenerate scenario where the penalty parameter grows unbounded, the complexity becomes $O(\epsilon^{-8})$, which is worse than another bound found in the literature. We justify the difference by arguing that our feasibility measure is properly scaled. Finally, we report numerical experience on small-scale problems from a standard collection and compare our solver with an augmented-Lagrangian and an SQP method. Our preliminary implementation is on par with the augmented Lagrangian in terms of robustness and efficiency. It is on par with the SQP method in terms of robustness, though the former remains ahead in terms of number of problem function evaluations.
Published: 2024

5. Fine-Tuning Personalization in Federated Learning to Mitigate Adversarial Clients

Author: Allouah, Youssef, Mrini, Abdellah El, Guerraoui, Rachid, Gupta, Nirupam, and Pinot, Rafael
Subjects: Computer Science - Machine Learning, Computer Science - Cryptography and Security
Abstract: Federated learning (FL) is an appealing paradigm that allows a group of machines (a.k.a. clients) to learn collectively while keeping their data local. However, due to the heterogeneity between the clients' data distributions, the model obtained through the use of FL algorithms may perform poorly on some client's data. Personalization addresses this issue by enabling each client to have a different model tailored to their own data while simultaneously benefiting from the other clients' data. We consider an FL setting where some clients can be adversarial, and we derive conditions under which full collaboration fails. Specifically, we analyze the generalization performance of an interpolated personalized FL framework in the presence of adversarial clients, and we precisely characterize situations when full collaboration performs strictly worse than fine-tuned personalization. Our analysis determines how much we should scale down the level of collaboration, according to data heterogeneity and the tolerable fraction of adversarial clients. We support our findings with empirical results on mean estimation and binary classification problems, considering synthetic and benchmark image classification datasets.
Published: 2024

6. A multimodal LLM for the non-invasive decoding of spoken text from brain recordings

Author: Hmamouche, Youssef, Chihab, Ismail, Kdouri, Lahoucine, and Seghrouchni, Amal El Fallah
Subjects: Quantitative Biology - Neurons and Cognition, Computer Science - Computation and Language, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, Electrical Engineering and Systems Science - Signal Processing, Quantitative Biology - Quantitative Methods
Abstract: Brain-related research topics in artificial intelligence have recently gained popularity, particularly due to the expansion of what multimodal architectures can do from computer vision to natural language processing. Our main goal in this work is to explore the possibilities and limitations of these architectures in spoken text decoding from non-invasive fMRI recordings. Contrary to vision and textual data, fMRI data represent a complex modality due to the variety of brain scanners, which implies (i) the variety of the recorded signal formats, (ii) the low resolution and noise of the raw signals, and (iii) the scarcity of pretrained models that can be leveraged as foundation models for generative learning. These points make the problem of the non-invasive decoding of text from fMRI recordings very challenging. In this paper, we propose an end-to-end multimodal LLM for decoding spoken text from fMRI signals. The proposed architecture is founded on (i) an encoder derived from a specific transformer incorporating an augmented embedding layer for the encoder and a better-adjusted attention mechanism than that present in the state of the art, and (ii) a frozen large language model adapted to align the embedding of the input text and the encoded embedding of brain activity to decode the output text. A benchmark is performed on a corpus consisting of a set of human-human and human-robot interactions where fMRI and conversational signals are recorded synchronously. The obtained results are very promising, as our proposal outperforms the evaluated models, and is able to generate text capturing more accurate semantics present in the ground truth. The implementation code is provided in https://github.com/Hmamouche/brain_decode., Comment: 15 pages, 4 figures
Published: 2024

7. A House United Within Itself: SLO-Awareness for On-Premises Containerized ML Inference Clusters via Faro

Author: Jeon, Beomyeol, Wang, Chen, Arroyo, Diana, Youssef, Alaa, and Gupta, Indranil
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: This paper tackles the challenge of running multiple ML inference jobs (models) under time-varying workloads, on a constrained on-premises production cluster. Our system Faro takes in latency Service Level Objectives (SLOs) for each job, auto-distills them into utility functions, "sloppifies" these utility functions to make them amenable to mathematical optimization, automatically predicts workload via probabilistic prediction, and dynamically makes implicit cross-job resource allocations, in order to satisfy cluster-wide objectives, e.g., total utility, fairness, and other hybrid variants. A major challenge Faro tackles is that using precise utilities and high-fidelity predictors can be too slow (and in a sense too precise!) for the fast adaptation we require. Faro's solution is to "sloppify" (relax) its multiple design components to achieve fast adaptation without overly degrading solution quality. Faro is implemented in a stack consisting of Ray Serve running atop a Kubernetes cluster. Trace-driven cluster deployments show that Faro achieves 2.3$\times$-23$\times$ lower SLO violations compared to state-of-the-art systems., Comment: 13 pages, 16 figures, To appear in Eurosys 2025
Published: 2024

8. A Proximal Modified Quasi-Newton Method for Nonsmooth Regularized Optimization

Author: Diouane, Youssef, Habiboullah, Mohamed Laghdaf, and Orban, Dominique
Subjects: Mathematics - Optimization and Control, Computer Science - Machine Learning
Abstract: We develop R2N, a modified quasi-Newton method for minimizing the sum of a $\mathcal{C}^1$ function $f$ and a lower semi-continuous prox-bounded $h$. Both $f$ and $h$ may be nonconvex. At each iteration, our method computes a step by minimizing the sum of a quadratic model of $f$, a model of $h$, and an adaptive quadratic regularization term. A step may be computed by a variant of the proximal-gradient method. An advantage of R2N over trust-region (TR) methods is that proximal operators do not involve an extra TR indicator. We also develop the variant R2DH, in which the model Hessian is diagonal, which allows us to compute a step without relying on a subproblem solver when $h$ is separable. R2DH can be used as a standalone solver, but also as a subproblem solver inside R2N. We describe non-monotone variants of both R2N and R2DH. Global convergence of a first-order stationarity measure to zero holds without relying on local Lipschitz continuity of $\nabla f$, while allowing model Hessians to grow unbounded, an assumption particularly relevant to quasi-Newton models. Under Lipschitz-continuity of $\nabla f$, we establish a tight worst-case complexity bound of $O(1 / \epsilon^{2/(1 - p)})$ to bring said measure below $\epsilon > 0$, where $0 \leq p < 1$ controls the growth of model Hessians. The latter must not diverge faster than $|\mathcal{S}_k|^p$, where $\mathcal{S}_k$ is the set of successful iterations up to iteration $k$. When $p = 1$, we establish the tight exponential complexity bound $O(\exp(c \epsilon^{-2}))$ where $c > 0$ is a constant. We describe our Julia implementation and report numerical experience on a basis-pursuit problem, image denoising, minimum-rank matrix completion, and a nonlinear support vector machine. In particular, the minimum-rank problem cannot be solved directly at this time by a TR approach as corresponding proximal operators are not known analytically.
Published: 2024

9. SwiftDossier: Tailored Automatic Dossier for Drug Discovery with LLMs and Agents

Author: Fossi, Gabriele, Boulaimen, Youssef, Outemzabet, Leila, Jeanray, Nathalie, Gerart, Stephane, Vachenc, Sebastien, Giemza, Joanna, and Raieli, Salvatore
Subjects: Computer Science - Artificial Intelligence, 68T07, 92C50, 68T09, I.2.7, J.3
Abstract: The advancement of artificial intelligence algorithms has expanded their application to several fields such as the biomedical domain. Artificial intelligence systems, including Large Language Models (LLMs), can be particularly advantageous in drug discovery, which is a very long and expensive process. However, LLMs by themselves lack in-depth knowledge about specific domains and can generate factually incorrect information. Moreover, they are not able to perform more complex actions that imply the usage of external tools. Our work is focused on these two issues. Firstly, we show how the implementation of an advanced RAG system can help the LLM to generate more accurate answers to drug-discovery-related questions. The results show that the answers generated by the LLM with the RAG system surpass in quality the answers produced by the model without RAG. Secondly, we show how to create an automatic target dossier using LLMs and incorporating them with external tools that they can use to execute more intricate tasks to gather data such as accessing databases and executing code. The result is a production-ready target dossier containing the acquired information summarized into a PDF and a PowerPoint presentation., Comment: 10 pages, 7 figures, 2 tables
Published: 2024

10. Invisible Servoing: a Visual Servoing Approach with Return-Conditioned Latent Diffusion

Author: Gerges, Bishoy, Bazzana, Barbara, Botteghi, Nicolò, Aboudorra, Youssef, and Franchi, Antonio
Subjects: Computer Science - Robotics
Abstract: In this paper, we present a novel visual servoing (VS) approach based on latent Denoising Diffusion Probabilistic Models (DDPMs). In contrast to classical VS methods, the proposed approach allows reaching the desired target view, even when the target is initially not visible. This is possible thanks to the learning of a latent representation that the DDPM uses for planning and a dataset of trajectories encompassing target-invisible initial views. The latent representation is learned using a Cross-Modal Variational Autoencoder, and used to estimate the return for conditioning the trajectory generation of the DDPM. Given the current image, the DDPM generates trajectories in the latent space driving the robotic platform to the desired visual target. The approach is applicable to any velocity-based controlled platform. We test our method with simulated and real-world experiments using generic multi-rotor Uncrewed Aerial Vehicles (UAVs). A video of our experiments can be found at https://youtu.be/yu-aTxqceOA.
Published: 2024

11. A Stochastic Iteratively Regularized Gauss-Newton Method

Author: Bergou, El Houcine, Chada, Neil K., and Diouane, Youssef
Subjects: Mathematics - Numerical Analysis, Mathematics - Optimization and Control, 65N21, 65C35, 65K10, 93E24
Abstract: This work focuses on developing and motivating a stochastic version of a well-known inverse problem methodology. Specifically, we consider the iteratively regularized Gauss-Newton method, originally proposed by Bakushinskii for infinite-dimensional problems. Recent works have extended this method to handle sequential observations, rather than a single instance of the data, demonstrating notable improvements in reconstruction accuracy. In this paper, we further extend these methods to a stochastic framework through mini-batching, introducing a new algorithm, the stochastic iteratively regularized Gauss-Newton method (SIRGNM). Our algorithm is designed through the use of randomized sketching. We provide an analysis for the SIRGNM, which includes a preliminary error decomposition and a convergence analysis, related to the residuals. We provide numerical experiments on a 2D elliptic PDE example. This illustrates the effectiveness of the SIRGNM, through maintaining a similar level of accuracy while reducing the computational time., Comment: 23 pages
Published: 2024

12. Fusion in Context: A Multimodal Approach to Affective State Recognition

Author: Mohamed, Youssef, Lemaignan, Severin, Guneysu, Arzu, Jensfelt, Patric, and Smith, Christian
Subjects: Computer Science - Robotics
Abstract: Accurate recognition of human emotions is a crucial challenge in affective computing and human-robot interaction (HRI). Emotional states play a vital role in shaping behaviors, decisions, and social interactions. However, emotional expressions can be influenced by contextual factors, leading to misinterpretations if context is not considered. Multimodal fusion, combining modalities like facial expressions, speech, and physiological signals, has shown promise in improving affect recognition. This paper proposes a transformer-based multimodal fusion approach that leverages facial thermal data, facial action units, and textual context information for context-aware emotion recognition. We explore modality-specific encoders to learn tailored representations, which are then fused using additive fusion and processed by a shared transformer encoder to capture temporal dependencies and interactions. The proposed method is evaluated on a dataset collected from participants engaged in a tangible tabletop Pacman game designed to induce various affective states. Our results demonstrate the effectiveness of incorporating contextual information and multimodal fusion for affective state recognition.
Published: 2024

13. Lattice Light Shift Evaluations In a Dual-Ensemble Yb Optical Lattice Clock

Author: Bothwell, Tobias, Hunt, Benjamin D., Siegel, Jacob L., Hassan, Youssef S., Grogan, Tanner, Kobayashi, Takumi, Gibble, Kurt, Porsev, Sergey G., Safronova, Marianna S., Brown, Roger C., Beloy, Kyle, and Ludlow, Andrew D.
Subjects: Physics - Atomic Physics, Quantum Physics
Abstract: In state-of-the-art optical lattice clocks, beyond-electric-dipole polarizability terms lead to a break-down of magic wavelength trapping. In this Letter, we report a novel approach to evaluate lattice light shifts, specifically addressing recent discrepancies in the atomic multipolarizability term between experimental techniques and theoretical calculations. We combine imaging and multi-ensemble techniques to evaluate lattice light shift atomic coefficients, leveraging comparisons in a dual-ensemble lattice clock to rapidly evaluate differential frequency shifts. Further, we demonstrate application of a running wave field to probe both the multipolarizability and hyperpolarizability coefficients, establishing a new technique for future lattice light shift evaluations., Comment: 17 pages, 6 figures
Published: 2024

14. A Likelihood Ratio-Based Approach to Segmenting Unknown Objects

Author: Nayal, Nazir, Shoeb, Youssef, and Güney, Fatma
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Addressing the Out-of-Distribution (OoD) segmentation task is a prerequisite for perception systems operating in an open-world environment. Large foundational models are frequently used in downstream tasks; however, their potential for OoD remains mostly unexplored. We seek to leverage a large foundational model to achieve robust representation. Outlier supervision is a widely used strategy for improving OoD detection of the existing segmentation networks. However, current approaches for outlier supervision involve retraining parts of the original network, which is typically disruptive to the model's learned feature representation. Furthermore, retraining becomes infeasible in the case of large foundational models. Our goal is to retrain for outlier segmentation without compromising the strong representation space of the foundational model. To this end, we propose an adaptive, lightweight unknown estimation module (UEM) for outlier supervision that significantly enhances the OoD segmentation performance without affecting the learned feature representation of the original network. UEM learns a distribution for outliers and a generic distribution for known classes. Using the learned distributions, we propose a likelihood-ratio-based outlier scoring function that fuses the confidence of UEM with that of the pixel-wise segmentation inlier network to detect unknown objects. We also propose an objective to optimize this score directly. Our approach achieves a new state-of-the-art across multiple datasets, outperforming the previous best method by 5.74% average precision points while having a lower false-positive rate. Importantly, strong inlier performance remains unaffected., Comment: 13 pages, 2 figures, and 4 tables
Published: 2024

15. Neural MP: A Generalist Neural Motion Planner

Author: Dalal, Murtaza, Yang, Jiahui, Mendonca, Russell, Khaky, Youssef, Salakhutdinov, Ruslan, and Pathak, Deepak
Subjects: Computer Science - Robotics, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: The current paradigm for motion planning generates solutions from scratch for every new problem, which consumes significant amounts of time and computational resources. For complex, cluttered scenes, motion planning approaches can often take minutes to produce a solution, while humans are able to accurately and safely reach any goal in seconds by leveraging their prior experience. We seek to do the same by applying data-driven learning at scale to the problem of motion planning. Our approach builds a large number of complex scenes in simulation, collects expert data from a motion planner, then distills it into a reactive generalist policy. We then combine this with lightweight optimization to obtain a safe path for real world deployment. We perform a thorough evaluation of our method on 64 motion planning tasks across four diverse environments with randomized poses, scenes and obstacles, in the real world, demonstrating an improvement of 23%, 17% and 79% motion planning success rate over state of the art sampling, optimization and learning based planning methods. Video results available at mihdalal.github.io/neuralmotionplanner, Comment: Website at mihdalal.github.io/neuralmotionplanner. Main paper: 7 pages, 4 figures, 2 tables. Appendix: 9 pages, 5 figures, 6 tables
Published: 2024

16. A System and Benchmark for LLM-based Q&A on Heterogeneous Data

Author: Fokoue, Achille, Jayaraman, Srideepika, Khabiri, Elham, Kephart, Jeffrey O., Li, Yingjie, Shah, Dhruv, Drissi, Youssef, Heath III, Fenno F., Bhamidipaty, Anu, Tipu, Fateh A., and Baseman, Robert J.
Subjects: Computer Science - Databases, Computer Science - Artificial Intelligence
Abstract: In many industrial settings, users wish to ask questions whose answers may be found in structured data sources such as spreadsheets, databases, APIs, or combinations thereof. Often, the user doesn't know how to identify or access the right data source. This problem is compounded even further if multiple (and potentially siloed) data sources must be assembled to derive the answer. Recently, various Text-to-SQL applications that leverage Large Language Models (LLMs) have addressed some of these problems by enabling users to ask questions in natural language. However, these applications remain impractical in realistic industrial settings because they fail to cope with the data source heterogeneity that typifies such environments. In this paper, we address heterogeneity by introducing the siwarex platform, which enables seamless natural language access to both databases and APIs. To demonstrate the effectiveness of siwarex, we extend the popular Spider dataset and benchmark by replacing some of its tables by data retrieval APIs. We find that siwarex does a good job of coping with data source heterogeneity. Our modified Spider benchmark will soon be available to the research community.
Published: 2024

17. Unmasking Covert Intrusions: Detection of Fault-Masking Cyberattacks on Differential Protection Systems

Author: Saber, Ahmad Mohammad, Youssef, Amr, Svetinovic, Davor, Zeineldin, Hatem, and El-Saadany, Ehab F.
Subjects: Electrical Engineering and Systems Science - Systems and Control, Computer Science - Cryptography and Security, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Signal Processing
Abstract: Line Current Differential Relays (LCDRs) are high-speed relays progressively used to protect critical transmission lines. However, LCDRs are vulnerable to cyberattacks. Fault-Masking Attacks (FMAs) are stealthy cyberattacks performed by manipulating the remote measurements of the targeted LCDR to disguise faults on the protected line. Hence, they remain undetected by this LCDR. In this paper, we propose a two-module framework to detect FMAs. The first module is a Mismatch Index (MI) developed from the protected transmission line's equivalent physical model. The MI is triggered only if there is a significant mismatch in the LCDR's local and remote measurements while the LCDR itself is untriggered, which indicates an FMA. After the MI is triggered, the second module, a neural network-based classifier, promptly confirms that the triggering event is a physical fault that lies on the line protected by the LCDR before declaring the occurrence of an FMA. The proposed framework is tested using the IEEE 39-bus benchmark system. Our simulation results confirm that the proposed framework can accurately detect FMAs on LCDRs and is not affected by normal system disturbances, variations, or measurement noise. Our experimental results using OPAL-RT's real-time simulator confirm the proposed solution's real-time performance capability., Comment: Accepted to IEEE Transactions on Systems, Man, and Cybernetics: Systems. © 2024 IEEE
Published: 2024

18. An Efficient Quantum Binary-Neuron Algorithm for Accurate Multi-Story Floor Localization

Author: Zook, Yousef, Shokry, Ahmed, and Youssef, Moustafa
Subjects: Quantum Physics
Abstract: Accurate floor localization in a multi-story environment is an important but challenging task. Among the current floor localization techniques, fingerprinting is the mainstream technology due to its accuracy in noisy environments. To achieve accurate floor localization in a building with many floors, we have to collect sufficient data on each floor, which needs significant storage and running time, preventing fingerprinting techniques from scaling to support large multi-story buildings, especially on a worldwide scale. In this paper, we propose a quantum algorithm for accurate multi-story localization. The proposed algorithm leverages quantum computing concepts to provide an exponential enhancement in both space and running time compared to the classical counterparts. In addition, it builds on an efficient binary-neuron implementation that can be implemented using fewer qubits compared to the typical non-binary neurons, allowing for easier deployment with near-term quantum devices. We implement the proposed algorithm on a real IBM quantum machine and evaluate it on three real indoor testbeds. Results confirm the exponential saving in both time and space for the proposed quantum algorithm, while keeping the same localization accuracy compared to the traditional classical techniques, and using half the number of qubits required for other quantum localization algorithms.
Published: 2024

19. How Could Generative AI Support Compliance with the EU AI Act? A Review for Safe Automated Driving Perception

Author: Keser, Mert, Shoeb, Youssef, and Knoll, Alois
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Deep Neural Networks (DNNs) have become central for the perception functions of autonomous vehicles, substantially enhancing their ability to understand and interpret the environment. However, these systems exhibit inherent limitations such as brittleness, opacity, and unpredictable behavior in out-of-distribution scenarios. The European Union (EU) Artificial Intelligence (AI) Act, as a pioneering legislative framework, aims to address these challenges by establishing stringent norms and standards for AI systems, including those used in autonomous driving (AD), which are categorized as high-risk AI. In this work, we explore how the newly available generative AI models can potentially support addressing upcoming regulatory requirements in AD perception, particularly with respect to safety. This short review paper summarizes the requirements arising from the EU AI Act regarding DNN-based perception systems and systematically categorizes existing generative AI applications in AD. While generative AI models show promise in addressing some of the EU AI Act's requirements, such as transparency and robustness, this review examines their potential benefits and discusses how developers could leverage these methods to enhance compliance with the Act. The paper also highlights areas where further research is needed to ensure reliable and safe integration of these technologies.
Published: 2024

20. Non-Reciprocal Transport of Thermally-Generated Magnons

Author: Cosset-Chéneau, M., Tirion, S. H., Wei, X. Y., Youssef, J. Ben, and van Wees, B. J.
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics, Condensed Matter - Materials Science
Abstract: We demonstrate the non-reciprocity of electrically and thermally-generated incoherent magnon transport using the magnetization direction of a Py wire placed on top of an ultrathin YIG film. We show that the transport properties of thermally-generated magnons under a Py wire depend on the relative orientation between the temperature gradient and the Py-magnetization direction. The symmetries of this non-reciprocal magnon transport match with those predicted by the remote dipolar interaction between YIG and Py magnons, controlled by the chirality of the YIG magnon dipolar stray fields. We also show that the directional magnon generation by the spin Seebeck effect from the Py wire displays the symmetries expected from the chiral spin Seebeck effect.
Published: 2024

21. On the design of scalable, high-precision spherical-radial Fourier features

Author: Belhadji, Ayoub, Zhu, Qianyu Julie, and Marzouk, Youssef
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning
Abstract: Approximation using Fourier features is a popular technique for scaling kernel methods to large-scale problems, with myriad applications in machine learning and statistics. This method replaces the integral representation of a shift-invariant kernel with a sum using a quadrature rule. The design of the latter is meant to reduce the number of features required for high-precision approximation. Specifically, for the squared exponential kernel, one must design a quadrature rule that approximates the Gaussian measure on $\mathbb{R}^d$. Previous efforts in this line of research have faced difficulties in higher dimensions. We introduce a new family of quadrature rules that accurately approximate the Gaussian measure in higher dimensions by exploiting its isotropy. These rules are constructed as a tensor product of a radial quadrature rule and a spherical quadrature rule. Compared to previous work, our approach leverages a thorough analysis of the approximation error, which suggests natural choices for both the radial and spherical components. We demonstrate that this family of Fourier features yields improved approximation bounds.
Published: 2024

22. ml_edm package: a Python toolkit for Machine Learning based Early Decision Making

Author: Renault, Aurélien, Achenchabe, Youssef, Bertrand, Édouard, Bondu, Alexis, Cornuéjols, Antoine, Lemaire, Vincent, and Dachraoui, Asma
Subjects: Computer Science - Machine Learning
Abstract: ml_edm is a Python 3 library designed for early decision making in any learning task involving temporal/sequential data. The package is also modular, providing researchers an easy way to implement their own triggering strategy for classification, regression or any machine learning task. As of now, many state-of-the-art Early Classification of Time Series (ECTS) algorithms are efficiently implemented in the library, leveraging parallel computation. The syntax follows the one introduced in scikit-learn, making estimators and pipelines compatible with ml_edm. This software is distributed under the BSD-3-Clause license; source code can be found at https://github.com/ML-EDM/ml_edm.
Published: 2024

23. Advances in Preference-based Reinforcement Learning: A Review

Author: Abdelkareem, Youssef, Shehata, Shady, and Karray, Fakhri
Subjects: Computer Science - Artificial Intelligence
Abstract: Reinforcement Learning (RL) algorithms suffer from the dependency on accurately engineered reward functions to properly guide the learning agents to do the required tasks. Preference-based reinforcement learning (PbRL) addresses that by utilizing human preferences as feedback from the experts instead of numeric rewards. Due to its promising advantage over traditional RL, PbRL has gained more focus in recent years with many significant advances. In this survey, we present a unified PbRL framework to include the newly emerging approaches that improve the scalability and efficiency of PbRL. In addition, we give a detailed overview of the theoretical guarantees and benchmarking work done in the field, while presenting its recent applications in complex real-world tasks. Lastly, we go over the limitations of the current approaches and the proposed future research directions.
Published: 2024
Full Text: View/download PDF

24. TimeSense: Multi-Person Device-free Indoor Localization via RTT

Author: Mohsen, Mohamed, Rizk, Hamada, Yamaguch, Hirozumi, and Youssef, Moustafa
Subjects: Electrical Engineering and Systems Science - Signal Processing, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Locating persons moving through an environment without requiring them to carry special devices has become vital for many applications including security, IoT, healthcare, etc. Existing device-free indoor localization systems commonly rely on Received Signal Strength Indicator (RSSI) and WiFi Channel State Information (CSI) techniques. However, the accuracy of RSSI is adversely affected by environmental factors like multi-path interference and fading. Additionally, the lack of standardization in CSI necessitates the use of specialized hardware and software. In this paper, we present TimeSense, a deep learning-based multi-person device-free indoor localization system that addresses these challenges. TimeSense leverages Time of Flight information acquired by the fine-time measurement protocol of the IEEE 802.11-2016 standard. Specifically, the measured round trip time between the transmitter and receiver is influenced by the dynamic changes in the environment induced by human presence. TimeSense effectively detects this anomalous behavior using a stacked denoising auto-encoder model, thereby estimating the user's location. The system incorporates a probabilistic approach on top of the deep learning model to ensure seamless tracking of the users. The evaluation of TimeSense in two realistic environments demonstrates its efficacy, achieving a median localization accuracy of 1.57 and 2.65 meters. This surpasses the performance of state-of-the-art techniques by 49% and 103% in the two testbeds.
Published: 2024

25. A Novel Approach to Classify Power Quality Signals Using Vision Transformers

Author: Saber, Ahmad Mohammad, Selim, Alaa, Hammad, Mohamed M., Youssef, Amr, Kundur, Deepa, and El-Saadany, Ehab
Subjects: Electrical Engineering and Systems Science - Signal Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: With the rapid integration of electronically interfaced renewable energy resources and loads into smart grids, there is increasing interest in power quality disturbances (PQD) classification to enhance the security and efficiency of these grids. This paper introduces a new approach to PQD classification based on the Vision Transformer (ViT) model. When a PQD occurs, the proposed approach first converts the power quality signal into an image and then utilizes a pre-trained ViT to accurately determine the class of the PQD. Unlike most previous works, which were limited to a few disturbance classes or small datasets, the proposed method is trained and tested on a large dataset with 17 disturbance classes. Our experimental results show that the proposed ViT-based approach achieves PQD classification precision and recall of 98.28% and 97.98%, respectively, outperforming recently proposed techniques applied to the same dataset., Comment: IECON 2024-50th Annual Conference of the IEEE Industrial Electronics Society, Chicago, U.S.A, 2024, pp. 1-6
Published: 2024

26. Possible wormholes in $f(R)$ gravity sourced by solitonic quantum wave and cold dark matter halos and their repulsive gravity effect

Author: Errehymy, Abdelghani, Khedif, Youssef, Donmez, Orhan, Daoud, Mohammed, Myrzakulov, Kairat, and Bekov, Sabit
Subjects: General Relativity and Quantum Cosmology
Abstract: In this paper, we present new generalized wormhole (WH) solutions within the context of $f(R)$ gravity. Specifically, we focus on $f(R)$ gravitational theories formulated in the metric formalism, with our investigation centered on a power-law form represented by $f(R) = \epsilon R^{\chi}$. Here, $\epsilon$ is an arbitrary constant, and $\chi$ is a real number. Notably, this form possesses the advantageous property of reducing to Einstein gravity when $\epsilon=1$ and $\chi=1$. To obtain these novel WH solutions, we establish the general field equations for any $f(R)$ theory within the framework of Morris-Thorne spacetime, assuming metric coefficients that are independent of time. By utilizing an anisotropic matter source and a specific type of energy density associated with solitonic quantum wave (SQW) and cold dark matter (CDM) halos, we calculate two distinct WH solutions. We thoroughly investigate the properties of the exotic matter (ExoM) residing within the WH geometry and analyze the matter contents through energy conditions (ECs). Both analytical and graphical methods are employed in this analysis to examine the validity of different regions. Notably, the calculated shape functions for the WH geometry satisfy the necessary conditions in both scenarios, emphasizing their reliability. This ExoM is characterized by an energy-momentum tensor that violates the null energy condition (NEC) and, consequently, the weak energy condition as well, in the vicinity of the WH throats. Furthermore, we investigated the repulsive effect of gravity and discovered that its presence results in a negative deflection angle for photons following null geodesics. Importantly, we observed that the deflection angle consistently exhibits negative values across all $r_0$ values in both scenarios, indicating the manifestation of the repulsive gravity effect., Comment: Accepted for publication in the European Physical Journal C, 15 pages, 18 figures
Published: 2024

27. Complexity of trust-region methods in the presence of unbounded Hessian approximations

Author: Diouane, Youssef, Habiboullah, Mohamed Laghdaf, and Orban, Dominique
Subjects: Mathematics - Optimization and Control
Abstract: We extend traditional complexity analyses of trust-region methods for unconstrained, possibly nonconvex, optimization. Whereas most complexity analyses assume uniform boundedness of the model Hessians, we work with potentially unbounded model Hessians. Boundedness is not guaranteed in practical implementations, in particular ones based on quasi-Newton updates such as PSB, BFGS and SR1. Our analysis is conducted for a family of trust-region methods that includes most known methods as special cases. We examine two regimes of Hessian growth: one bounded by a power of the number of successful iterations, and one bounded by a power of the number of iterations. This allows us to formalize and confirm the profound intuition of Powell [IMA J. Numer. Ana. 30(1):289-301,2010], who studied convergence under a special case of our assumptions, but whose proof contained complexity arguments. Specifically, for $0 \leq p < 1$, we establish sharp $O(\epsilon^{-2/(1-p)})$ evaluation complexity to find an $\epsilon$-stationary point when model Hessians are $O(k^p)$, where $k$ is the iteration counter. For $p = 1$, which is the case studied by Powell, we establish a sharp $O(\exp(c\epsilon^{-2}))$ evaluation complexity for a certain constant $c > 0$. This is as Powell suspected and is far worse than other bounds surmised elsewhere in the literature. We establish similar bounds when model Hessians are $O(|\mathcal{S}_k|^p)$, where $|\mathcal{S}_k|$ is the number of iterations where the step was accepted, up to iteration $k$. To the best of our knowledge, ours is the first work to provide complexity bounds when model Hessians grow linearly with $|\mathcal{S}_k|$ or at most linearly with $k$, which covers multiple quasi-Newton approximations.
Published: 2024

28. Improved Robustness for Deep Learning-based Segmentation of Multi-Center Myocardial Perfusion MRI Datasets Using Data Adaptive Uncertainty-guided Space-time Analysis

Author: Yalcinkaya, Dilek M., Youssef, Khalid, Heydari, Bobak, Wei, Janet, Merz, Noel Bairey, Judd, Robert, Dharmakumar, Rohan, Simonetti, Orlando P., Weinsaft, Jonathan W., Raman, Subha V., and Sharif, Behzad
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Physics - Medical Physics
Abstract: Background. Fully automatic analysis of myocardial perfusion MRI datasets enables rapid and objective reporting of stress/rest studies in patients with suspected ischemic heart disease. Developing deep learning techniques that can analyze multi-center datasets despite limited training data and variations in software and hardware is an ongoing challenge. Methods. Datasets from 3 medical centers acquired at 3T (n = 150 subjects) were included: an internal dataset (inD; n = 95) and two external datasets (exDs; n = 55) used for evaluating the robustness of the trained deep neural network (DNN) models against differences in pulse sequence (exD-1) and scanner vendor (exD-2). A subset of inD (n = 85) was used for training/validation of a pool of DNNs for segmentation, all using the same spatiotemporal U-Net architecture and hyperparameters but with different parameter initializations. We employed a space-time sliding-patch analysis approach that automatically yields a pixel-wise "uncertainty map" as a byproduct of the segmentation process. In our approach, a given test case is segmented by all members of the DNN pool and the resulting uncertainty maps are leveraged to automatically select the "best" one among the pool of solutions. Results. The proposed DAUGS analysis approach performed similarly to the established approach on the internal dataset (p = n.s.) whereas it significantly outperformed on the external datasets (p < 0.005 for exD-1 and exD-2). Moreover, the number of image series with "failed" segmentation was significantly lower for the proposed vs. the established approach (4.3% vs. 17.1%, p < 0.0005). Conclusions. The proposed DAUGS analysis approach has the potential to improve the robustness of deep learning methods for segmentation of multi-center stress perfusion datasets with variations in the choice of pulse sequence, site location or scanner vendor., Comment: Accepted for publication in JCMR, 2024
Published: 2024

29. Advancing Ear Biometrics: Enhancing Accuracy and Robustness through Deep Learning

Author: Mohamed, Youssef, Youssef, Zeyad, Heakl, Ahmed, and Zaky, Ahmed
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Human-Computer Interaction, Computer Science - Machine Learning, Computer Science - Multimedia
Abstract: Biometric identification is a reliable method to verify individuals based on their unique physical or behavioral traits, offering a secure alternative to traditional methods like passwords or PINs. This study focuses on ear biometric identification, exploiting its distinctive features for enhanced accuracy, reliability, and usability. While past studies typically investigate face recognition and fingerprint analysis, our research demonstrates the effectiveness of ear biometrics in overcoming limitations such as variations in facial expressions and lighting conditions. We utilized two datasets: AMI (700 images from 100 individuals) and EarNV1.0 (28,412 images from 164 individuals). To improve the accuracy and robustness of our ear biometric identification system, we applied various techniques including data preprocessing and augmentation. Our models achieved a testing accuracy of 99.35% on the AMI Dataset and 98.1% on the EarNV1.0 dataset, showcasing the effectiveness of our approach in precisely identifying individuals based on ear biometric characteristics., Comment: 6 pages, 8 figures, 3 tables, International IEEE Conference on the Intelligent Methods, Systems, and Applications
Published: 2024

30. Across Four Nations: Comparing the Discourses of Adolescents' Digital Literacy

Author: Dingxin Rao, Changhee Lee, Youssef Fdilat, Abdelmajid Bouziane, and Mark Dressman
Abstract: In this study, we investigated media reports and literacy research in four nations--China, Morocco, the Republic of (South) Korea, and the United States--about the relationship between adolescents' literacy and use of digital media, or digital literacy. We present short "snapshots" of adolescents' digital literacy in each country and then compare these to findings in a report about adolescent literacy and uses of digital media published by the Program for International Student Assessment (PISA). Our analysis indicates significant variation across countries in both literate traditions and adolescents' access to digital media, and notes that these interact to create unique conditions for adolescents' digital literacy in each country, even as, across the four nations, adolescents' capacity to innovate and solve problems with digital access seems constant. In conclusion, we are cautious about making global claims about the state of adolescents' literacy worldwide but point to important findings about how the use of the internet in schools seems to have a positive impact on reading performance and offer some implications for classroom practice.
Published: 2024
Full Text: View/download PDF

31. A High-Resolution, Large-Scale Agent-Based Transport Model for Health Outcomes Evaluation from Policy Changes

Author: Laarabi, Haitam, Xu, Xiaodan, Jin, Ling, Brauer, Michael, Spurlock, Anna, Kirchstetter, Thomas, Marshall, Julian, Arku, Raphael, Waraich, Rashid, Anenberg, Susan, and Oulhote, Youssef
Subjects: Public Health, Health Sciences, Human Society, 8.3 Policy, ethics, and research governance, Generic health relevance, Good Health and Well Being, agent-based model, air pollution, environmental health, environmental justice, policy, traffic-related
Abstract: Background and aim. Traffic-Related Air Pollution (TrAP) adversely impacts human health, disproportionately harming disadvantaged communities. New technologies and infrastructure offer opportunities to reduce TrAP, but the health outcomes of individuals are not fully understood due to a lack of high-resolution models that grasp the complexities of transportation systems and their health implications amid evolving policies and technologies. Method. We introduce BEAM CORE (beam.lbl.gov), a high-resolution, agent-based transportation framework that simulates detailed passenger and freight activities. It captures interactions between transportation, land use, demographic and vehicle ownership changes at various scales. Validating crucial factors of emission modeling, including link-level VMT, speed and regional fleet in the San Francisco Bay Area’s nine counties, demonstrates its potential to be extended for assessing health outcomes from changes in TrAP. Results. All major outputs from the BEAM CORE 2018 baseline have been calibrated and validated. Mode split and demographics align closely with census and survey data. Passenger and freight activities were validated against public and private data, with CO2 emissions corresponding to 3.67Mt/yr for medium/heavy-duty (MHD) and 22.79Mt/yr for all vehicles, demonstrating the model’s alignment with empirical data. The NOx, PM2.5 and PM10 from MHD exhaust, PM brake and tire wear are 14.8kt/yr, 424t/yr and 606.9t/yr under the 2018 baseline with high fractions of conventional vehicles, while the wide adoption of clean truck technologies under 2050 resulted in 87%, 75% and 56% reductions respectively. BEAM CORE generates detailed fleet and activity data at high spatiotemporal resolution, enabling the integration with air quality models, including InMAP/AERMOD, to explore the causal pathway of health impacts from transport policy changes. Conclusions. We developed a sophisticated multi-dimensional transportation model for integration with advanced air quality, and health assessment models. It enables a thorough analysis of health impacts of transportation policies and technologies across diverse communities. It supports similar analyses in any area using local data.
Published: 2024

32. Stochastic Aggregation Diffusion-Equation : Analysis via Dirichlet Forms

Author: Bourabiaa, Jaouad, Elmadani, Youssef, and Hanine, Abdelouahab
Subjects: Mathematics - Probability, Mathematics - Analysis of PDEs, 35R60, 60J60, 60J46, 31C25
Abstract: In this article, we study the stochastic aggregation-diffusion equation with a singular drift represented by a monotone radial kernel. We demonstrate the existence and uniqueness of a diffusion process that acts as a weak solution to our equation. This process can be described as a distorted Brownian motion originating from a delocalized point. Utilizing Dirichlet form theory, we prove the existence of a weak solution for a quasi-everywhere point in a state space. However, uniqueness is not assured for solutions commencing from points outside polar sets, and explicitly characterizing these sets poses a significant challenge. To address this, we employ the $H_2$-condition introduced by Albeverio et al. (2003). This condition provides a more thorough understanding of the uniqueness issue within the framework of Dirichlet forms. Consequently, the $H_2$-condition is pivotal in enhancing the analysis of weak solutions, ensuring a more detailed comprehension of the problem. An explicit expression for the generalized Schrödinger operator associated with certain kernels is also provided.
Published: 2024

33. Transverse resistance due to electronic inhomogeneities in superconductors

Author: Sengupta, Shamashis, Farhadizadeh, Alireza, Youssef, Joe, Loucif, Sara, Pallier, Florian, Dumoulin, Louis, Saha, Kasturi, Pujari, Sumiran, Oden, Magnus, Marrache-Kikuchi, Claire, and Monteverde, Miguel
Subjects: Condensed Matter - Superconductivity, Condensed Matter - Mesoscale and Nanoscale Physics, Condensed Matter - Statistical Mechanics
Abstract: Phase transitions in many-body systems are often associated with the emergence of spatial inhomogeneities. Such features may develop at microscopic lengthscales and are not necessarily evident in measurements of macroscopic quantities. In this work, we address the topic of distribution of current paths in superconducting films. Typical lengthscales associated with superconductivity are in the range of nanometres. Accordingly, measurements of electrical resistance over much larger distances are supposed to be insensitive to details of spatial inhomogeneities of electronic properties. We observe that, contrary to expectations, current paths adopt a highly non-uniform distribution at the onset of the superconducting transition which is manifested in the development of a finite transverse resistance. The anisotropic distribution of current density is unrelated to the structural properties of the superconducting films, and indicates the emergence of electronic inhomogeneities perceivable over macroscopic distances. Our experiments reveal the ubiquitous nature of this phenomenon in conventional superconductors.
Published: 2024

34. Optimal experimental design: Formulations and computations

Author: Huan, Xun, Jagalur, Jayanth, and Marzouk, Youssef
Subjects: Statistics - Methodology, Mathematics - Numerical Analysis, Statistics - Computation
Abstract: Questions of `how best to acquire data' are essential to modeling and prediction in the natural and social sciences, engineering applications, and beyond. Optimal experimental design (OED) formalizes these questions and creates computational methods to answer them. This article presents a systematic survey of modern OED, from its foundations in classical design theory to current research involving OED for complex models. We begin by reviewing criteria used to formulate an OED problem and thus to encode the goal of performing an experiment. We emphasize the flexibility of the Bayesian and decision-theoretic approach, which encompasses information-based criteria that are well-suited to nonlinear and non-Gaussian statistical models. We then discuss methods for estimating or bounding the values of these design criteria; this endeavor can be quite challenging due to strong nonlinearities, high parameter dimension, large per-sample costs, or settings where the model is implicit. A complementary set of computational issues involves optimization methods used to find a design; we discuss such methods in the discrete (combinatorial) setting of observation selection and in settings where an exact design can be continuously parameterized. Finally we present emerging methods for sequential OED that build non-myopic design policies, rather than explicit designs; these methods naturally adapt to the outcomes of past experiments in proposing new experiments, while seeking coordination among all experiments to be performed. Throughout, we highlight important open questions and challenges., Comment: Appears in Acta Numerica 2024. This version contains an evolving set of post-publication additions and corrections
Published: 2024

35. DeepCell: A Ubiquitous Accurate Provider-side Cellular-based Localization

Author: Shokry, Ahmed and Youssef, Moustafa
Subjects: Computer Science - Computers and Society, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Signal Processing
Abstract: Although outdoor localization is already available to the general public and businesses through the widespread use of GPS, it is not supported by low-end phones, requires a direct line of sight to satellites and can drain phone battery quickly. The current fingerprinting solutions can provide high-accuracy localization but are based on the client side. This limits their ubiquitous deployment and accuracy. In this paper, we introduce DeepCell: a provider-side fingerprinting localization system that can provide high-accuracy localization for any cell phone. To build its fingerprint, DeepCell leverages the unlabeled cellular measurements recorded by the cellular provider while opportunistically synchronizing with selected client devices to get location labels. The fingerprint is then used to train a deep neural network model that is harnessed for localization. To achieve this goal, DeepCell needs to address a number of challenges including using unlabeled data from the provider side, handling noise and sparsity, scaling the data to large areas, and finally providing enough data that is required for training deep models without overhead. Evaluation of DeepCell in a typical realistic environment shows that it can achieve a consistent median accuracy of 29m. This accuracy outperforms the state-of-the-art client-based cellular-based systems by more than 75.4%. In addition, the same accuracy is extended to low-end phones., Comment: arXiv admin note: substantial text overlap with arXiv:2106.13632
Published: 2024

36. Handling Device Heterogeneity for Deep Learning-based Localization

Author: Shokry, Ahmed and Youssef, Moustafa
Subjects: Computer Science - Computers and Society, Computer Science - Machine Learning
Abstract: Deep learning-based fingerprinting is one of the current promising technologies for outdoor localization in cellular networks. However, deploying such localization systems for heterogeneous phones affects their accuracy as the cellular received signal strength (RSS) readings vary for different types of phones. In this paper, we introduce a number of techniques for addressing the phone heterogeneity problem in deep learning-based localization systems. The basic idea is either to approximate a function that maps the cellular RSS measurements between different devices or to transfer the knowledge across them. Evaluation of the proposed techniques using different Android phones on four independent testbeds shows that our techniques can improve the localization accuracy by more than 220% for the four testbeds as compared to the state-of-the-art systems. This highlights the promise of the proposed device heterogeneity handling techniques for enabling a wide deployment of deep learning-based localization systems over different devices.
Published: 2024

37. An Efficient Quantum Euclidean Similarity Algorithm for Worldwide Localization

Author: Shokry, Ahmed and Youssef, Moustafa
Subjects: Quantum Physics
Abstract: Fingerprinting techniques are widely used for localization because of their accuracy, especially in the presence of wireless channel noise. However, fingerprinting techniques require significant storage and running time, which is a concern when implementing such systems on a worldwide scale. In this paper, we propose an efficient quantum Euclidean similarity algorithm for wireless localization systems. The proposed quantum algorithm offers exponentially improved complexity compared to its classical counterpart and even the state-of-the-art quantum localization systems, in terms of both storage space and running time. The basic idea is to entangle the test received signal strength (RSS) vector with the fingerprint vectors at different locations and perform the similarity calculation in parallel across all fingerprint locations. We give the details of how to construct the quantum fingerprint and how to encode the RSS measurements in quantum particles, and finally present the quantum algorithm for calculating the Euclidean similarity between the online RSS measurements and the fingerprint ones. Implementation and evaluation of our algorithm in a real testbed using a real IBM quantum machine as well as a simulation for a larger testbed confirm its ability to correctly obtain the estimated location with an exponential enhancement in both time and space compared to the traditional classical fingerprinting techniques and the state-of-the-art quantum localization techniques.
Published: 2024

38. EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition

Author: Doulfoukar, Youssef, Mertens, Laurent, and Vennekens, Joost
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Convolutional Neural Networks are particularly suited for image analysis tasks, such as Image Classification, Object Recognition or Image Segmentation. Like all Artificial Neural Networks, however, they are "black box" models, and suffer from poor explainability. This work is concerned with the specific downstream task of Emotion Recognition from images, and proposes a framework that combines CAM-based techniques with Object Detection on a corpus level to better understand on which image cues a particular model, in our case EmoNet, relies to assign a specific emotion to an image. We demonstrate that the model mostly focuses on human characteristics, but also explore the pronounced effect of specific image modifications., Comment: 10 pages, 7 figures
Published: 2024

39. Gradient Flows and Riemannian Structure in the Gromov-Wasserstein Geometry

Author: Zhang, Zhengxin, Goldfeld, Ziv, Greenewald, Kristjan, Mroueh, Youssef, and Sriperumbudur, Bharath K.
Subjects: Mathematics - Analysis of PDEs, Mathematics - Optimization and Control, Statistics - Machine Learning
Abstract: The Wasserstein space of probability measures is known for its intricate Riemannian structure, which underpins the Wasserstein geometry and enables gradient flow algorithms. However, the Wasserstein geometry may not be suitable for certain tasks or data modalities. Motivated by scenarios where the global structure of the data needs to be preserved, this work initiates the study of gradient flows and Riemannian structure in the Gromov-Wasserstein (GW) geometry, which is particularly suited for such purposes. We focus on the inner product GW (IGW) distance between distributions on $\mathbb{R}^d$. Given a functional $\mathsf{F}:\mathcal{P}_2(\mathbb{R}^d)\to\mathbb{R}$ to optimize, we present an implicit IGW minimizing movement scheme that generates a sequence of distributions $\{\rho_i\}_{i=0}^n$, which are close in IGW and aligned in the 2-Wasserstein sense. Taking the time step to zero, we prove that the discrete solution converges to an IGW generalized minimizing movement (GMM) $(\rho_t)_t$ that follows the continuity equation with a velocity field $v_t\in L^2(\rho_t;\mathbb{R}^d)$, specified by a global transformation of the Wasserstein gradient of $\mathsf{F}$. The transformation is given by a mobility operator that modifies the Wasserstein gradient to encode not only local information, but also global structure. Our gradient flow analysis leads us to identify the Riemannian structure that gives rise to the intrinsic IGW geometry, using which we establish a Benamou-Brenier-like formula for IGW. We conclude with a formal derivation, akin to the Otto calculus, of the IGW gradient as the inverse mobility acting on the Wasserstein gradient. Numerical experiments validating our theory and demonstrating the global nature of IGW interpolations are provided., Comment: 73 pages
Published: 2024

40. Anticipating Future Object Compositions without Forgetting

Author: Zahran, Youssef, Burghouts, Gertjan, and Eisma, Yke Bauke
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Despite the significant advancements in computer vision models, their ability to generalize to novel object-attribute compositions remains limited. Existing methods for Compositional Zero-Shot Learning (CZSL) mainly focus on image classification. This paper aims to enhance CZSL in object detection without forgetting prior learned knowledge. We use Grounding DINO and incorporate Compositional Soft Prompting (CSP) into it and extend it with Compositional Anticipation. We achieve a 70.5% improvement over CSP on the harmonic mean (HM) between seen and unseen compositions on the CLEVR dataset. Furthermore, we introduce Contrastive Prompt Tuning to incrementally address model confusion between similar compositions. We demonstrate the effectiveness of this method and achieve an increase of 14.5% in HM across the pretrain, increment, and unseen sets. Collectively, these methods provide a framework for learning various compositions with limited data, as well as improving the performance of underperforming compositions when additional data becomes available.
Published: 2024

41. Spatio-temporal neural distance fields for conditional generative modeling of the heart

Author: Sørensen, Kristine, Diez, Paula, Margeta, Jan, Youssef, Yasmin El, Pham, Michael, Pedersen, Jonas Jalili, Kühl, Tobias, de Backer, Ole, Kofoed, Klaus, Camara, Oscar, and Paulsen, Rasmus
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: The rhythmic pumping motion of the heart stands as a cornerstone in life, as it circulates blood to the entire human body through a series of carefully timed contractions of the individual chambers. Changes in the size, shape and movement of the chambers can be important markers for cardiac disease and modeling this in relation to clinical demography or disease is therefore of interest. Existing methods for spatio-temporal modeling of the human heart require shape correspondence over time or suffer from large memory requirements, making them difficult to use for complex anatomies. We introduce a novel conditional generative model, where the shape and movement are modeled implicitly in the form of a spatio-temporal neural distance field and conditioned on clinical demography. The model is based on an auto-decoder architecture and aims to disentangle the individual variations from those related to the clinical demography. It is tested on the left atrium (including the left atrial appendage), where it outperforms current state-of-the-art methods for anatomical sequence completion and generates synthetic sequences that realistically mimic the shape and motion of the real left atrium. In practice, this means we can infer functional measurements from a static image, generate synthetic populations with specified demography or disease and investigate how non-imaging clinical data affect the shape and motion of cardiac anatomies., Comment: Accepted for MICCAI2024
Published: 2024

42. A Perspective on Foundation Models for the Electric Power Grid

Author: Hamann, Hendrik F., Brunschwiler, Thomas, Gjorgiev, Blazhe, Martins, Leonardo S. A., Puech, Alban, Varbella, Anna, Weiss, Jonas, Bernabe-Moreno, Juan, Massé, Alexandre Blondin, Choi, Seong, Foster, Ian, Hodge, Bri-Mathias, Jain, Rishabh, Kim, Kibaek, Mai, Vincent, Mirallès, François, De Montigny, Martin, Ramos-Leaños, Octavio, Suprême, Hussein, Xie, Le, Youssef, El-Nasser S., Zinflou, Arnaud, Belvi, Alexander J., Bessa, Ricardo J., Bhattari, Bishnu Prasad, Schmude, Johannes, and Sobolevsky, Stanislav
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computational Engineering, Finance, and Science, Electrical Engineering and Systems Science - Systems and Control
Abstract: Foundation models (FMs) currently dominate news headlines. They employ advanced deep learning architectures to extract structural information autonomously from vast datasets through self-supervision. The resulting rich representations of complex systems and dynamics can be applied to many downstream applications. Therefore, FMs can find uses in electric power grids, challenged by the energy transition and climate change. In this paper, we call for the development of, and state why we believe in, the potential of FMs for electric grids. We highlight their strengths and weaknesses amidst the challenges of a changing grid. We argue that an FM learning from diverse grid data and topologies could unlock transformative capabilities, pioneering a new approach in leveraging AI to redefine how we manage complexity and uncertainty in the electric grid. Finally, we discuss a power grid FM concept, namely GridFM, based on graph neural networks and show how different downstream tasks benefit., Comment: Lead contact: H.F.H.; Major equal contributors: H.F.H., T.B., B.G., L.S.A.M., A.P., A.V., J.W.; Significant equal contributors: J.B., A.B.M., S.C., I.F., B.H., R.J., K.K., V.M., F.M., M.D.M., O.R., H.S., L.X., E.S.Y., A.Z.; Other equal contributors: A.J.B., R.J.B., B.P.B., J.S., S.S
Published: 2024

43. A Deployable Quantum Access Points Selection Algorithm for Large-Scale Localization

Author: Shokry, Ahmed and Youssef, Moustafa
Subjects: Quantum Physics
Abstract: Effective access points (APs) selection is a crucial step in localization systems. It directly affects both localization accuracy and computational efficiency. Classical APs selection algorithms are usually computationally expensive, hindering the deployment of localization systems on a large worldwide scale. In this paper, we introduce a quantum APs selection algorithm for large-scale localization systems. The proposed algorithm leverages quantum annealing to eliminate redundant and noisy APs. We explain how to formulate the APs selection problem as a quadratic unconstrained binary optimization (QUBO) problem, suitable for quantum annealing, and how to select the minimum number of APs that maintain the same overall localization system accuracy as the complete APs set. Based on this, we further propose a logarithmic-complexity algorithm to select the optimal number of APs. We implement our quantum algorithm on a real D-Wave Systems quantum machine and assess its performance in a real test environment for a floor localization problem. Our findings reveal that by selecting fewer than 14% of the available APs in the environment, our quantum algorithm achieves the same floor localization accuracy as utilizing the entire set of APs and a superior accuracy over utilizing the reduced dataset by classical APs selection counterparts. Moreover, the proposed quantum algorithm achieves more than an order of magnitude speedup over the corresponding classical APs selection algorithms, emphasizing the efficiency of the proposed quantum algorithm for large-scale localization systems.
Published: 2024

44. Fine-Tuning Stable Diffusion XL for Stylistic Icon Generation: A Comparison of Caption Size

Author: Sultan, Youssef, Ma, Jiangqin, and Liao, Yu-Ying
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we show different fine-tuning methods for Stable Diffusion XL; this includes inference steps and caption customization for each image to align with generating images in the style of a commercial 2D icon training set. We also show how important it is to properly define what "high-quality" really is, especially for a commercial-use environment. As generative AI models continue to gain widespread acceptance and usage, there emerge many different ways to optimize and evaluate them for various applications. Specifically, text-to-image models such as Stable Diffusion XL and DALL-E 3 require distinct evaluation practices to effectively generate high-quality icons according to a specific style. Although some images that are generated based on a certain style may have a lower FID score (better), we show how this is not absolute in and of itself even for rasterized icons. While FID scores reflect the similarity of generated images to the overall training set, CLIP scores measure the alignment between generated images and their textual descriptions. We show how FID scores miss significant aspects, such as the minority of pixel differences that matter most in an icon, while CLIP scores result in misjudging the quality of icons. The CLIP model's understanding of "similarity" is shaped by its own training data, which does not account for feature variation in our style of choice. Our findings highlight the need for specialized evaluation metrics and fine-tuning approaches when generating high-quality commercial icons, potentially leading to more effective and tailored applications of text-to-image models in professional design contexts., Comment: 11 pages, 22 figures
Published: 2024

45. Toto: Time Series Optimized Transformer for Observability

Author: Cohen, Ben, Khwaja, Emaad, Wang, Kan, Masson, Charles, Ramé, Elise, Doubli, Youssef, and Abou-Amal, Othmane
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: This technical report describes the Time Series Optimized Transformer for Observability (Toto), a new state of the art foundation model for time series forecasting developed by Datadog. In addition to advancing the state of the art on generalized time series benchmarks in domains such as electricity and weather, this model is the first general-purpose time series forecasting foundation model to be specifically tuned for observability metrics. Toto was trained on a dataset of one trillion time series data points, the largest among all currently published time series foundation models. Alongside publicly available time series datasets, 75% of the data used to train Toto consists of fully anonymous numerical metric data points from the Datadog platform. In our experiments, Toto outperforms existing time series foundation models on observability data. It does this while also excelling at general-purpose forecasting tasks, achieving state-of-the-art zero-shot performance on multiple open benchmark datasets.
Published: 2024

46. Raply: A profanity-mitigated rap generator

Author: Bendali, Omar Manil, Ferroum, Samir, Kozachenko, Ekaterina, Parviz, Youssef, Shcharbakova, Hanna, Tokareva, Anna, and Williams, Shemair
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: The task of writing rap is challenging and involves producing complex rhyming schemes, yet meaningful lyrics. In this work, we propose Raply, a fine-tuned GPT-2 model capable of producing meaningful rhyming text in the style of rap. In addition to its rhyming capabilities, the model is able to generate less offensive content. This was achieved by fine-tuning the model on a new dataset, Mitislurs, a profanity-mitigated corpus. We evaluate the output of the model on two criteria: 1) rhyming, based on the rhyme density metric; 2) profanity content, using the list of profanities for the English language. To our knowledge, this is the first attempt at profanity mitigation for rap lyrics generation.
Published: 2024

47. Decomposition of an $ L^{1}(T) $-bounded martingale and Applications in Riesz spaces

Author: Niouar, Mounsif, Boukara, Tarik, Ramdane, Kawtar, and Bentaleb, Youssef
Subjects: Mathematics - Probability, Mathematics - Functional Analysis, 60G48, 60G42, 47B60
Abstract: In this work, we give a decomposition of a martingale into three martingales, with applications to certain types of inequalities in the new theory of Stochastic Analysis in Vector Lattices.
Published: 2024

48. ResumeAtlas: Revisiting Resume Classification with Large-Scale Datasets and Large Language Models

Author: Heakl, Ahmed, Mohamed, Youssef, Mohamed, Noran, Elsharkawy, Aly, and Zaky, Ahmed
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Computers and Society, Computer Science - Machine Learning
Abstract: The increasing reliance on online recruitment platforms coupled with the adoption of AI technologies has highlighted the critical need for efficient resume classification methods. However, challenges such as small datasets, lack of standardized resume templates, and privacy concerns hinder the accuracy and effectiveness of existing classification models. In this work, we address these challenges by presenting a comprehensive approach to resume classification. We curated a large-scale dataset of 13,389 resumes from diverse sources and employed Large Language Models (LLMs) such as BERT and Gemma1.1 2B for classification. Our results demonstrate significant improvements over traditional machine learning approaches, with our best model achieving a top-1 accuracy of 92% and a top-5 accuracy of 97.5%. These findings underscore the importance of dataset quality and advanced model architectures in enhancing the accuracy and robustness of resume classification systems, thus advancing the field of online recruitment practices., Comment: 8 pages, 6 figures, 1 table, 6th International Conference on AI in Computational Linguistics
Published: 2024

49. ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMs

Author: Heakl, Ahmed, Zaghloul, Youssef, Ali, Mennatullah, Hossam, Rania, and Gomaa, Walid
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Computers and Society, Computer Science - Machine Learning
Abstract: Motivated by the widespread increase in the phenomenon of code-switching between Egyptian Arabic and English in recent times, this paper explores the intricacies of machine translation (MT) and automatic speech recognition (ASR) systems, focusing on translating code-switched Egyptian Arabic-English to either English or Egyptian Arabic. Our goal is to present the methodologies employed in developing these systems, utilizing large language models such as LLama and Gemma. In the field of ASR, we explore the utilization of the Whisper model for code-switched Egyptian Arabic recognition, detailing our experimental procedures including data preprocessing and training techniques. Through the implementation of a consecutive speech-to-text translation system that integrates ASR with MT, we aim to overcome challenges posed by limited resources and the unique characteristics of the Egyptian Arabic dialect. Evaluation against established metrics showcases promising results, with our methodologies yielding a significant improvement of $56\%$ in English translation over the state-of-the-art and $9.3\%$ in Arabic translation. Since code-switching is deeply inherent in spoken languages, it is crucial that ASR systems can effectively handle this phenomenon. This capability is crucial for enabling seamless interaction in various domains, including business negotiations, cultural exchanges, and academic discourse. Our models and code are available as open-source resources. Code: \url{http://github.com/ahmedheakl/arazn-llm}, Models: \url{http://huggingface.co/collections/ahmedheakl/arazn-llm-662ceaf12777656607b9524e}., Comment: 9 pages, 4 figures, 5 tables, 6th International Conference on AI in Computational Linguistics
Published: 2024

50. Clock-line-mediated Sisyphus Cooling

Author: Chen, Chun-Chia, Siegel, Jacob L., Hunt, Benjamin D., Grogan, Tanner, Hassan, Youssef S., Beloy, Kyle, Gibble, Kurt, Brown, Roger C., and Ludlow, Andrew D.
Subjects: Physics - Atomic Physics
Abstract: We demonstrate sub-recoil Sisyphus cooling using the long-lived $^{3}\mathrm{P}_{0}$ clock state in alkaline-earth-like ytterbium. A 1388 nm optical standing wave nearly resonant with the $^{3}\textrm{P}_{0}$$\,\rightarrow$$\,^{3}\textrm{D}_{1}$ transition creates a spatially periodic light shift of the $^{3}\textrm{P}_{0}$ clock state. Following excitation on the ultranarrow clock transition, we observe Sisyphus cooling in this potential, as the light shift is correlated with excitation to $^{3}\textrm{D}_{1}$ and subsequent spontaneous decay to the $^{1}\textrm{S}_{0}$ ground state. We observe that cooling enhances the loading efficiency of atoms into a 759 nm magic-wavelength one-dimensional (1D) optical lattice, as compared to standard Doppler cooling on the $^{1}\textrm{S}_{0}$$\,\rightarrow\,$$^{3}\textrm{P}_{1}$ transition. Sisyphus cooling yields temperatures below 200 nK in the weakly confined, transverse dimensions of the 1D optical lattice. These lower temperatures improve optical lattice clocks by facilitating the use of shallow lattices with reduced light shifts, while retaining large atom numbers to reduce the quantum projection noise. This Sisyphus cooling can be pulsed or continuous and is applicable to a range of quantum metrology applications., Comment: 8 pages, 6 figures
Published: 2024