Author: "Perez-Ortiz A" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Perez-Ortiz A"' showing total 612 results

Start Over Author "Perez-Ortiz A"

612 results on '"Perez-Ortiz A"'

1. How Can We Diagnose and Treat Bias in Large Language Models for Clinical Decision-Making?

Author: Benkirane, Kenza, Kay, Jackie, and Perez-Ortiz, Maria
Subjects: Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Recent advancements in Large Language Models (LLMs) have positioned them as powerful tools for clinical decision-making, with rapidly expanding applications in healthcare. However, concerns about bias remain a significant challenge in the clinical implementation of LLMs, particularly regarding gender and ethnicity. This research investigates the evaluation and mitigation of bias in LLMs applied to complex clinical cases, focusing on gender and ethnicity biases. We introduce a novel Counterfactual Patient Variations (CPV) dataset derived from the JAMA Clinical Challenge. Using this dataset, we built a framework for bias evaluation, employing both Multiple Choice Questions (MCQs) and corresponding explanations. We explore prompting with eight LLMs and fine-tuning as debiasing methods. Our findings reveal that addressing social biases in LLMs requires a multidimensional approach as mitigating gender bias can occur while introducing ethnicity biases, and that gender bias in LLM embeddings varies significantly across medical specialities. We demonstrate that evaluating both MCQ response and explanation processes is crucial, as correct responses can be based on biased \textit{reasoning}. We provide a framework for evaluating LLM bias in real-world clinical cases, offer insights into the complex nature of bias in these models, and present strategies for bias mitigation.
Published: 2024

2. Crafting desirable climate trajectories with RL explored socio-environmental simulations

Author: Rudd-Jones, James, Thendean, Fiona, and Pérez-Ortiz, María
Subjects: Physics - Physics and Society, Computer Science - Artificial Intelligence
Abstract: Climate change poses an existential threat, necessitating effective climate policies to enact impactful change. Decisions in this domain are incredibly complex, involving conflicting entities and evidence. In the last decades, policymakers increasingly use simulations and computational methods to guide some of their decisions. Integrated Assessment Models (IAMs) are one of such methods, which combine social, economic, and environmental simulations to forecast potential policy effects. For example, the UN uses outputs of IAMs for their recent Intergovernmental Panel on Climate Change (IPCC) reports. Traditionally these have been solved using recursive equation solvers, but have several shortcomings, e.g. struggling at decision making under uncertainty. Recent preliminary work using Reinforcement Learning (RL) to replace the traditional solvers shows promising results in decision making in uncertain and noisy scenarios. We extend on this work by introducing multiple interacting RL agents as a preliminary analysis on modelling the complex interplay of socio-interactions between various stakeholders or nations that drives much of the current climate crisis. Our findings show that cooperative agents in this framework can consistently chart pathways towards more desirable futures in terms of reduced carbon emissions and improved economy. However, upon introducing competition between agents, for instance by using opposing reward functions, desirable climate futures are rarely reached. Modelling competition is key to increased realism in these simulations, as such we employ policy interpretation by visualising what states lead to more uncertain behaviour, to understand algorithm failure. Finally, we highlight the current limitations and avenues for further work to ensure future technology uptake for policy derivation., Comment: 23 pages, 13 Figures
Published: 2024

3. Enhancing Critical Thinking in Education by means of a Socratic Chatbot

Author: Favero, Lucile, Pérez-Ortiz, Juan Antonio, Käser, Tanja, and Oliver, Nuria
Subjects: Computer Science - Human-Computer Interaction
Abstract: While large language models (LLMs) are increasingly playing a pivotal role in education by providing instantaneous, adaptive responses, their potential to promote critical thinking remains understudied. In this paper, we fill such a gap and present an innovative educational chatbot designed to foster critical thinking through Socratic questioning. Unlike traditional intelligent tutoring systems, including educational chatbots, that tend to offer direct answers, the proposed Socratic tutor encourages students to explore various perspectives and engage in self-reflection by posing structured, thought-provoking questions. Our Socratic questioning is implemented by fine and prompt-tuning the open-source pretrained LLM with a specialized dataset that stimulates critical thinking and offers multiple viewpoints. In an effort to democratize access and to protect the students' privacy, the proposed tutor is based on small LLMs (Llama2 7B and 13B-parameter models) that are able to run locally on off-the-shelf hardware. We validate our approach in a battery of experiments consisting of interactions between a simulated student and the chatbot to evaluate its effectiveness in enhancing critical thinking skills. Results indicate that the Socratic tutor supports the development of reflection and critical thinking significantly better than standard chatbots. Our approach opens the door for improving educational outcomes by cultivating active learning and encouraging intellectual autonomy.
Published: 2024

4. Are Large Language Models Strategic Decision Makers? A Study of Performance and Bias in Two-Player Non-Zero-Sum Games

Author: Herr, Nathan, Acero, Fernando, Raileanu, Roberta, Pérez-Ortiz, María, and Li, Zhibin
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Computer Science and Game Theory
Abstract: Large Language Models (LLMs) have been increasingly used in real-world settings, yet their strategic decision-making abilities remain largely unexplored. To fully benefit from the potential of LLMs, it's essential to understand their ability to function in complex social scenarios. Game theory, which is already used to understand real-world interactions, provides a good framework for assessing these abilities. This work investigates the performance and merits of LLMs in canonical game-theoretic two-player non-zero-sum games, Stag Hunt and Prisoner Dilemma. Our structured evaluation of GPT-3.5, GPT-4-Turbo, GPT-4o, and Llama-3-8B shows that these models, when making decisions in these games, are affected by at least one of the following systematic biases: positional bias, payoff bias, or behavioural bias. This indicates that LLMs do not fully rely on logical reasoning when making these strategic decisions. As a result, it was found that the LLMs' performance drops when the game configuration is misaligned with the affecting biases. When misaligned, GPT-3.5, GPT-4-Turbo, GPT-4o, and Llama-3-8B show an average performance drop of 32\%, 25\%, 34\%, and 29\% respectively in Stag Hunt, and 28\%, 16\%, 34\%, and 24\% respectively in Prisoner's Dilemma. Surprisingly, GPT-4o (a top-performing LLM across standard benchmarks) suffers the most substantial performance drop, suggesting that newer models are not addressing these issues. Interestingly, we found that a commonly used method of improving the reasoning capabilities of LLMs, chain-of-thought (CoT) prompting, reduces the biases in GPT-3.5, GPT-4o, and Llama-3-8B but increases the effect of the bias in GPT-4-Turbo, indicating that CoT alone cannot fully serve as a robust solution to this problem. We perform several additional experiments, which provide further insight into these observed behaviours.
Published: 2024

5. The African Woman is Rhythmic and Soulful: An Investigation of Implicit Biases in LLM Open-ended Text Generation

Author: Lim, Serene and Pérez-Ortiz, María
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: This paper investigates the subtle and often concealed biases present in Large Language Models (LLMs), focusing on implicit biases that may remain despite passing explicit bias tests. Implicit biases are significant because they influence the decisions made by these systems, potentially perpetuating stereotypes and discrimination, even when LLMs appear to function fairly. Traditionally, explicit bias tests or embedding-based methods are employed to detect bias, but these approaches can overlook more nuanced, implicit forms of bias. To address this, we introduce two novel psychological-inspired methodologies: the LLM Implicit Association Test (IAT) Bias and the LLM Decision Bias, designed to reveal and measure implicit biases through prompt-based and decision-making tasks. Additionally, open-ended generation tasks with thematic analysis of word generations and storytelling provide qualitative insights into the model's behavior. Our findings demonstrate that the LLM IAT Bias correlates with traditional methods and more effectively predicts downstream behaviors, as measured by the LLM Decision Bias, offering a more comprehensive framework for detecting subtle biases in AI systems. This research advances the field of AI ethics by proposing new methods to continually assess and mitigate biases in LLMs, highlighting the importance of qualitative and decision-focused evaluations to address challenges that previous approaches have not fully captured.
Published: 2024

6. JobFair: A Framework for Benchmarking Gender Hiring Bias in Large Language Models

Author: Wang, Ze, Wu, Zekun, Guan, Xin, Thaler, Michael, Koshiyama, Adriano, Lu, Skylar, Beepath, Sachin, Ertekin Jr., Ediz, and Perez-Ortiz, Maria
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Computers and Society
Abstract: The use of Large Language Models (LLMs) in hiring has led to legislative actions to protect vulnerable demographic groups. This paper presents a novel framework for benchmarking hierarchical gender hiring bias in Large Language Models (LLMs) for resume scoring, revealing significant issues of reverse gender hiring bias and overdebiasing. Our contributions are fourfold: Firstly, we introduce a new construct grounded in labour economics, legal principles, and critiques of current bias benchmarks: hiring bias can be categorized into two types: Level bias (difference in the average outcomes between demographic counterfactual groups) and Spread bias (difference in the variance of outcomes between demographic counterfactual groups); Level bias can be further subdivided into statistical bias (i.e. changing with non-demographic content) and taste-based bias (i.e. consistent regardless of non-demographic content). Secondly, the framework includes rigorous statistical and computational hiring bias metrics, such as Rank After Scoring (RAS), Rank-based Impact Ratio, Permutation Test, and Fixed Effects Model. Thirdly, we analyze gender hiring biases in ten state-of-the-art LLMs. Seven out of ten LLMs show significant biases against males in at least one industry. An industry-effect regression reveals that the healthcare industry is the most biased against males. Moreover, we found that the bias performance remains invariant with resume content for eight out of ten LLMs. This indicates that the bias performance measured in this paper might apply to other resume datasets with different resume qualities. Fourthly, we provide a user-friendly demo and resume dataset to support the adoption and practical use of the framework, which can be generalized to other social traits and tasks., Comment: EMNLP 2024 Findings Paper
Published: 2024

7. The Effect of Model Size on LLM Post-hoc Explainability via LIME

Author: Heyen, Henning, Widdicombe, Amy, Siegel, Noah Y., Perez-Ortiz, Maria, and Treleaven, Philip
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Large language models (LLMs) are becoming bigger to boost performance. However, little is known about how explainability is affected by this trend. This work explores LIME explanations for DeBERTaV3 models of four different sizes on natural language inference (NLI) and zero-shot classification (ZSC) tasks. We evaluate the explanations based on their faithfulness to the models' internal decision processes and their plausibility, i.e. their agreement with human explanations. The key finding is that increased model size does not correlate with plausibility despite improved model performance, suggesting a misalignment between the LIME explanations and the models' internal processes as model size increases. Our results further suggest limitations regarding faithfulness metrics in NLI contexts., Comment: Published at ICLR 2024 Workshop on Secure and Trustworthy Large Language Models
Published: 2024

8. Curated Datasets and Neural Models for Machine Translation of Informal Registers between Mayan and Spanish Vernaculars

Author: Lou, Andrés, Pérez-Ortiz, Juan Antonio, Sánchez-Martínez, Felipe, and Sánchez-Cartagena, Víctor M.
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: The Mayan languages comprise a language family with an ancient history, millions of speakers, and immense cultural value, that, nevertheless, remains severely underrepresented in terms of resources and global exposure. In this paper we develop, curate, and publicly release a set of corpora in several Mayan languages spoken in Guatemala and Southern Mexico, which we call MayanV. The datasets are parallel with Spanish, the dominant language of the region, and are taken from official native sources focused on representing informal, day-to-day, and non-domain-specific language. As such, and according to our dialectometric analysis, they differ in register from most other available resources. Additionally, we present neural machine translation models, trained on as many resources and Mayan languages as possible, and evaluated exclusively on our datasets. We observe lexical divergences between the dialects of Spanish in our resources and the more widespread written standard of Spanish, and that resources other than the ones we present do not seem to improve translation performance, indicating that many such resources may not accurately capture common, real-life language usage. The MayanV dataset is available at https://github.com/transducens/mayanv., Comment: 13 pages, 3 figures, 8 tables, Submitted to NAACL 2024
Published: 2024

9. The Probabilities Also Matter: A More Faithful Metric for Faithfulness of Free-Text Explanations in Large Language Models

Author: Siegel, Noah Y., Camburu, Oana-Maria, Heess, Nicolas, and Perez-Ortiz, Maria
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: In order to oversee advanced AI systems, it is important to understand their underlying decision-making process. When prompted, large language models (LLMs) can provide natural language explanations or reasoning traces that sound plausible and receive high ratings from human annotators. However, it is unclear to what extent these explanations are faithful, i.e., truly capture the factors responsible for the model's predictions. In this work, we introduce Correlational Explanatory Faithfulness (CEF), a metric that can be used in faithfulness tests based on input interventions. Previous metrics used in such tests take into account only binary changes in the predictions. Our metric accounts for the total shift in the model's predicted label distribution, more accurately reflecting the explanations' faithfulness. We then introduce the Correlational Counterfactual Test (CCT) by instantiating CEF on the Counterfactual Test (CT) from Atanasova et al. (2023). We evaluate the faithfulness of free-text explanations generated by few-shot-prompted LLMs from the Llama2 family on three NLP tasks. We find that our metric measures aspects of faithfulness which the CT misses., Comment: To be published in ACL 2024. 19 pages, 2 figures
Published: 2024

10. Auditing Large Language Models for Enhanced Text-Based Stereotype Detection and Probing-Based Bias Evaluation

Author: Wu, Zekun, Bulathwela, Sahan, Perez-Ortiz, Maria, and Koshiyama, Adriano Soares
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Recent advancements in Large Language Models (LLMs) have significantly increased their presence in human-facing Artificial Intelligence (AI) applications. However, LLMs could reproduce and even exacerbate stereotypical outputs from training data. This work introduces the Multi-Grain Stereotype (MGS) dataset, encompassing 51,867 instances across gender, race, profession, religion, and stereotypical text, collected by fusing multiple previously publicly available stereotype detection datasets. We explore different machine learning approaches aimed at establishing baselines for stereotype detection, and fine-tune several language models of various architectures and model sizes, presenting in this work a series of stereotypes classifier models for English text trained on MGS. To understand whether our stereotype detectors capture relevant features (aligning with human common sense) we utilise a variety of explanainable AI tools, including SHAP, LIME, and BertViz, and analyse a series of example cases discussing the results. Finally, we develop a series of stereotype elicitation prompts and evaluate the presence of stereotypes in text generation tasks with popular LLMs, using one of our best performing previously presented stereotypes detectors. Our experiments yielded several key findings: i) Training stereotype detectors in a multi-dimension setting yields better results than training multiple single-dimension classifiers.ii) The integrated MGS Dataset enhances both the in-dataset and cross-dataset generalisation ability of stereotype detectors compared to using the datasets separately. iii) There is a reduction in stereotypes in the content generated by GPT Family LLMs with newer versions., Comment: Under reviewed as a conference paper at COLM 2024
Published: 2024

11. Non-Fluent Synthetic Target-Language Data Improve Neural Machine Translation

Author: Sánchez-Cartagena, Víctor M., Esplà-Gomis, Miquel, Pérez-Ortiz, Juan Antonio, and Sánchez-Martínez, Felipe
Subjects: Computer Science - Computation and Language
Abstract: When the amount of parallel sentences available to train a neural machine translation is scarce, a common practice is to generate new synthetic training samples from them. A number of approaches have been proposed to produce synthetic parallel sentences that are similar to those in the parallel data available. These approaches work under the assumption that non-fluent target-side synthetic training samples can be harmful and may deteriorate translation performance. Even so, in this paper we demonstrate that synthetic training samples with non-fluent target sentences can improve translation performance if they are used in a multilingual machine translation framework as if they were sentences in another language. We conducted experiments on ten low-resource and four high-resource translation tasks and found out that this simple approach consistently improves translation performance as compared to state-of-the-art methods for generating synthetic training samples similar to those found in corpora. Furthermore, this improvement is independent of the size of the original training corpus, the resulting systems are much more robust against domain shift and produce less hallucinations., Comment: arXiv admin note: text overlap with arXiv:2109.03645
Published: 2024
Full Text: View/download PDF

12. Understanding the effects of word-level linguistic annotations in under-resourced neural machine translation

Author: Sánchez-Cartagena, Víctor M., Pérez-Ortiz, Juan Antonio, and Sánchez-Martínez, Felipe
Subjects: Computer Science - Computation and Language
Abstract: This paper studies the effects of word-level linguistic annotations in under-resourced neural machine translation, for which there is incomplete evidence in the literature. The study covers eight language pairs, different training corpus sizes, two architectures, and three types of annotation: dummy tags (with no linguistic information at all), part-of-speech tags, and morpho-syntactic description tags, which consist of part of speech and morphological features. These linguistic annotations are interleaved in the input or output streams as a single tag placed before each word. In order to measure the performance under each scenario, we use automatic evaluation metrics and perform automatic error classification. Our experiments show that, in general, source-language annotations are helpful and morpho-syntactic descriptions outperform part of speech for some language pairs. On the contrary, when words are annotated in the target language, part-of-speech tags systematically outperform morpho-syntactic description tags in terms of automatic evaluation metrics, even though the use of morpho-syntactic description tags improves the grammaticality of the output. We provide a detailed analysis of the reasons behind this result., Comment: COLING 2020
Published: 2024
Full Text: View/download PDF

13. Anytime-Valid Tests of Group Invariance through Conformal Prediction

Author: Lardy, Tyron and Pérez-Ortiz, Muriel Felipe
Subjects: Statistics - Methodology, Mathematics - Statistics Theory
Abstract: We develop anytime-valid tests of invariance under the action of compact groups. The resulting test statistics are optimal in a logarithmic-growth sense. We apply our method to extend recent anytime-valid tests of independence and to construct tests of normality.
Published: 2024

14. A Toolbox for Modelling Engagement with Educational Videos

Author: Qiu, Yuxiang, Djemili, Karim, Elezi, Denis, Shalman, Aaneel, Pérez-Ortiz, María, Yilmaz, Emine, Shawe-Taylor, John, and Bulathwela, Sahan
Subjects: Computer Science - Computers and Society, Computer Science - Information Retrieval, Computer Science - Machine Learning, Statistics - Applications, H.3.3, J.1, I.2.0
Abstract: With the advancement and utility of Artificial Intelligence (AI), personalising education to a global population could be a cornerstone of new educational systems in the future. This work presents the PEEKC dataset and the TrueLearn Python library, which contains a dataset and a series of online learner state models that are essential to facilitate research on learner engagement modelling.TrueLearn family of models was designed following the "open learner" concept, using humanly-intuitive user representations. This family of scalable, online models also help end-users visualise the learner models, which may in the future facilitate user interaction with their models/recommenders. The extensive documentation and coding examples make the library highly accessible to both machine learning developers and educational data mining and learning analytics practitioners. The experiments show the utility of both the dataset and the library with predictive performance significantly exceeding comparative baseline models. The dataset contains a large amount of AI-related educational videos, which are of interest for building and validating AI-specific educational recommenders., Comment: In Proceedings of AAAI Conference on Artificial Intelligence 2024. arXiv admin note: text overlap with arXiv:2309.11527
Published: 2023

15. Can Reinforcement Learning support policy makers? A preliminary study with Integrated Assessment Models

Author: Wolf, Theodore, Nardelli, Nantas, Shawe-Taylor, John, and Perez-Ortiz, Maria
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computational Engineering, Finance, and Science
Abstract: Governments around the world aspire to ground decision-making on evidence. Many of the foundations of policy making - e.g. sensing patterns that relate to societal needs, developing evidence-based programs, forecasting potential outcomes of policy changes, and monitoring effectiveness of policy programs - have the potential to benefit from the use of large-scale datasets or simulations together with intelligent algorithms. These could, if designed and deployed in a way that is well grounded on scientific evidence, enable a more comprehensive, faster, and rigorous approach to policy making. Integrated Assessment Models (IAM) is a broad umbrella covering scientific models that attempt to link main features of society and economy with the biosphere into one modelling framework. At present, these systems are probed by policy makers and advisory groups in a hypothesis-driven manner. In this paper, we empirically demonstrate that modern Reinforcement Learning can be used to probe IAMs and explore the space of solutions in a more principled manner. While the implication of our results are modest since the environment is simplistic, we believe that this is a stepping stone towards more ambitious use cases, which could allow for effective exploration of policies and understanding of their consequences and limitations., Comment: Published at NeurIPS'23 Workshop on Tackling Climate Change with Machine Learning
Published: 2023

16. TrueLearn: A Python Library for Personalised Informational Recommendations with (Implicit) Feedback

Author: Qiu, Yuxiang, Djemili, Karim, Elezi, Denis, Shalman, Aaneel, Pérez-Ortiz, María, and Bulathwela, Sahan
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence, Computer Science - Computers and Society, Computer Science - Machine Learning, Statistics - Machine Learning, H.3.3, J.1, I.2.0
Abstract: This work describes the TrueLearn Python library, which contains a family of online learning Bayesian models for building educational (or more generally, informational) recommendation systems. This family of models was designed following the "open learner" concept, using humanly-intuitive user representations. For the sake of interpretability and putting the user in control, the TrueLearn library also contains different representations to help end-users visualise the learner models, which may in the future facilitate user interaction with their own models. Together with the library, we include a previously publicly released implicit feedback educational dataset with evaluation metrics to measure the performance of the models. The extensive documentation and coding examples make the library highly accessible to both machine learning developers and educational data mining and learning analytic practitioners. The library and the support documentation with examples are available at https://truelearn.readthedocs.io/en/latest., Comment: To be presented at the ORSUM workshop at RecSys 2023
Published: 2023

17. A retrospective review of LMNB1-related autosomal dominant leukodystrophy

Author: Judit M. Perez Ortiz, Karthik Muthusamy, W. Oliver Tobin, Ralitza Gavrilova, Margot A. Cousin, and Radhika Dhamija
Subjects: LMNB1-related autosomal dominant leukodystrophy, Neurogenetics, Specialties of internal medicine, RC581-951
Abstract: Abstract Introduction LMNB1-related autosomal dominant leukodystrophy (ADLD) is a slowly progressive neurodegenerative disorder caused by overexpression of LMNB1. We retrospectively reviewed charts of all ADLD patients seen at Mayo Clinic. Methods All available data from molecularly confirmed ADLD patients was reviewed. Results Of eight patients identified, three were male. Age at symptom onset ranged from 33 to 64 years. In males, the first symptom was erectile dysfunction (2/3) or neurogenic bladder (1/3) and, in females, weakness (3/5), bladder dysfunction (2/5), or depression (1/5). Diagnostic delay from symptom onset was a median of 6 (IQR 2.3–10) years. Other reported symptoms included cognitive difficulties (8/8), fatigue (7/8), sleep issues (4/8), mood disturbances (5/8), tremor (4/8), and migraine (4/8). Family history was positive in 6. All eight patients had LMNB1 duplication. Eighteen brain MRIs were reviewed from 7 patients. All showed symmetric confluent T2W deep cerebral and periventricular white matter hyperintensities with involvement of the posterior limb of the internal capsule, corpus callosum, corticospinal tract in brain stem, and superior and middle cerebellar peduncles. Seven spine MRIs from six patients showed moderate diffuse atrophy of the spinal cord. Conclusion Typical clinical symptoms and characteristic MRI changes should prompt genetic testing for ADLD.
Published: 2024
Full Text: View/download PDF

18. Exponential Stochastic Inequality

Author: Grünwald, Peter D., Pérez-Ortiz, Muriel F., and Mhammedi, Zakaria
Subjects: Mathematics - Statistics Theory
Abstract: We develop the concept of exponential stochastic inequality (ESI), a novel notation that simultaneously captures high-probability and in-expectation statements. It is especially well suited to succinctly state, prove, and reason about excess-risk and generalization bounds in statistical learning, specifically, but not restricted to, the PAC-Bayesian type. We show that the ESI satisfies transitivity and other properties which allow us to use it like standard, nonstochastic inequalities. We substantially extend the original definition from Koolen et al. (2016) and show that general ESIs satisfy a host of useful additional properties, including a novel Markov-like inequality. We show how ESIs relate to, and clarify, PAC-Bayesian bounds, subcentered subgamma random variables and *fast-rate conditions* such as the central and Bernstein conditions. We also show how the ideas can be extended to random scaling factors (learning rates).
Published: 2023

19. Comparing the carbon costs and benefits of low-resource solar nowcasting

Author: Dixon, Ben, Pérez-Ortiz, María, and Bieker, Jacob
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Solar PV yield nowcasting is used to help anticipate peaks and troughs in demand to support grid integration. This paper compares multiple low-resource approaches to nowcasting solar PV yield, using a dataset of UK satellite imagery and solar PV energy readings over a 1 to 4-hour time range. The paper also estimates the carbon emissions generated and averted by deploying models, and finds that even small models that could be deployable in low-resource settings may have a benefit several orders of magnitude greater than its carbon cost. The paper also examines prediction errors and the activations in a CNN.
Published: 2022

20. E-Statistics, Group Invariance and Anytime Valid Testing

Author: Pérez-Ortiz, Muriel Felipe, Lardy, Tyron, de Heide, Rianne, and Grünwald, Peter
Subjects: Mathematics - Statistics Theory, Statistics - Methodology
Abstract: We study worst-case-growth-rate-optimal (GROW) e-statistics for hypothesis testing between two group models. It is known that under a mild condition on the action of the underlying group G on the data, there exists a maximally invariant statistic. We show that among all e-statistics, invariant or not, the likelihood ratio of the maximally invariant statistic is GROW, both in the absolute and in the relative sense, and that an anytime-valid test can be based on it. The GROW e-statistic is equal to a Bayes factor with a right Haar prior on G. Our treatment avoids nonuniqueness issues that sometimes arise for such priors in Bayesian contexts. A crucial assumption on the group G is its amenability, a well-known group-theoretical condition, which holds, for instance, in scale-location families. Our results also apply to finite-dimensional linear regression., Comment: 30 pages. Major rewrite of previous version. Submitted to the Annals of Statistics
Published: 2022

21. Can Population-based Engagement Improve Personalisation? A Novel Dataset and Experiments

Author: Bulathwela, Sahan, Verma, Meghana, Perez-Ortiz, Maria, Yilmaz, Emine, and Shawe-Taylor, John
Subjects: Computer Science - Computers and Society, Computer Science - Artificial Intelligence, Computer Science - Digital Libraries, Statistics - Applications, Statistics - Machine Learning, H.3.3, J.1, I.2.0
Abstract: This work explores how population-based engagement prediction can address cold-start at scale in large learning resource collections. The paper introduces i) VLE, a novel dataset that consists of content and video based features extracted from publicly available scientific video lectures coupled with implicit and explicit signals related to learner engagement, ii) two standard tasks related to predicting and ranking context-agnostic engagement in video lectures with preliminary baselines and iii) a set of experiments that validate the usefulness of the proposed dataset. Our experimental results indicate that the newly proposed VLE dataset leads to building context-agnostic engagement prediction models that are significantly performant than ones based on previous datasets, mainly attributing to the increase of training examples. VLE dataset's suitability in building models towards Computer Science/ Artificial Intelligence education focused on e-learning/ MOOC use-cases is also evidenced. Further experiments in combining the built model with a personalising algorithm show promising improvements in addressing the cold-start problem encountered in educational recommenders. This is the largest and most diverse publicly available dataset to our knowledge that deals with learner engagement prediction tasks. The dataset, helper tools, descriptive statistics and example code snippets are available publicly., Comment: To be presented at International Conference for Educational Data Mining 2022
Published: 2022

22. Watch Less and Uncover More: Could Navigation Tools Help Users Search and Explore Videos?

Author: Perez-Ortiz, Maria, Bulathwela, Sahan, Dormann, Claire, Verma, Meghana, Kreitmayer, Stefan, Noss, Richard, Shawe-Taylor, John, Rogers, Yvonne, and Yilmaz, Emine
Subjects: Computer Science - Information Retrieval, Computer Science - Human-Computer Interaction
Abstract: Prior research has shown how 'content preview tools' improve speed and accuracy of user relevance judgements across different information retrieval tasks. This paper describes a novel user interface tool, the Content Flow Bar, designed to allow users to quickly identify relevant fragments within informational videos to facilitate browsing, through a cognitively augmented form of navigation. It achieves this by providing semantic "snippets" that enable the user to rapidly scan through video content. The tool provides visually-appealing pop-ups that appear in a time series bar at the bottom of each video, allowing to see in advance and at a glance how topics evolve in the content. We conducted a user study to evaluate how the tool changes the users search experience in video retrieval, as well as how it supports exploration and information seeking. The user questionnaire revealed that participants found the Content Flow Bar helpful and enjoyable for finding relevant information in videos. The interaction logs of the user study, where participants interacted with the tool for completing two informational tasks, showed that it holds promise for enhancing discoverability of content both across and within videos. This discovered potential could leverage a new generation of navigation tools in search and information retrieval., Comment: Published at the ACM SIGIR Conference on Human Information Interaction and Retrieval (CHIIR'22)
Published: 2022

23. Semantic TrueLearn: Using Semantic Knowledge Graphs in Recommendation Systems

Author: Bulathwela, Sahan, Pérez-Ortiz, María, Yilmaz, Emine, and Shawe-Taylor, John
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence, Computer Science - Computers and Society, Statistics - Applications, Statistics - Machine Learning, H.3.3, J.1, I.2.0
Abstract: In informational recommenders, many challenges arise from the need to handle the semantic and hierarchical structure between knowledge areas. This work aims to advance towards building a state-aware educational recommendation system that incorporates semantic relatedness between knowledge topics, propagating latent information across semantically related topics. We introduce a novel learner model that exploits this semantic relatedness between knowledge components in learning resources using the Wikipedia link graph, with the aim to better predict learner engagement and latent knowledge in a lifelong learning scenario. In this sense, Semantic TrueLearn builds a humanly intuitive knowledge representation while leveraging Bayesian machine learning to improve the predictive performance of the educational engagement. Our experiments with a large dataset demonstrate that this new semantic version of TrueLearn algorithm achieves statistically significant improvements in terms of predictive performance with a simple extension that adds semantic awareness to the model., Comment: Presented at the First International Workshop on Joint Use of Probabilistic Graphical Models and Ontology at Conference on Knowledge Graph and Semantic Web 2021
Published: 2021

24. Could AI Democratise Education? Socio-Technical Imaginaries of an EdTech Revolution

Author: Bulathwela, Sahan, Pérez-Ortiz, María, Holloway, Catherine, and Shawe-Taylor, John
Subjects: Computer Science - Computers and Society, Computer Science - Artificial Intelligence, Statistics - Machine Learning, K.3.1
Abstract: Artificial Intelligence (AI) in Education has been said to have the potential for building more personalised curricula, as well as democratising education worldwide and creating a Renaissance of new ways of teaching and learning. Millions of students are already starting to benefit from the use of these technologies, but millions more around the world are not. If this trend continues, the first delivery of AI in Education could be greater educational inequality, along with a global misallocation of educational resources motivated by the current technological determinism narrative. In this paper, we focus on speculating and posing questions around the future of AI in Education, with the aim of starting the pressing conversation that would set the right foundations for the new generation of education that is permeated by technology. This paper starts by synthesising how AI might change how we learn and teach, focusing specifically on the case of personalised learning companions, and then move to discuss some socio-technical features that will be crucial for avoiding the perils of these AI systems worldwide (and perhaps ensuring their success). This paper also discusses the potential of using AI together with free, participatory and democratic resources, such as Wikipedia, Open Educational Resources and open-source tools. We also emphasise the need for collectively designing human-centered, transparent, interactive and collaborative AI-based algorithms that empower and give complete agency to stakeholders, as well as support new emerging pedagogies. Finally, we ask what would it take for this educational revolution to provide egalitarian and empowering access to education, beyond any political, cultural, language, geographical and learning ability barriers., Comment: To be presented at Workshop on Machine Learning for the Developing World (ML4D) at the Conference on Neural Information Processing Systems 2021
Published: 2021

25. An AI-based Learning Companion Promoting Lifelong Learning Opportunities for All

Author: Perez-Ortiz, Maria, Novak, Erik, Bulathwela, Sahan, and Shawe-Taylor, John
Subjects: Computer Science - Computers and Society, Computer Science - Artificial Intelligence
Abstract: Artifical Intelligence (AI) in Education has great potential for building more personalised curricula, as well as democratising education worldwide and creating a Renaissance of new ways of teaching and learning. We believe this is a crucial moment for setting the foundations of AI in education in the beginning of this Fourth Industrial Revolution. This report aims to synthesize how AI might change (and is already changing) how we learn, as well as what technological features are crucial for these AI systems in education, with the end goal of starting this pressing dialogue of how the future of AI in education should unfold, engaging policy makers, engineers, researchers and obviously, teachers and learners. This report also presents the advances within the X5GON project, a European H2020 project aimed at building and deploying a cross-modal, cross-lingual, cross-cultural, cross-domain and cross-site personalised learning platform for Open Educational Resources (OER)., Comment: Published as an Opinion Report from the International Research Centre on Artificial Intelligence under the auspices of UNESCO
Published: 2021

26. Progress in Self-Certified Neural Networks

Author: Perez-Ortiz, Maria, Rivasplata, Omar, Parrado-Hernandez, Emilio, Guedj, Benjamin, and Shawe-Taylor, John
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: A learning method is self-certified if it uses all available data to simultaneously learn a predictor and certify its quality with a tight statistical certificate that is valid on unseen data. Recent work has shown that neural network models trained by optimising PAC-Bayes bounds lead not only to accurate predictors, but also to tight risk certificates, bearing promise towards achieving self-certified learning. In this context, learning and certification strategies based on PAC-Bayes bounds are especially attractive due to their ability to leverage all data to learn a posterior and simultaneously certify its risk with a tight numerical certificate. In this paper, we assess the progress towards self-certification in probabilistic neural networks learnt by PAC-Bayes inspired objectives. We empirically compare (on 4 classification datasets) classical test set bounds for deterministic predictors and a PAC-Bayes bound for randomised self-certified predictors. We first show that both of these generalisation bounds are not too far from out-of-sample test set errors. We then show that in data starvation regimes, holding out data for the test set bounds adversely affects generalisation performance, while self-certified strategies based on PAC-Bayes bounds do not suffer from this drawback, proving that they might be a suitable choice for the small data regime. We also find that probabilistic neural networks learnt by PAC-Bayes inspired objectives lead to certificates that can be surprisingly competitive with commonly used test set bounds.
Published: 2021

27. Learning PAC-Bayes Priors for Probabilistic Neural Networks

Author: Perez-Ortiz, Maria, Rivasplata, Omar, Guedj, Benjamin, Gleeson, Matthew, Zhang, Jingyu, Shawe-Taylor, John, Bober, Miroslaw, and Kittler, Josef
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent works have investigated deep learning models trained by optimising PAC-Bayes bounds, with priors that are learnt on subsets of the data. This combination has been shown to lead not only to accurate classifiers, but also to remarkably tight risk certificates, bearing promise towards self-certified learning (i.e. use all the data to learn a predictor and certify its quality). In this work, we empirically investigate the role of the prior. We experiment on 6 datasets with different strategies and amounts of data to learn data-dependent PAC-Bayes priors, and we compare them in terms of their effect on test performance of the learnt predictors and tightness of their risk certificate. We ask what is the optimal amount of data which should be allocated for building the prior and show that the optimum may be dataset dependent. We demonstrate that using a small percentage of the prior-building data for validation of the prior leads to promising results. We include a comparison of underparameterised and overparameterised models, along with an empirical study of different training objectives and regularisation strategies to learn the prior distribution.
Published: 2021

28. Rethinking Data Augmentation for Low-Resource Neural Machine Translation: A Multi-Task Learning Approach

Author: Sánchez-Cartagena, Víctor M., Esplà-Gomis, Miquel, Pérez-Ortiz, Juan Antonio, and Sánchez-Martínez, Felipe
Subjects: Computer Science - Computation and Language
Abstract: In the context of neural machine translation, data augmentation (DA) techniques may be used for generating additional training samples when the available parallel data are scarce. Many DA approaches aim at expanding the support of the empirical data distribution by generating new sentence pairs that contain infrequent words, thus making it closer to the true data distribution of parallel sentences. In this paper, we propose to follow a completely different approach and present a multi-task DA approach in which we generate new sentence pairs with transformations, such as reversing the order of the target sentence, which produce unfluent target sentences. During training, these augmented sentences are used as auxiliary tasks in a multi-task framework with the aim of providing new contexts where the target prefix is not informative enough to predict the next word. This strengthens the encoder and forces the decoder to pay more attention to the source representations of the encoder. Experiments carried out on six low-resource translation tasks show consistent improvements over the baseline and over DA methods aiming at extending the support of the empirical data distribution. The systems trained with our approach rely more on the source tokens, are more robust against domain shift and suffer less hallucinations., Comment: To be published as long paper in EMNLP 2021
Published: 2021

29. PEEK: A Large Dataset of Learner Engagement with Educational Videos

Author: Bulathwela, Sahan, Perez-Ortiz, Maria, Novak, Erik, Yilmaz, Emine, and Shawe-Taylor, John
Subjects: Computer Science - Information Retrieval, Computer Science - Computers and Society, Computer Science - Machine Learning, H.3.3, J.1, I.2.0, H.3.3, J.1, I.2.0
Abstract: Educational recommenders have received much less attention in comparison to e-commerce and entertainment-related recommenders, even though efficient intelligent tutors have great potential to improve learning gains. One of the main challenges in advancing this research direction is the scarcity of large, publicly available datasets. In this work, we release a large, novel dataset of learners engaging with educational videos in-the-wild. The dataset, named Personalised Educational Engagement with Knowledge Topics PEEK, is the first publicly available dataset of this nature. The video lectures have been associated with Wikipedia concepts related to the material of the lecture, thus providing a humanly intuitive taxonomy. We believe that granular learner engagement signals in unison with rich content representations will pave the way to building powerful personalization algorithms that will revolutionise educational and informational recommendation systems. Towards this goal, we 1) construct a novel dataset from a popular video lecture repository, 2) identify a set of benchmark algorithms to model engagement, and 3) run extensive experimentation on the PEEK dataset to demonstrate its value. Our experiments with the dataset show promise in building powerful informational recommender systems. The dataset and the support code is available publicly., Comment: To be published at ORSUM '21: 4th Workshop on Online Recommender Systems and User Modeling at ACM RecSys 2021
Published: 2021

30. Transfer and zero-shot learning for scalable weed detection and classification in UAV images

Author: Belissent, Nicolas, Peña, José M., Mesías-Ruiz, Gustavo A., Shawe-Taylor, John, and Pérez-Ortiz, María
Published: 2024
Full Text: View/download PDF

31. Seroprevalence for measles among healthcare workers in Madrid, Spain

Author: Cuerda, A.de la, González, M.I.Tejeda, López Aparicio, A., Monfort Vinuesa, C., López Pedraza, M.J., Isidoro, B., Mayordomo-Cava, J., Barberán, L.C., Collazos, J., Pérez-Ortiz, J.M., and Barberán, J.
Published: 2024
Full Text: View/download PDF

32. Consolidated Dataset and Metrics for High-Dynamic-Range Image Quality

Author: Mikhailiuk, Aliaksei, Perez-Ortiz, Maria, Yue, Dingcheng, Suen, Wilson, and Mantiuk, Rafal K.
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Increasing popularity of high-dynamic-range (HDR) image and video content brings the need for metrics that could predict the severity of image impairments as seen on displays of different brightness levels and dynamic range. Such metrics should be trained and validated on a sufficiently large subjective image quality dataset to ensure robust performance. As the existing HDR quality datasets are limited in size, we created a Unified Photometric Image Quality dataset (UPIQ) with over 4,000 images by realigning and merging existing HDR and standard-dynamic-range (SDR) datasets. The realigned quality scores share the same unified quality scale across all datasets. Such realignment was achieved by collecting additional cross-dataset quality comparisons and re-scaling data with a psychometric scaling method. Images in the proposed dataset are represented in absolute photometric and colorimetric units, corresponding to light emitted from a display. We use the new dataset to retrain existing HDR metrics and show that the dataset is sufficiently large for training deep architectures. We show the utility of the dataset on brightness aware image compression.
Published: 2020
Full Text: View/download PDF

33. A PAC-Bayesian Perspective on Structured Prediction with Implicit Loss Embeddings

Author: Cantelobre, Théophile, Guedj, Benjamin, Pérez-Ortiz, María, and Shawe-Taylor, John
Subjects: Computer Science - Machine Learning, Mathematics - Statistics Theory, Statistics - Machine Learning
Abstract: Many practical machine learning tasks can be framed as Structured prediction problems, where several output variables are predicted and considered interdependent. Recent theoretical advances in structured prediction have focused on obtaining fast rates convergence guarantees, especially in the Implicit Loss Embedding (ILE) framework. PAC-Bayes has gained interest recently for its capacity of producing tight risk bounds for predictor distributions. This work proposes a novel PAC-Bayes perspective on the ILE Structured prediction framework. We present two generalization bounds, on the risk and excess risk, which yield insights into the behavior of ILE predictors. Two learning algorithms are derived from these bounds. The algorithms are implemented and their behavior analyzed, with source code available at \url{https://github.com/theophilec/PAC-Bayes-ILE-Structured-Prediction}., Comment: 38 pages
Published: 2020

34. Fetoscopic endoluminal tracheal occlusion vs expectant management for fetuses with severe left-sided congenital diaphragmatic hernia

Author: Dütemeyer, Vivien, Schaible, Thomas, Badr, Dominique A., Cordier, Anne-Gael, Weis, Meike, Perez-Ortiz, Alba, Carriere, Diane, Cannie, Mieke M., Vuckovic, Aline, Persico, Nicola, Cavallaro, Giacomo, Benachi, Alexandra, and Jani, Jacques C.
Published: 2024
Full Text: View/download PDF

35. Neurological manifestations in PMM2-congenital disorders of glycosylation (PMM2-CDG): Insights into clinico-radiological characteristics, recommendations for follow-up, and future directions

Author: Muthusamy, Karthik, Perez-Ortiz, Judit M., Ligezka, Anna N., Altassan, Ruqaiah, Johnsen, Christin, Schultz, Matthew J., Patterson, Marc C., and Morava, Eva
Published: 2024
Full Text: View/download PDF

36. The Anytime-Valid Logrank Test: Error Control Under Continuous Monitoring with Unlimited Horizon

Author: ter Schure, J., Perez-Ortiz, M. F., Ly, A., and Grunwald, P.
Subjects: Statistics - Methodology, Mathematics - Statistics Theory
Abstract: We introduce the anytime-valid (AV) logrank test, a version of the logrank test that provides type-I error guarantees under optional stopping and optional continuation. The test is sequential without the need to specify a maximum sample size or stopping rule, and allows for cumulative meta-analysis with type-I error control. The method can be extended to define anytime-valid confidence intervals. The logrank test is an instance of the martingale tests based on E-variables that have been recently developed. We demonstrate type-I error guarantees for the test in a semiparametric setting of proportional hazards and show how to extend it to ties, Cox' regression and confidence sequences. Using a Gaussian approximation on the logrank statistic, we show that the AV logrank test (which itself is always exact) has a similar rejection region to O'Brien-Fleming alpha-spending but with the potential to achieve 100% power by optional continuation. Although our approach to study design requires a larger sample size, the *expected* sample size is competitive by optional stopping.
Published: 2020

37. VLEngagement: A Dataset of Scientific Video Lectures for Evaluating Population-based Engagement

Author: Bulathwela, Sahan, Perez-Ortiz, Maria, Yilmaz, Emine, and Shawe-Taylor, John
Subjects: Computer Science - Computers and Society, Computer Science - Information Retrieval, Computer Science - Machine Learning, Statistics - Machine Learning, K.3.1, H.3.1
Abstract: With the emergence of e-learning and personalised education, the production and distribution of digital educational resources have boomed. Video lectures have now become one of the primary modalities to impart knowledge to masses in the current digital age. The rapid creation of video lecture content challenges the currently established human-centred moderation and quality assurance pipeline, demanding for more efficient, scalable and automatic solutions for managing learning resources. Although a few datasets related to engagement with educational videos exist, there is still an important need for data and research aimed at understanding learner engagement with scientific video lectures. This paper introduces VLEngagement, a novel dataset that consists of content-based and video-specific features extracted from publicly available scientific video lectures and several metrics related to user engagement. We introduce several novel tasks related to predicting and understanding context-agnostic engagement in video lectures, providing preliminary baselines. This is the largest and most diverse publicly available dataset to our knowledge that deals with such tasks. The extraction of Wikipedia topic-based features also allows associating more sophisticated Wikipedia based features to the dataset to improve the performance in these tasks. The dataset, helper tools and example code snippets are available publicly at https://github.com/sahanbull/context-agnostic-engagement
Published: 2020

38. Tighter risk certificates for neural networks

Author: Pérez-Ortiz, María, Rivasplata, Omar, Shawe-Taylor, John, and Szepesvári, Csaba
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition, Statistics - Machine Learning
Abstract: This paper presents an empirical study regarding training probabilistic neural networks using training objectives derived from PAC-Bayes bounds. In the context of probabilistic neural networks, the output of training is a probability distribution over network weights. We present two training objectives, used here for the first time in connection with training neural networks. These two training objectives are derived from tight PAC-Bayes bounds. We also re-implement a previously used training objective based on a classical PAC-Bayes bound, to compare the properties of the predictors learned using the different training objectives. We compute risk certificates for the learnt predictors, based on part of the data used to learn the predictors. We further experiment with different types of priors on the weights (both data-free and data-dependent priors) and neural network architectures. Our experiments on MNIST and CIFAR-10 show that our training methods produce competitive test set errors and non-vacuous risk bounds with much tighter values than previous results in the literature, showing promise not only to guide the learning algorithm through bounding the risk but also for model selection. These observations suggest that the methods studied here might be good candidates for self-certified learning, in the sense of using the whole data set for learning a predictor and certifying its risk on any unseen data (from the same distribution as the training data) potentially without the need for holding out test data., Comment: New version includes: i) experiment showing the potential of the risk certificate for neural architecture search (Fig. 2); ii) experiments spanning uncertainty quantification and analysis of prior/posterior (Section 7.8); iii) an outline of the strengths of probabilistic neural networks trained by PBB (Section 7.9) and iv) a strengthened discussion on the connection to Bayesian learning
Published: 2020

39. Predicting Engagement in Video Lectures

Author: Bulathwela, Sahan, Pérez-Ortiz, María, Lipani, Aldo, Yilmaz, Emine, and Shawe-Taylor, John
Subjects: Computer Science - Computers and Society, Computer Science - Artificial Intelligence, Computer Science - Human-Computer Interaction, K.3.1, H.3.1
Abstract: The explosion of Open Educational Resources (OERs) in the recent years creates the demand for scalable, automatic approaches to process and evaluate OERs, with the end goal of identifying and recommending the most suitable educational materials for learners. We focus on building models to find the characteristics and features involved in context-agnostic engagement (i.e. population-based), a seldom researched topic compared to other contextualised and personalised approaches that focus more on individual learner engagement. Learner engagement, is arguably a more reliable measure than popularity/number of views, is more abundant than user ratings and has also been shown to be a crucial component in achieving learning outcomes. In this work, we explore the idea of building a predictive model for population-based engagement in education. We introduce a novel, large dataset of video lectures for predicting context-agnostic engagement and propose both cross-modal and modality-specific feature sets to achieve this task. We further test different strategies for quantifying learner engagement signals. We demonstrate the use of our approach in the case of data scarcity. Additionally, we perform a sensitivity analysis of the best performing model, which shows promising performance and can be easily integrated into an educational recommender system for OERs., Comment: In Proceedings of International Conference on Educational Data Mining 2020
Published: 2020

40. Active Sampling for Pairwise Comparisons via Approximate Message Passing and Information Gain Maximization

Author: Mikhailiuk, Aliaksei, Wilmot, Clifford, Perez-Ortiz, Maria, Yue, Dingcheng, and Mantiuk, Rafal
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Pairwise comparison data arise in many domains with subjective assessment experiments, for example in image and video quality assessment. In these experiments observers are asked to express a preference between two conditions. However, many pairwise comparison protocols require a large number of comparisons to infer accurate scores, which may be unfeasible when each comparison is time-consuming (e.g. videos) or expensive (e.g. medical imaging). This motivates the use of an active sampling algorithm that chooses only the most informative pairs for comparison. In this paper we propose ASAP, an active sampling algorithm based on approximate message passing and expected information gain maximization. Unlike most existing methods, which rely on partial updates of the posterior distribution, we are able to perform full updates and therefore much improve the accuracy of the inferred scores. The algorithm relies on three techniques for reducing computational cost: inference based on approximate message passing, selective evaluations of the information gain, and selecting pairs in a batch that forms a minimum spanning tree of the inverse of information gain. We demonstrate, with real and synthetic data, that ASAP offers the highest accuracy of inferred scores compared to the existing methods. We also provide an open-source GPU implementation of ASAP for large-scale experiments.
Published: 2020

41. Learning synchronous context-free grammars with multiple specialised non-terminals for hierarchical phrase-based translation

Author: Sánchez-Martínez, Felipe, Pérez-Ortiz, Juan Antonio, and Carrasco, Rafael C.
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Translation models based on hierarchical phrase-based statistical machine translation (HSMT) have shown better performances than the non-hierarchical phrase-based counterparts for some language pairs. The standard approach to HSMT learns and apply a synchronous context-free grammar with a single non-terminal. The hypothesis behind the grammar refinement algorithm presented in this work is that this single non-terminal is overloaded, and insufficiently discriminative, and therefore, an adequate split of it into more specialised symbols could lead to improved models. This paper presents a method to learn synchronous context-free grammars with a huge number of initial non-terminals, which are then grouped via a clustering algorithm. Our experiments show that the resulting smaller set of non-terminals correctly capture the contextual information that makes it possible to statistically significantly improve the BLEU score of the standard HSMT approach.
Published: 2020

42. Towards an Integrative Educational Recommender for Lifelong Learners

Author: Bulathwela, Sahan, Perez-Ortiz, Maria, Yilmaz, Emine, and Shawe-Taylor, John
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Statistics - Machine Learning, H.3.3, J.1, I.2.0, H.3.3, J.1, I.2.0
Abstract: One of the most ambitious use cases of computer-assisted learning is to build a recommendation system for lifelong learning. Most recommender algorithms exploit similarities between content and users, overseeing the necessity to leverage sensible learning trajectories for the learner. Lifelong learning thus presents unique challenges, requiring scalable and transparent models that can account for learner knowledge and content novelty simultaneously, while also retaining accurate learners representations for long periods of time. We attempt to build a novel educational recommender, that relies on an integrative approach combining multiple drivers of learners engagement. Our first step towards this goal is TrueLearn, which models content novelty and background knowledge of learners and achieves promising performance while retaining a human interpretable learner model., Comment: In Proceedings of AAAI Conference on Artificial Intelligence 2020
Published: 2019

43. Investigating and exploiting the late stage functionalisation of nonribosomal peptide antibiotics

Author: Perez Ortiz, Gustavo, Barry, Sarah Marie, and Isaacson, Rivka Leah
Subjects: 615.7
Published: 2020

44. TrueLearn: A Family of Bayesian Algorithms to Match Lifelong Learners to Open Educational Resources

Author: Bulathwela, Sahan, Perez-Ortiz, Maria, Yilmaz, Emine, and Shawe-Taylor, John
Subjects: Computer Science - Artificial Intelligence, Computer Science - Information Retrieval, Computer Science - Machine Learning, Statistics - Applications, Statistics - Machine Learning, H.3.3, J.1, I.2.0
Abstract: The recent advances in computer-assisted learning systems and the availability of open educational resources today promise a pathway to providing cost-efficient, high-quality education to large masses of learners. One of the most ambitious use cases of computer-assisted learning is to build a lifelong learning recommendation system. Unlike short-term courses, lifelong learning presents unique challenges, requiring sophisticated recommendation models that account for a wide range of factors such as background knowledge of learners or novelty of the material while effectively maintaining knowledge states of masses of learners for significantly longer periods of time (ideally, a lifetime). This work presents the foundations towards building a dynamic, scalable and transparent recommendation system for education, modelling learner's knowledge from implicit data in the form of engagement with open educational resources. We i) use a text ontology based on Wikipedia to automatically extract knowledge components of educational resources and, ii) propose a set of online Bayesian strategies inspired by the well-known areas of item response theory and knowledge tracing. Our proposal, TrueLearn, focuses on recommendations for which the learner has enough background knowledge (so they are able to understand and learn from the material), and the material has enough novelty that would help the learner improve their knowledge about the subject and keep them engaged. We further construct a large open educational video lectures dataset and test the performance of the proposed algorithms, which show clear promise towards building an effective educational recommendation system., Comment: In Proceedings of AAAI Conference on Artificial Intelligence 2020
Published: 2019

45. A mixture of experts model for predicting persistent weather patterns

Author: Perez-Ortiz, Maria, Gutierrez, Pedro A., Tino, Peter, Casanova-Mateo, Carlos, and Salcedo-Sanz, Sancho
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: Weather and atmospheric patterns are often persistent. The simplest weather forecasting method is the so-called persistence model, which assumes that the future state of a system will be similar (or equal) to the present state. Machine learning (ML) models are widely used in different weather forecasting applications, but they need to be compared to the persistence model to analyse whether they provide a competitive solution to the problem at hand. In this paper, we devise a new model for predicting low-visibility in airports using the concepts of mixture of experts. Visibility level is coded as two different ordered categorical variables: cloud height and runway visual height. The underlying system in this application is stagnant approximately in 90% of the cases, and standard ML models fail to improve on the performance of the persistence model. Because of this, instead of trying to simply beat the persistence model using ML, we use this persistence as a baseline and learn an ordinal neural network model that refines its results by focusing on learning weather fluctuations. The results show that the proposal outperforms persistence and other ordinal autoregressive models, especially for longer time horizon predictions and for the runway visual height variable., Comment: Published in IEEE International Joint Conference on Neural Networks (IJCNN) 2018
Published: 2019

46. Exploiting Synthetically Generated Data with Semi-Supervised Learning for Small and Imbalanced Datasets

Author: Perez-Ortiz, Maria, Tino, Peter, Mantiuk, Rafal, and Hervas-Martinez, Cesar
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: Data augmentation is rapidly gaining attention in machine learning. Synthetic data can be generated by simple transformations or through the data distribution. In the latter case, the main challenge is to estimate the label associated to new synthetic patterns. This paper studies the effect of generating synthetic data by convex combination of patterns and the use of these as unsupervised information in a semi-supervised learning framework with support vector machines, avoiding thus the need to label synthetic examples. We perform experiments on a total of 53 binary classification datasets. Our results show that this type of data over-sampling supports the well-known cluster assumption in semi-supervised learning, showing outstanding results for small high-dimensional datasets and imbalanced learning problems., Comment: Published in the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Published: 2019

47. Evaluation of Treatment Option for Demodicosis in Patients Undergoing Cataract Surgery

Author: Claudia Palacio Pastrana, M.D., M.Sc., Samuel J. Avalos Lara, M.D., Hector Perez Cano, Ph.D., Bani Antonio-Aguirre, M.D., M.P.H.c., Cristina Mendoza-Velasquez, M.D., M.Sc., Azyadeh Camacho-Ordóñez, M.D., M.Sc.c., Andric C. Perez-Ortiz, M.D., M.P.H., and Andric C Perez-Ortiz, M.D., M.P.H., Clinical Professor
Published: 2020

48. Aeroallergen immunotherapy associated with reduced risk of severe COVID-19 in 1095 allergic patients

Author: Larenas-Linnemann, Désirée, Morfin-Maciel, Blanca María, Bedolla-Barajas, Martín, López-Bago, Ana, Navarrete Rodríguez, Elsy Maureen, Mogica-Martínez, María Dolores, Gereda, José E., Sarrazola Sanjuan, Mauricio, Cano Pedroza, Rosa Yazmín, Cavallo, María Cecilia, Romero Tapia, Sergio de Jesús, Jossen, Roberto A., Fuentes Pérez, José Miguel, Del Rio Navarro, Blanca E., Rodríguez Zagal, Erendira, Piraino Sosa, Pedro A., Huerta Villalobos, Yunuen Rocío, Chavez-Vereau, Pierre, García Imperial, Daniel Alberto, Olivares Gómez, Margarita, Valle Rodríguez, Francisco, Zuñiga Reyes, Carlos Omar, Rodríguez-González, Mónica, Gallego Corella, Claudia Ivonne, Ivancevich, Juan Carlos, García Cruz, María de la Luz Hortencia, Repka-Ramirez, María Susana, Flores Morales, Mauricio Ernesto, Fernández De Córdova Aguirre, Juan Carlos, Luna-Pech, Jorge A., Rivero Yeverino, Daniela, Martínez Guzmán, Edgar, Pérez Ortiz, Cinthia Elizabeth, and Villa Médica, Leonor
Published: 2023
Full Text: View/download PDF

49. Preclinical safety study of a gemcitabine-controlled release polymer for locoregional administration in pancreatic cancer

Author: Rodríguez-Martínez, Marta, García, María Teresa, Pérez-Ortiz, José Manuel, González, Lucia, Gómez-Alonso, Sergio, Lara, Irene Álvarez, Gracia, Ignacio, Rodríguez, Juan Francisco, Redondo-Calvo, Javier, García-Santos, Esther, and Padilla-Valverde, David
Published: 2023
Full Text: View/download PDF

50. Pharmacogenetics of taxane‐induced neurotoxicity in breast cancer: Systematic review and meta‐analysis

Author: Alberto Guijosa, Ana Freyria, Jose Rodrigo Espinosa‐Fernandez, Francisco J. Estrada‐Mena, Ana Sofía Armenta‐Quiroga, Maria Fernanda Ortega‐Treviño, Rodrigo Catalán, Bani Antonio‐Aguirre, Cynthia Villarreal‐Garza, and Andric C. Perez‐Ortiz
Subjects: Therapeutics. Pharmacology, RM1-950, Public aspects of medicine, RA1-1270
Abstract: Abstract Taxane‐based chemotherapy regimens are used as first‐line treatment for breast cancer. Neurotoxicity, mainly taxane‐induced peripheral neuropathy (TIPN), remains the most important dose‐limiting adverse event. Multiple genes may be associated with TIPN; however, the strength and direction of the association remain unclear. For this reason, we systematically reviewed observational studies of TIPN pharmacogenetic markers in breast cancer treatment. We conducted a systematic search of terms alluding to breast cancer, genetic markers, taxanes, and neurotoxicity in Ovid, ProQuest, PubMed, Scopus, Virtual Health, and Web of Science. We assessed the quality of evidence and bias profile. We extracted relevant variables and effect measures. Whenever possible, we performed random‐effects gene meta‐analyses and examined interstudy heterogeneity with meta‐regression models and subgroup analyses. This study follows the Preferred Reporting Items for Systematic Reviews and Meta‐Analyses (PRISMA) and STrengthening the REporting of Genetic Association Studies (STREGA) reporting guidance. A total of 42 studies with 19,431 participants were included. These evaluated 262 single‐nucleotide polymorphisms (SNPs) across 121 genes. We conducted meta‐analyses on 23 genes with 60 SNPs (19 studies and 6246 participants). Thirteen individual SNPs (ABCB1‐rs2032582, ABCB1‐rs3213619, BCL6/‐rs1903216, /CAND1‐rs17781082, CYP1B1‐rs1056836, CYP2C8‐rs10509681, CYP2C8‐rs11572080, EPHA5‐rs7349683, EPHA6‐rs301927, FZD3‐rs7001034, GSTP1‐rs1138272, TUBB2A‐rs9501929, and XKR4‐rs4737264) and the overall SNPs' effect in four genes (CYP3A4, EphA5, GSTP1, and SLCO1B1) were statistically significantly associated with TIPN through meta‐analysis. In conclusion, through systematic review and meta‐analysis, we found that polymorphisms, and particularly 13 SNPs, are associated with TIPN, suggesting that genetics does play a role in interindividual predisposition. Further studies could potentially use these findings to develop individual risk profiles and guide decision making.
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

612 results on '"Perez-Ortiz A"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources