Author: "Tlaie, Alejandro" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Tlaie, Alejandro"' showing total 11 results

1. Using AI Alignment Theory to understand the potential pitfalls of regulatory frameworks

Author: Tlaie, Alejandro
Subjects: Computer Science - Computers and Society, Computer Science - Artificial Intelligence
Abstract: This paper leverages insights from Alignment Theory (AT) research, which primarily focuses on the potential pitfalls of technical alignment in Artificial Intelligence, to critically examine the European Union's Artificial Intelligence Act (EU AI Act). In the context of AT research, several key failure modes - such as proxy gaming, goal drift, reward hacking or specification gaming - have been identified. These can arise when AI systems are not properly aligned with their intended objectives. The central logic of this report is: what can we learn if we treat regulatory efforts in the same way as we treat advanced AI systems? As we systematically apply these concepts to the EU AI Act, we uncover potential vulnerabilities and areas for improvement in the regulation.
Published: 2024

2. Exploring and steering the moral compass of Large Language Models

Author: Tlaie, Alejandro
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Large Language Models (LLMs) have become central to advancing automation and decision-making across various sectors, raising significant ethical questions. This study proposes a comprehensive comparative analysis of the most advanced LLMs to assess their moral profiles. We subjected several state-of-the-art models to a selection of ethical dilemmas and found that all the proprietary ones are mostly utilitarian and all of the open-weights ones align mostly with values-based ethics. Furthermore, when using the Moral Foundations Questionnaire, all models we probed - except for Llama 2-7B - displayed a strong liberal bias. Lastly, in order to causally intervene in one of the studied models, we propose a novel similarity-specific activation steering technique. Using this method, we were able to reliably steer the model's moral compass to different ethical schools. All of these results showcase that there is an ethical dimension in already deployed LLMs, an aspect that is generally overlooked.
Published: 2024

3. Thoughtful faces: inferring internal states across species using facial features

Author: Tlaie, Alejandro, primary, Hay, Muad Abd El, additional, Mert, Berkutay, additional, Taylor, Robert, additional, Ferracci, Pierre-Antoine, additional, Shapcott, Katharine, additional, Glukhova, Mina, additional, Pillow, Jonathan, additional, Havenith, Martha, additional, and Schölvinck, Marieke, additional
Published: 2024
Full Text: View/download PDF

4. Diverging roles of TRPV1 and TRPM2 in warm-temperature detection

Author: El Hay, Muad Y. Abd, primary, Kamm, Gretel B., additional, Tlaie, Alejandro, additional, and Siemens, Jan, additional
Published: 2024
Full Text: View/download PDF

5. What does the mean mean? A simple test for neuroscience.

Author: Tlaie, Alejandro, Shapcott, Katharine, van der Plas, Thijs L., Rowland, James, Lees, Robert, Keeling, Joshua, Packer, Adam, Tiesinga, Paul, Schölvinck, Marieke L., and Havenith, Martha N.
Subjects: *SOMATOSENSORY cortex, *NEURAL codes, *NEUROSCIENCES, *RESEARCH personnel, *NEUROSCIENTISTS
Abstract: Trial-averaged metrics, e.g. tuning curves or population response vectors, are a ubiquitous way of characterizing neuronal activity. But how relevant are such trial-averaged responses to neuronal computation itself? Here we present a simple test to estimate whether average responses reflect aspects of neuronal activity that contribute to neuronal processing. The test probes two assumptions implicitly made whenever average metrics are treated as meaningful representations of neuronal activity: Reliability: Neuronal responses repeat consistently enough across trials that they convey a recognizable reflection of the average response to downstream regions. Behavioural relevance: If a single-trial response is more similar to the average template, it is more likely to evoke correct behavioural responses. We apply this test to two data sets: (1) Two-photon recordings in primary somatosensory cortices (S1 and S2) of mice trained to detect optogenetic stimulation in S1; and (2) Electrophysiological recordings from 71 brain areas in mice performing a contrast discrimination task. Under the highly controlled settings of Data set 1, both assumptions were largely fulfilled. In contrast, the less restrictive paradigm of Data set 2 met neither assumption. Simulations predict that the larger diversity of neuronal response preferences, rather than higher cross-trial reliability, drives the better performance of Data set 1. We conclude that when behaviour is less tightly restricted, average responses do not seem particularly relevant to neuronal computation, potentially because information is encoded more dynamically. Most importantly, we encourage researchers to apply this simple test of computational relevance whenever using trial-averaged neuronal metrics, in order to gauge how representative cross-trial averages are in a given context. Author summary: Neuronal activity is highly dynamic—our brain never responds to the same situation in exactly the same way. 
How do we extract information from such dynamic signals? The classical answer is: averaging neuronal activity across repetitions of the same stimulus to detect its consistent aspects. This logic is widespread; it is hard to find a neuroscience study that does not contain averages. But how well do averages represent the computations that happen in the brain moment by moment? We developed a simple test that probes two assumptions implicit in averaging: Reliability: Neuronal responses repeat consistently enough across stimulus repetitions that the average remains recognizable. Behavioural relevance: Neuronal responses that are more similar to the average are more likely to evoke correct behaviour. We apply this test to two example data sets featuring population recordings in mice performing perceptual tasks. We show that both assumptions were largely fulfilled in the first data set, but not in the second, suggesting that the relevance of averaging varies across contexts, e.g. due to experimental control levels and neuronal diversity. Most importantly, we encourage neuroscientists to use our test to gauge whether averages reflect informative aspects of neuronal activity in their data.
Published: 2024
Full Text: View/download PDF

6. An information-theoretic quantification of the content of communication between brain regions

Author: Celotto, Marco, primary, Bím, Jan, additional, Tlaie, Alejandro, additional, De Feo, Vito, additional, Lemke, Stefan, additional, Chicharro, Daniel, additional, Nili, Hamed, additional, Bieler, Malte, additional, Hanganu-Opatz, Ileana L., additional, Donner, Tobias H., additional, Brovelli, Andrea, additional, and Panzeri, Stefano, additional
Published: 2023
Full Text: View/download PDF

7. An information-theoretic quantification of the content of communication between brain regions

Author: Celotto, Marco, Bím, Jan, Tlaie, Alejandro, De Feo, Vito, Lemke, Stefan, Chicharro, Daniel, Nili, Hamed, Bieler, Malte, Hanganu-Opatz, Ileana L., Donner, Tobias H., Brovelli, Andrea, and Panzeri, Stefano
Subjects: Article
Abstract: Quantifying the amount, content and direction of communication between brain regions is key to understanding brain function. Traditional methods to analyze brain activity based on the Wiener-Granger causality principle quantify the overall information propagated by neural activity between simultaneously recorded brain regions, but do not reveal the information flow about specific features of interest (such as sensory stimuli). Here, we develop a new information-theoretic measure termed Feature-specific Information Transfer (FIT), quantifying how much information about a specific feature flows between two regions. FIT merges the Wiener-Granger causality principle with information-content specificity. We first derive FIT and prove analytically its key properties. We then illustrate and test them with simulations of neural activity, demonstrating that FIT identifies, within the total information flowing between regions, the information that is transmitted about specific features. We then analyze three neural datasets obtained with different recording methods, magneto- and electro-encephalography, and spiking activity, to demonstrate the ability of FIT to uncover the content and direction of information flow between brain regions beyond what can be discerned with traditional analytical methods. FIT can improve our understanding of how brain regions communicate by uncovering previously hidden feature-specific information flow.
Published: 2023

8. Does the brain average? A simple test

Author: Tlaie, Alejandro, Shapcott, Katharine, Van Der Plas, Thijs, Packer, Adam, Tiesinga, Paul, Schölvinck, Marieke, and Havenith, Martha N.
Subjects: Computational Neuroscience, Sensory processing and perception
Abstract: Bernstein Conference 2022 abstract. http://bernstein-conference.de
Published: 2022
Full Text: View/download PDF

9. An Information-theoretic Quantification of the Content of Information Flow Across Neurons and Brain Areas

Author: Celotto, Marco, Tlaie, Alejandro, Bím, Jan, De Feo, Vito, Chicharro, Daniel, Bieler, Malte, Hanganu-Opatz, Ileana, Donner, Tobias H., Brovelli, Andrea, and Panzeri, Stefano
Subjects: Computational Neuroscience, Data analysis, machine learning, neuroinformatics, Computer Science::Programming Languages, Mathematics::Representation Theory, Quantitative Biology::Genomics
Abstract: Bernstein Conference 2021 abstract. http://bernstein-conference.de
Published: 2021
Full Text: View/download PDF

10. Does the brain care about averages? A simple test.

Author: Tlaie, Alejandro, primary, Shapcott, Katharine A, additional, van der Plas, Thijs, additional, Rowland, James M, additional, Lees, Robert, additional, Keeling, Joshua, additional, Packer, Adam, additional, Tiesinga, Paul, additional, Schölvinck, Marieke, additional, and Havenith, Martha N, additional
Published: 2021
Full Text: View/download PDF

11. An information-theoretic quantification of the content of communication between brain regions.

Author: Celotto M, Bím J, Tlaie A, De Feo V, Lemke S, Chicharro D, Nili H, Bieler M, Hanganu-Opatz IL, Donner TH, Brovelli A, and Panzeri S
Abstract: Quantifying the amount, content and direction of communication between brain regions is key to understanding brain function. Traditional methods to analyze brain activity based on the Wiener-Granger causality principle quantify the overall information propagated by neural activity between simultaneously recorded brain regions, but do not reveal the information flow about specific features of interest (such as sensory stimuli). Here, we develop a new information-theoretic measure termed Feature-specific Information Transfer (FIT), quantifying how much information about a specific feature flows between two regions. FIT merges the Wiener-Granger causality principle with information-content specificity. We first derive FIT and prove analytically its key properties. We then illustrate and test them with simulations of neural activity, demonstrating that FIT identifies, within the total information flowing between regions, the information that is transmitted about specific features. We then analyze three neural datasets obtained with different recording methods, magneto- and electro-encephalography, and spiking activity, to demonstrate the ability of FIT to uncover the content and direction of information flow between brain regions beyond what can be discerned with traditional analytical methods. FIT can improve our understanding of how brain regions communicate by uncovering previously hidden feature-specific information flow.
Published: 2023
Full Text: View/download PDF
