Author: "Shlizerman, Eli" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Shlizerman, Eli"' showing total 204 results

1. Tell What You Hear From What You See -- Video to Audio Generation Through Text

Author: Liu, Xiulong, Su, Kun, and Shlizerman, Eli
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: The content of visual and audio scenes is multi-faceted such that a video can be paired with various audio and vice-versa. Therefore, in the video-to-audio generation task, it is imperative to introduce steering approaches for controlling the generated audio. While Video-to-Audio generation is a well-established generative task, existing methods lack such controllability. In this work, we propose VATT, a multi-modal generative framework that takes a video and an optional text prompt as input, and generates audio and an optional textual description of the audio. Such a framework has two advantages: i) the Video-to-Audio generation process can be refined and controlled via text which complements the context of visual information, and ii) the model can suggest what audio to generate for the video by generating audio captions. VATT consists of two key modules: VATT Converter, an LLM that is fine-tuned for instructions and includes a projection layer that maps video features to the LLM vector space; and VATT Audio, a transformer that generates audio tokens from visual frames and from an optional text prompt using iterative parallel decoding. The audio tokens are converted to a waveform by a pretrained neural codec. Experiments show that when VATT is compared to existing video-to-audio generation methods in objective metrics, it achieves competitive performance when the audio caption is not provided. When the audio caption is provided as a prompt, VATT achieves even more refined performance (lowest KLD score of 1.41). Furthermore, subjective studies show that audio generated by VATT Audio is preferred over audio generated by existing methods. VATT enables controllable video-to-audio generation through text as well as suggesting text prompts for videos through audio captions, unlocking novel applications such as text-guided video-to-audio generation and video-to-audio captioning.
Comment: NeurIPS 2024
Published: 2024

2. Transferable polychromatic optical encoder for neural networks

Author: Choi, Minho, Xiang, Jinlin, Wirth-Singh, Anna, Baek, Seung-Hwan, Shlizerman, Eli, and Majumdar, Arka
Subjects: Computer Science - Computer Vision and Pattern Recognition, Physics - Optics
Abstract: Artificial neural networks (ANNs) have fundamentally transformed the field of computer vision, providing unprecedented performance. However, these ANNs for image processing demand substantial computational resources, often hindering real-time operation. In this paper, we demonstrate an optical encoder that can perform convolution simultaneously in three color channels during the image capture, effectively implementing several initial convolutional layers of an ANN. Such an optical encoding results in a ~24,000-fold reduction in computational operations, with a state-of-the-art classification accuracy (~73.2%) in a free-space optical system. In addition, our analog optical encoder, trained for CIFAR-10 data, can be transferred to the ImageNet subset, High-10, without any modifications, and still exhibits moderate accuracy. Our results evidence the potential of hybrid optical/digital computer vision systems in which the optical frontend can pre-process an ambient scene to reduce the energy and latency of the whole computer vision system.
Comment: 21 pages, 4 figures, 2 tables
Published: 2024

3. CaloChallenge 2022: A Community Challenge for Fast Calorimeter Simulation

Author: Krause, Claudius, Giannelli, Michele Faucci, Kasieczka, Gregor, Nachman, Benjamin, Salamani, Dalila, Shih, David, Zaborowska, Anna, Amram, Oz, Borras, Kerstin, Buckley, Matthew R., Buhmann, Erik, Buss, Thorsten, Cardoso, Renato Paulo Da Costa, Caterini, Anthony L., Chernyavskaya, Nadezda, Corchia, Federico A. G., Cresswell, Jesse C., Diefenbacher, Sascha, Dreyer, Etienne, Ekambaram, Vijay, Eren, Engin, Ernst, Florian, Favaro, Luigi, Franchini, Matteo, Gaede, Frank, Gross, Eilam, Hsu, Shih-Chieh, Jaruskova, Kristina, Käch, Benno, Kalagnanam, Jayant, Kansal, Raghav, Kim, Taewoo, Kobylianskii, Dmitrii, Korol, Anatolii, Korcari, William, Krücker, Dirk, Krüger, Katja, Letizia, Marco, Li, Shu, Liu, Qibin, Liu, Xiulong, Loaiza-Ganem, Gabriel, Madula, Thandikire, McKeown, Peter, Melzer-Pellmann, Isabell-A., Mikuni, Vinicius, Nguyen, Nam, Ore, Ayodele, Schweitzer, Sofia Palacios, Pang, Ian, Pedro, Kevin, Plehn, Tilman, Pokorski, Witold, Qu, Huilin, Raikwar, Piyush, Raine, John A., Reyes-Gonzalez, Humberto, Rinaldi, Lorenzo, Ross, Brendan Leigh, Scham, Moritz A. W., Schnake, Simon, Shimmin, Chase, Shlizerman, Eli, Soybelman, Nathalie, Srivatsa, Mudhakar, Tsolaki, Kalliopi, Vallecorsa, Sofia, Yeo, Kyongmin, and Zhang, Rui
Subjects: Computer Science - Machine Learning, High Energy Physics - Experiment, High Energy Physics - Phenomenology, Physics - Instrumentation and Detectors
Abstract: We present the results of the "Fast Calorimeter Simulation Challenge 2022" - the CaloChallenge. We study state-of-the-art generative models on four calorimeter shower datasets of increasing dimensionality, ranging from a few hundred voxels to a few tens of thousands of voxels. The 31 individual submissions span a wide range of current popular generative architectures, including Variational AutoEncoders (VAEs), Generative Adversarial Networks (GANs), Normalizing Flows, Diffusion models, and models based on Conditional Flow Matching. We compare all submissions in terms of quality of generated calorimeter showers, as well as shower generation time and model size. To assess the quality we use a broad range of different metrics including differences in 1-dimensional histograms of observables, KPD/FPD scores, AUCs of binary classifiers, and the log-posterior of a multiclass classifier. The results of the CaloChallenge provide the most complete and comprehensive survey of cutting-edge approaches to calorimeter fast simulation to date. In addition, our work provides a uniquely detailed perspective on the important problem of how to evaluate generative models. As such, the results presented here should be applicable for other domains that use generative AI and require fast and faithful generation of samples in a large phase space.
Comment: 204 pages, 100+ figures, 30+ tables
Published: 2024

4. From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation

Author: Su, Kun, Liu, Xiulong, and Shlizerman, Eli
Subjects: Computer Science - Multimedia, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Video encompasses both visual and auditory data, creating a perceptually rich experience where these two modalities complement each other. As such, videos are a valuable type of media for the investigation of the interplay between audio and visual elements. Previous studies of audio-visual modalities primarily focused on either audio-visual representation learning or generative modeling of a modality conditioned on the other, creating a disconnect between these two branches. A unified framework that learns representation and generates modalities has not been developed yet. In this work, we introduce a novel framework called Vision to Audio and Beyond (VAB) to bridge the gap between audio-visual representation learning and vision-to-audio generation. The key approach of VAB is that rather than working with raw video frames and audio data, VAB performs representation learning and generative modeling within latent spaces. In particular, VAB uses a pre-trained audio tokenizer and an image encoder to obtain audio tokens and visual features, respectively. It then performs the pre-training task of visual-conditioned masked audio token prediction. This training strategy enables the model to engage in contextual learning and simultaneous video-to-audio generation. After the pre-training phase, VAB employs the iterative-decoding approach to rapidly generate audio tokens conditioned on visual features. Since VAB is a unified model, its backbone can be fine-tuned for various audio-visual downstream tasks. Our experiments showcase the efficiency of VAB in producing high-quality audio from video, and its capability to acquire semantic audio-visual features, leading to competitive results in audio-visual retrieval and classification.
Comment: Accepted by ICML 2024
Published: 2024

5. Calo-VQ: Vector-Quantized Two-Stage Generative Model in Calorimeter Simulation

Author: Liu, Qibin, Shimmin, Chase, Liu, Xiulong, Shlizerman, Eli, Li, Shu, and Hsu, Shih-Chieh
Subjects: Physics - Instrumentation and Detectors, Computer Science - Machine Learning, High Energy Physics - Phenomenology
Abstract: We introduce a novel machine learning method developed for the fast simulation of calorimeter detector response, adapting the vector-quantized variational autoencoder (VQ-VAE). Our model adopts a two-stage generation strategy: initially compressing geometry-aware calorimeter data into a discrete latent space, followed by the application of a sequence model to learn and generate the latent tokens. Extensive experimentation on the Calo-challenge dataset underscores the efficiency of our approach, showcasing a remarkable improvement in the generation speed compared with conventional methods by a factor of 2000. Remarkably, our model achieves the generation of calorimeter showers within milliseconds. Furthermore, comprehensive quantitative evaluations across various metrics are performed to validate the physics performance of the generation.
Published: 2024

6. Compressed Meta-Optical Encoder for Image Classification

Author: Wirth-Singh, Anna, Xiang, Jinlin, Choi, Minho, Fröch, Johannes E., Huang, Luocheng, Colburn, Shane, Shlizerman, Eli, and Majumdar, Arka
Subjects: Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing, Physics - Optics
Abstract: Optical and hybrid convolutional neural networks (CNNs) have recently attracted increasing interest as a way to achieve low-latency, low-power image classification and computer vision tasks. However, implementing optical nonlinearity is challenging, and omitting the nonlinear layers in a standard CNN comes at a significant reduction in accuracy. In this work, we use knowledge distillation to compress a modified AlexNet to a single linear convolutional layer and an electronic backend (two fully connected layers). We obtain comparable performance to a purely electronic CNN with five convolutional layers and three fully connected layers. We implement the convolution optically via engineering the point spread function of an inverse-designed meta-optic. Using this hybrid approach, we estimate a reduction in multiply-accumulate operations from 17M in a conventional electronic modified AlexNet to only 86K in the hybrid compressed network enabled by the optical frontend. This constitutes over two orders of magnitude reduction in latency and power consumption. Furthermore, we experimentally demonstrate that the classification accuracy of the system exceeds 93% on the MNIST dataset.
Published: 2024

7. Lyapunov-guided representation of recurrent neural network performance

Author: Vogt, Ryan, Zheng, Yang, and Shlizerman, Eli
Published: 2024

8. Evolutionary algorithms as an alternative to backpropagation for supervised training of Biophysical Neural Networks and Neural ODEs

Author: Hazelden, James, Liu, Yuhan Helena, Shlizerman, Eli, and Shea-Brown, Eric
Subjects: Quantitative Biology - Neurons and Cognition, Computer Science - Neural and Evolutionary Computing
Abstract: Training networks consisting of biophysically accurate neuron models could allow for new insights into how brain circuits can organize and solve tasks. We begin by analyzing the extent to which the central algorithm for neural network learning -- stochastic gradient descent through backpropagation (BP) -- can be used to train such networks. We find that properties of biophysically based neural network models needed for accurate modelling, such as stiffness, high nonlinearity and long evaluation timeframes relative to spike times, make BP unstable and divergent in a variety of cases. To address these instabilities and inspired by recent work, we investigate the use of "gradient-estimating" evolutionary algorithms (EAs) for training biophysically based neural networks. We find that EAs have several advantages making them desirable over direct BP, including being forward-pass only, robust to noisy and rigid losses, allowing for discrete loss formulations, and potentially facilitating a more global exploration of parameters. We apply our method to train a recurrent network of Morris-Lecar neuron models on a stimulus integration and working memory task, and show how it can succeed in cases where direct BP is inapplicable. To expand on the viability of EAs in general, we apply them to a general neural ODE problem and a stiff neural ODE benchmark and find again that EAs can out-perform direct BP here, especially for the over-parameterized regime. Our findings suggest that biophysical neurons could provide useful benchmarks for testing the limits of BP-adjacent methods, and demonstrate the viability of EAs for training networks with complex components.
Published: 2023

9. Learning Time-Invariant Representations for Individual Neurons from Population Dynamics

Author: Mi, Lu, Le, Trung, He, Tianxing, Shlizerman, Eli, and Sümbül, Uygar
Subjects: Quantitative Biology - Neurons and Cognition, Computer Science - Machine Learning
Abstract: Neurons can display highly variable dynamics. While such variability presumably supports the wide range of behaviors generated by the organism, their gene expressions are relatively stable in the adult brain. This suggests that neuronal activity is a combination of its time-invariant identity and the inputs the neuron receives from the rest of the circuit. Here, we propose a self-supervised learning based method to assign time-invariant representations to individual neurons based on a permutation- and population-size-invariant summary of population recordings. We fit dynamical models to neuronal activity to learn a representation by considering the activity of both the individual and the neighboring population. Our self-supervised approach and use of implicit representations enable robust inference against imperfections such as partial overlap of neurons across sessions, trial-to-trial variability, and limited availability of molecular (transcriptomic) labels for downstream supervised tasks. We demonstrate our method on a public multimodal dataset of mouse cortical neuronal activity and transcriptomic labels. We report > 35% improvement in predicting the transcriptomic subclass identity and > 20% improvement in predicting class identity with respect to the state-of-the-art.
Comment: Accepted at NeurIPS 2023
Published: 2023

10. The time is ripe to reverse engineer an entire nervous system: simulating behavior from neural interactions

Author: Haspel, Gal, Baker, Ben, Beets, Isabel, Boyden, Edward S, Brown, Jeffrey, Church, George, Cohen, Netta, Colon-Ramos, Daniel, Dyer, Eva, Fang-Yen, Christopher, Flavell, Steven, Goodman, Miriam B, Hart, Anne C, Izquierdo, Eduardo J, Kagias, Konstantinos, Lockery, Shawn, Lu, Yangning, Marblestone, Adam, Matelsky, Jordan, Mensh, Brett, Pereira, Talmo D, Pfister, Hanspeter, Rajan, Kanaka, Rotstein, Horacio G, Scholz, Monika, Shaevitz, Joshua W., Shlizerman, Eli, Simeon, Quilee, Skuhersky, Michael A, Tiruvadi, Vineet, Venkatachalam, Vivek, Wei, Donglai, Wester, Brock, Yang, Guangyu Robert, Yemini, Eviatar, Zimmer, Manuel, and Kording, Konrad P
Subjects: Quantitative Biology - Neurons and Cognition
Abstract: Just like electrical engineers understand how microprocessors execute programs in terms of how transistor currents are affected by their inputs, neuroscientists want to understand behavior production in terms of how neuronal outputs are affected by their inputs and internal states. This dependency of neuronal outputs on inputs can be described by a state-dependent input-output (IO)-function. However, to reliably identify these IO-functions, we need to perturb each input and combinations of inputs while observing all the outputs. Here, we argue that such completeness is possible in C. elegans; a complete description that goes all the way from the activity of every neuron to predict behavior. The established and growing toolkit of optophysiology can non-invasively capture and control every neuron's activity and scale to countless experiments. The information from many such experiments can be pooled while capturing the inter-individual variability because neuronal identity and function are largely conserved across individuals. Just like electrical engineers use transistor IO-functions to simulate program execution, we argue that neuronal IO-functions could be used to simulate the impressive breadth of brain states and behaviors of C. elegans.
Comment: 28 pages, 2 figures, opinion paper
Published: 2023

11. Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos

Author: Su, Kun, Qian, Kaizhi, Shlizerman, Eli, Torralba, Antonio, and Gan, Chuang
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Modeling sounds emitted from physical object interactions is critical for immersive perceptual experiences in real and virtual worlds. Traditional methods of impact sound synthesis use physics simulation to obtain a set of physics parameters that could represent and synthesize the sound. However, they require fine details of both the object geometries and impact locations, which are rarely available in the real world and cannot be applied to synthesize impact sounds from common videos. On the other hand, existing video-driven deep learning-based approaches could only capture the weak correspondence between visual content and impact sounds since they lack physics knowledge. In this work, we propose a physics-driven diffusion model that can synthesize high-fidelity impact sound for a silent video clip. In addition to the video content, we propose to use additional physics priors to guide the impact sound synthesis procedure. The physics priors include both physics parameters that are directly estimated from noisy real-world impact sound examples without sophisticated setup and learned residual parameters that interpret the sound environment via neural networks. We further implement a novel diffusion model with specific training and inference strategies to combine physics priors and visual information for impact sound synthesis. Experimental results show that our model outperforms several existing systems in generating realistic impact sounds. More importantly, the physics-based representations are fully interpretable and transparent, thus enabling us to perform sound editing flexibly.
Comment: CVPR 2023. Project page: https://sukun1045.github.io/video-physics-sound-diffusion/
Published: 2023

12. Correction: Lyapunov-guided representation of recurrent neural network performance

Author: Vogt, Ryan, Zheng, Yang, and Shlizerman, Eli
Published: 2024

13. TKIL: Tangent Kernel Approach for Class Balanced Incremental Learning

Author: Xiang, Jinlin and Shlizerman, Eli
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Quantitative Biology - Neurons and Cognition, Statistics - Machine Learning
Abstract: When learning new tasks in a sequential manner, deep neural networks tend to forget tasks that they previously learned, a phenomenon called catastrophic forgetting. Class incremental learning methods aim to address this problem by keeping a memory of a few exemplars from previously learned tasks, and distilling knowledge from them. However, existing methods struggle to balance the performance across classes since they typically overfit the model to the latest task. In our work, we propose to address these challenges with the introduction of a novel methodology of Tangent Kernel for Incremental Learning (TKIL) that achieves class-balanced performance. The approach preserves the representations across classes and balances the accuracy for each class, and as such achieves better overall accuracy and variance. The TKIL approach is based on the Neural Tangent Kernel (NTK), which describes the convergence behavior of neural networks as a kernel function in the limit of infinite width. In TKIL, the gradients between feature layers are treated as the distance between the representations of these layers and can be defined as Gradients Tangent Kernel loss (GTK loss) such that it is minimized along with averaging weights. This allows TKIL to automatically identify the task and to quickly adapt to it during inference. Experiments on CIFAR-100 and ImageNet datasets with various incremental learning settings show that these strategies allow TKIL to outperform existing state-of-the-art methods.
Published: 2022

14. STNDT: Modeling Neural Population Activity with a Spatiotemporal Transformer

Author: Le, Trung and Shlizerman, Eli
Subjects: Quantitative Biology - Neurons and Cognition, Computer Science - Machine Learning
Abstract: Modeling neural population dynamics underlying noisy single-trial spiking activities is essential for relating neural observation and behavior. A recent non-recurrent method - Neural Data Transformers (NDT) - has shown great success in capturing neural dynamics with low inference latency without an explicit dynamical model. However, NDT focuses on modeling the temporal evolution of the population activity while neglecting the rich covariation between individual neurons. In this paper we introduce SpatioTemporal Neural Data Transformer (STNDT), an NDT-based architecture that explicitly models responses of individual neurons in the population across time and space to uncover their underlying firing rates. In addition, we propose a contrastive learning loss that works in accordance with the mask modeling objective to further improve the predictive performance. We show that our model achieves state-of-the-art performance on the ensemble level in estimating neural activities across four neural datasets, demonstrating its capability to capture autonomous and non-autonomous dynamics spanning different cortical regions while being completely agnostic to the specific behaviors at hand. Furthermore, the STNDT spatial attention mechanism reveals consistently important subsets of neurons that play a vital role in driving the response of the entire population, providing interpretability and key insights into how the population of neurons performs computation.
Published: 2022

15. Lyapunov-Guided Representation of Recurrent Neural Network Performance

Author: Vogt, Ryan, Zheng, Yang, and Shlizerman, Eli
Subjects: Computer Science - Machine Learning, Mathematics - Dynamical Systems, Nonlinear Sciences - Chaotic Dynamics, Statistics - Machine Learning
Abstract: Recurrent Neural Networks (RNN) are ubiquitous computing systems for sequences and multivariate time series data. While several robust architectures of RNN are known, it is unclear how to relate RNN initialization, architecture, and other hyperparameters with accuracy for a given task. In this work, we propose to treat RNN as dynamical systems and to correlate hyperparameters with accuracy through Lyapunov spectral analysis, a methodology specifically designed for nonlinear dynamical systems. To address the fact that RNN features go beyond the existing Lyapunov spectral analysis, we propose to infer relevant features from the Lyapunov spectrum with an Autoencoder and an embedding of its latent representation (AeLLE). Our studies of various RNN architectures show that AeLLE successfully correlates RNN Lyapunov spectrum with accuracy. Furthermore, the latent representation learned by AeLLE is generalizable to novel inputs from the same task and is formed early in the process of RNN training. The latter property allows for the prediction of the accuracy to which RNN would converge when training is complete. We conclude that representation of RNN through Lyapunov spectrum along with AeLLE provides a novel method for organization and interpretation of variants of RNN architectures.
Comment: 26 pages, 7 figures, 4 tables
Published: 2022

16. Statistical Perspective on Functional and Causal Neural Connectomics: The Time-Aware PC Algorithm

Author: Biswas, Rahul and Shlizerman, Eli
Subjects: Quantitative Biology - Neurons and Cognition, Quantitative Biology - Quantitative Methods, Statistics - Applications
Abstract: The representation of the flow of information between neurons in the brain based on their activity is termed the causal functional connectome. Such representation incorporates the dynamic nature of neuronal activity and causal interactions between them. In contrast to the connectome, the causal functional connectome is not directly observed and needs to be inferred from neural time series. A popular statistical framework for inferring causal connectivity from observations is directed probabilistic graphical modeling. Its common formulation is not suitable for neural time series since it was developed for variables with independent and identically distributed static samples. In this work, we propose to model and estimate the causal functional connectivity from neural time series using a novel approach that adapts directed probabilistic graphical modeling to the time series scenario. In particular, we develop the Time-Aware PC (TPC) algorithm for estimating the causal functional connectivity, which adapts the PC algorithm, a state-of-the-art method for statistical causal inference. We show that the model outcome of TPC has the properties of reflecting causality of neural interactions, such as being non-parametric, exhibiting the directed Markov property in a time-series setting, and being predictive of the consequence of counterfactual interventions on the time series. We demonstrate the utility of the methodology to obtain the causal functional connectome for several datasets including simulations, benchmark datasets, and recent multi-array electro-physiological recordings from the mouse visual cortex.
Published: 2022

17. Statistical Perspective on Functional and Causal Neural Connectomics: A Comparative Study

Author: Biswas, Rahul and Shlizerman, Eli
Subjects: Quantitative Biology - Neurons and Cognition, Quantitative Biology - Quantitative Methods, Statistics - Applications
Abstract: Representation of brain network interactions is fundamental to the translation of neural structure to brain function. As such, methodologies for mapping neural interactions into structural models, i.e., inference of functional connectome from neural recordings, are key for the study of brain networks. While multiple approaches have been proposed for functional connectomics based on statistical associations between neural activity, association does not necessarily incorporate causation. Additional approaches have been proposed to incorporate aspects of causality to turn functional connectomes into causal functional connectomes; however, these methodologies typically focus on specific aspects of causality. This warrants a systematic statistical framework for causal functional connectomics that defines the foundations of common aspects of causality. Such a framework can assist in contrasting existing approaches and guide the development of further causal methodologies. In this work, we develop such a statistical guide. In particular, we consolidate the notions of associations and representations of neural interaction, i.e., types of neural connectomics, and then describe causal modeling in the statistics literature. We particularly focus on the introduction of directed Markov graphical models as a framework through which we define the Directed Markov Property -- an essential criterion for examining the causality of proposed functional connectomes. We demonstrate how, based on these notions, a comparative study of several existing approaches for finding causal functional connectivity from neural activity can be conducted. We proceed by providing an outlook ahead regarding the additional properties that future approaches could include to thoroughly address causality.
Published: 2021

18. Knowledge Distillation Circumvents Nonlinearity for Optical Convolutional Neural Networks

Author: Xiang, Jinlin, Colburn, Shane, Majumdar, Arka, and Shlizerman, Eli
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Emerging Technologies, Computer Science - Machine Learning
Abstract: In recent years, Convolutional Neural Networks (CNNs) have enabled ubiquitous image processing applications. As such, CNNs require fast runtime (forward propagation) to process high-resolution visual streams in real time. This is still a challenging task even with state-of-the-art graphics and tensor processing units. The bottleneck in computational efficiency primarily occurs in the convolutional layers. Performing operations in the Fourier domain is a promising way to accelerate forward propagation since it transforms convolutions into elementwise multiplications, which are considerably faster to compute for large kernels. Furthermore, such computation could be implemented using an optical 4f system with orders of magnitude faster operation. However, a major challenge in using this spectral approach, as well as in an optical implementation of CNNs, is the inclusion of a nonlinearity between each convolutional layer, without which CNN performance drops dramatically. Here, we propose a Spectral CNN Linear Counterpart (SCLC) network architecture and develop a Knowledge Distillation (KD) approach to circumvent the need for a nonlinearity and successfully train such networks. While the KD approach is known in machine learning as an effective process for network pruning, we adapt the approach to transfer the knowledge from a nonlinear network (teacher) to a linear counterpart (student). We show that the KD approach can achieve performance that easily surpasses the standard linear version of a CNN and could approach the performance of the nonlinear network. Our simulations show that the possibility of increasing the resolution of the input image allows our proposed 4f optical linear network to perform more efficiently than a nonlinear network with the same accuracy on two fundamental image processing tasks: (i) object classification and (ii) semantic segmentation.
Published: 2021

19. Multi-Instrumentalist Net: Unsupervised Generation of Music from Body Movements

Author: Su, Kun, Liu, Xiulong, and Shlizerman, Eli
Subjects: Computer Science - Sound, Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: We propose a novel system that takes as an input body movements of a musician playing a musical instrument and generates music in an unsupervised setting. Learning to generate multi-instrumental music from videos without labeling the instruments is a challenging problem. To achieve the transformation, we built a pipeline named 'Multi-instrumentalistNet' (MI Net). At its base, the pipeline learns a discrete latent representation of various instruments' music from log-spectrogram using a Vector Quantized Variational Autoencoder (VQ-VAE) with multi-band residual blocks. The pipeline is then trained along with an autoregressive prior conditioned on the musician's body keypoint movements encoded by a recurrent neural network. Joint training of the prior with the body movements encoder succeeds in the disentanglement of the music into latent features indicating the musical components and the instrumental features. The latent space results in distributions that are clustered into distinct instruments from which new music can be generated. Furthermore, the VQ-VAE architecture supports detailed music generation with additional conditioning. We show that MIDI can further condition the latent space such that the pipeline will generate the exact content of the music being played by the instrument in the video. We evaluate MI Net on two datasets containing videos of 13 instruments and obtain generated music of reasonable audio quality, easily associated with the corresponding instrument, and consistent with the music audio content.
Comment: Please see associated video at https://www.youtube.com/watch?v=yo5OZKBbBh4
Published: 2020

20. Sparse Semi-Supervised Action Recognition with Active Learning

Author: Li, Jingyuan and Shlizerman, Eli
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Current state-of-the-art methods for skeleton-based action recognition are supervised and rely on labels. The reliance is limiting the performance due to the challenges involved in annotation and mislabeled data. Unsupervised methods have been introduced, however, they organize sequences into clusters and still require labels to associate clusters with actions. In this paper, we propose a novel approach for skeleton-based action recognition, called SESAR, that connects these approaches. SESAR leverages the information from both unlabeled data and a handful of sequences actively selected for labeling, combining unsupervised training with sparsely supervised guidance. SESAR is composed of two main components, where the first component learns a latent representation for unlabeled action sequences through an Encoder-Decoder RNN which reconstructs the sequences, and the second component performs active learning to select sequences to be labeled based on cluster and classification uncertainty. When the two components are simultaneously trained on skeleton-based action sequences, they correspond to a robust system for action recognition with only a handful of labeled samples. We evaluate our system on common datasets with multiple sequences and actions, such as NW UCLA, NTU RGB+D 60, and UWA3D. Our results outperform standalone skeleton-based supervised, unsupervised with cluster identification, and active-learning methods for action recognition when applied to sparse labeled samples, as low as 1% of the data.
Published: 2020

21. Neuro-PC: Causal Functional Connectivity from Neural Dynamics

Author: Biswas, Rahul and Shlizerman, Eli
Subjects: Quantitative Biology - Neurons and Cognition
Abstract: The functional connectome extends the anatomical connectome by capturing the relations between neurons according to their activity and interactions. When these relations are causal, the functional connectome maps how neural activity flows within neural circuits and provides the possibility for inference of functional neural pathways, such as sensory-motor-behavioral pathways. While there exist various information approaches for non-causal estimations of the functional connectome, approaches that characterize the causal functional connectivity, i.e., the causal relationships between neuronal time series, are scarce. In this work, we develop the Neuro-PC algorithm, which is a novel methodology for inferring the causal functional connectivity between neurons from multi-dimensional time series, such as neuronal recordings. The core of our methodology relies on a novel adaptation of the PC algorithm, a state-of-the-art method for statistical causal inference, to the multi-dimensional time-series of neural dynamics. We validate the performance of the method on network motifs with various interactions between their neurons simulated using a continuous-time artificial network of neurons. We then consider the application of the method to obtain the causal functional connectome for recent multi-array electrophysiological recordings from the mouse visual cortex in the presence of different stimuli. We show how features of the mapping can be used for quantification of the similarities between neural responses subject to different stimuli.
Published: 2020

22. On Lyapunov Exponents for RNNs: Understanding Information Propagation Using Dynamical Systems Tools

Author: Vogt, Ryan, Touzel, Maximilian Puelma, Shlizerman, Eli, and Lajoie, Guillaume
Subjects: Computer Science - Machine Learning, Mathematics - Dynamical Systems, Nonlinear Sciences - Chaotic Dynamics, Statistics - Machine Learning
Abstract: Recurrent neural networks (RNNs) have been successfully applied to a variety of problems involving sequential data, but their optimization is sensitive to parameter initialization, architecture, and optimizer hyperparameters. Considering RNNs as dynamical systems, a natural way to capture stability, i.e., the growth and decay over long iterates, is provided by the Lyapunov Exponents (LEs), which form the Lyapunov spectrum. The LEs have a bearing on stability of RNN training dynamics because forward propagation of information is related to the backward propagation of error gradients. LEs measure the asymptotic rates of expansion and contraction of nonlinear system trajectories, and generalize stability analysis to the time-varying attractors structuring the non-autonomous dynamics of data-driven RNNs. As a tool to understand and exploit stability of training dynamics, the Lyapunov spectrum fills an existing gap between prescriptive mathematical approaches of limited scope and computationally-expensive empirical approaches. To leverage this tool, we implement an efficient way to compute LEs for RNNs during training, discuss the aspects specific to standard RNN architectures driven by typical sequential datasets, and show that the Lyapunov spectrum can serve as a robust readout of training stability across hyperparameters. With this exposition-oriented contribution, we hope to draw attention to this understudied, but theoretically grounded tool for understanding training stability in RNNs., Comment: Associated github repository: https://github.com/shlizee/lyapunov-hyperopt
Published: 2020

23. Audeo: Audio Generation for a Silent Performance Video

Author: Su, Kun, Liu, Xiulong, and Shlizerman, Eli
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Computer Science - Multimedia, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: We present a novel system that takes as input video frames of a musician playing the piano and generates the music for that video. Generation of music from visual cues is a challenging problem and it is not clear whether it is an attainable goal at all. Our main aim in this work is to explore the plausibility of such a transformation and to identify cues and components able to carry the association of sounds with visual events. To achieve the transformation we built a full pipeline named 'Audeo' containing three components. We first translate the video frames of the keyboard and the musician's hand movements into a raw mechanical musical symbolic representation, Piano-Roll (Roll), for each video frame, which represents the keys pressed at each time step. We then adapt the Roll to be amenable for audio synthesis by including temporal correlations. This step turns out to be critical for meaningful audio generation. As a last step, we implement Midi synthesizers to generate realistic music. Audeo converts video to audio smoothly and clearly with only a few setup constraints. We evaluate Audeo on 'in the wild' piano performance videos and find that the generated music is of reasonable audio quality and can be successfully recognized with high precision by popular music identification software., Comment: Please see associated video at https://www.youtube.com/watch?v=8rS3VgjG7_c
Published: 2020

24. BI-MAML: Balanced Incremental Approach for Meta Learning

Author: Zheng, Yang, Xiang, Jinlin, Su, Kun, and Shlizerman, Eli
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Robotics, Quantitative Biology - Neurons and Cognition, Statistics - Machine Learning
Abstract: We present a novel Balanced Incremental Model Agnostic Meta Learning system (BI-MAML) for learning multiple tasks. Our method implements a meta-update rule to incrementally adapt its model to new tasks without forgetting old tasks. Such a capability is not possible in current state-of-the-art MAML approaches. These methods adapt effectively to new tasks but suffer from the 'catastrophic forgetting' phenomenon, in which new tasks that are streamed into the model degrade the performance of the model on previously learned tasks. Our system performs the meta-updates with only a few shots and accomplishes them successfully. Our key idea for achieving this is the design of a balanced learning strategy for the baseline model. The strategy sets the baseline model to perform equally well on various tasks and incorporates time efficiency. The balanced learning strategy enables BI-MAML both to outperform other state-of-the-art models in terms of classification accuracy for existing tasks and to accomplish efficient adaptation to similar new tasks with fewer required shots. We evaluate BI-MAML by conducting comparisons on two common benchmark datasets with multiple image classification tasks. BI-MAML performance demonstrates advantages in both accuracy and efficiency., Comment: Please see associated video at: https://youtu.be/4qlb-iG5SFo
Published: 2020

25. Deep Reinforcement Learning for Neural Control

Author: Kim, Jimin and Shlizerman, Eli
Subjects: Quantitative Biology - Neurons and Cognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Systems and Control
Abstract: We present a novel methodology for control of neural circuits based on deep reinforcement learning. Our approach achieves aimed behavior by generating external continuous stimulation of existing neural circuits (neuromodulation control) or modulations of neural circuits architecture (connectome control). Both forms of control are challenging due to nonlinear and recurrent complexity of neural activity. To infer candidate control policies, our approach maps neural circuits and their connectome into a grid-world like setting and infers the actions needed to achieve aimed behavior. The actions are inferred by adaptation of deep Q-learning methods known for their robust performance in navigating grid-worlds. We apply our approach to the model of C. elegans which simulates the full somatic nervous system with muscles and body. Our framework successfully infers neuropeptidic currents and synaptic architectures for control of chemotaxis. Our findings are consistent with in vivo measurements and provide additional insights into neural control of chemotaxis. We further demonstrate the generality and scalability of our methods by inferring chemotactic neural circuits from scratch., Comment: Please see the associated Video at: https://youtu.be/ixsUMfb9m_U
Published: 2020

26. Iterate & Cluster: Iterative Semi-Supervised Action Recognition

Author: Li, Jingyuan and Shlizerman, Eli
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: We propose a novel system for active semi-supervised feature-based action recognition. Given time sequences of features tracked during movements our system clusters the sequences into actions. Our system is based on encoder-decoder unsupervised methods shown to perform clustering by self-organization of their latent representation through the auto-regression task. These methods were tested on human action recognition benchmarks and outperformed non-feature based unsupervised methods and achieved comparable accuracy to skeleton-based supervised methods. However, such methods rely on K-Nearest Neighbours (KNN) associating sequences to actions, and general features with no annotated data would correspond to approximate clusters which could be further enhanced. Our system proposes an iterative semi-supervised method to address this challenge and to actively learn the association of clusters and actions. The method utilizes latent space embedding and clustering of the unsupervised encoder-decoder to guide the selection of sequences to be annotated in each iteration. Each iteration, the selection aims to enhance action recognition accuracy while choosing a small number of sequences for annotation. We test the approach on human skeleton-based action recognition benchmarks assuming that only annotations chosen by our method are available and on mouse movements videos recorded in lab experiments. We show that our system can boost recognition performance with only a small percentage of annotations. The system can be used as an interactive annotation tool to guide labeling efforts for 'in the wild' videos of various objects and actions to reach robust recognition., Comment: for associated video, see https://www.youtube.com/watch?v=ewuoz2tt73E
Published: 2020

27. R-FORCE: Robust Learning for Random Recurrent Neural Networks

Author: Zheng, Yang and Shlizerman, Eli
Subjects: Computer Science - Machine Learning, Computer Science - Neural and Evolutionary Computing, Quantitative Biology - Neurons and Cognition
Abstract: Random Recurrent Neural Networks (RRNN) are the simplest recurrent networks to model and extract features from sequential data. The simplicity however comes with a price; RRNN are known to be susceptible to diminishing/exploding gradient problem when trained with gradient-descent based optimization. To enhance robustness of RRNN, alternative training approaches have been proposed. Specifically, FORCE learning approach proposed a recursive least squares alternative to train RRNN and was shown to be applicable even for the challenging task of target-learning, where the network is tasked with generating dynamic patterns with no guiding input. While FORCE training indicates that solving target-learning is possible, it appears to be effective only in a specific regime of network dynamics (edge-of-chaos). We thereby investigate whether initialization of RRNN connectivity according to a tailored distribution can guarantee robust FORCE learning. We are able to generate such distribution by inference of four generating principles constraining the spectrum of the network Jacobian to remain in stability region. This initialization along with FORCE learning provides a robust training method, i.e., Robust-FORCE (R-FORCE). We validate R-FORCE performance on various target functions for a wide range of network configurations and compare with alternative methods. Our experiments indicate that R-FORCE facilitates significantly more stable and accurate target-learning for a wide class of RRNN. Such stability becomes critical in modeling multi-dimensional sequences as we demonstrate on modeling time-series of human body joints during physical movements., Comment: Github Repository: https://github.com/shlizee/R-FORCE
Published: 2020

28. PREDICT & CLUSTER: Unsupervised Skeleton Based Action Recognition

Author: Su, Kun, Liu, Xiulong, and Shlizerman, Eli
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: We propose a novel system for unsupervised skeleton-based action recognition. Given inputs of body keypoints sequences obtained during various movements, our system associates the sequences with actions. Our system is based on an encoder-decoder recurrent neural network, where the encoder learns a separable feature representation within its hidden states formed by training the model to perform prediction task. We show that according to such unsupervised training the decoder and the encoder self-organize their hidden states into a feature space which clusters similar movements into the same cluster and distinct movements into distant clusters. Current state-of-the-art methods for action recognition are strongly supervised, i.e., rely on providing labels for training. Unsupervised methods have been proposed, however, they require camera and depth inputs (RGB+D) at each time step. In contrast, our system is fully unsupervised, does not require labels of actions at any stage, and can operate with body keypoints input only. Furthermore, the method can perform on various dimensions of body keypoints (2D or 3D) and include additional cues describing movements. We evaluate our system on three extensive action recognition benchmarks with different number of actions and examples. Our results outperform prior unsupervised skeleton-based methods, unsupervised RGB+D based methods on cross-view tests and while being unsupervised have similar performance to supervised skeleton-based action recognition., Comment: See video at: https://www.youtube.com/watch?v=-dcCFUBRmwE
Published: 2019

29. Clustering and Recognition of Spatiotemporal Features through Interpretable Embedding of Sequence to Sequence Recurrent Neural Networks

Author: Su, Kun and Shlizerman, Eli
Subjects: Computer Science - Machine Learning, Electrical Engineering and Systems Science - Signal Processing, Quantitative Biology - Neurons and Cognition, Statistics - Machine Learning
Abstract: Encoder-decoder recurrent neural network models (RNN Seq2Seq) have achieved great success in ubiquitous areas of computation and applications. It was shown to be successful in modeling data with both temporal and spatial dependencies for translation or prediction tasks. In this study, we propose an embedding approach to visualize and interpret the representation of data by these models. Furthermore, we show that the embedding is an effective method for unsupervised learning and can be utilized to estimate the optimality of model training. In particular, we demonstrate that embedding space projections of the decoder states of RNN Seq2Seq model trained on sequences prediction are organized in clusters capturing similarities and differences in the dynamics of these sequences. Such performance corresponds to an unsupervised clustering of any spatio-temporal features and can be employed for time-dependent problems such as temporal segmentation, clustering of dynamic activity, self-supervised classification, action recognition, failure prediction, etc. We test and demonstrate the application of the embedding methodology to time-sequences of 3D human body poses. We show that the methodology provides a high-quality unsupervised categorization of movements.
Published: 2019

30. An Optical Frontend for a Convolutional Neural Network

Author: Colburn, Shane, Chu, Yi, Shlizerman, Eli, and Majumdar, Arka
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Emerging Technologies, Computer Science - Machine Learning, Physics - Optics
Abstract: The parallelism of optics and the miniaturization of optical components using nanophotonic structures, such as metasurfaces, present a compelling alternative to electronic implementations of convolutional neural networks. The lack of a low-power optical nonlinearity, however, requires slow and energy-inefficient conversions between the electronic and optical domains. Here, we design an architecture which utilizes a single electrical to optical conversion by designing a free-space optical frontend unit that implements the linear operations of the first layer with the subsequent layers realized electronically. Speed and power analysis of the architecture indicates that the hybrid photonic-electronic architecture outperforms a solely electronic architecture for large image sizes and kernels. Benchmarking of the photonic-electronic architecture on a modified version of AlexNet achieves a classification accuracy of 87% on images from the Kaggle Cats and Dogs challenge database.
Published: 2018
Full Text: View/download PDF

31. Audio to Body Dynamics

Author: Shlizerman, Eli, Dery, Lucio M., Schoen, Hayden, and Kemelmacher-Shlizerman, Ira
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Sound
Abstract: We present a method that gets as input an audio of violin or piano playing, and outputs a video of skeleton predictions which are further used to animate an avatar. The key idea is to create an animation of an avatar that moves their hands similarly to how a pianist or violinist would do, just from audio. Aiming for a fully detailed correct arms and fingers motion is a goal, however, it's not clear if body movement can be predicted from music at all. In this paper, we present the first result that shows that natural body dynamics can be predicted at all. We built an LSTM network that is trained on violin and piano recital videos uploaded to the Internet. The predicted points are applied onto a rigged avatar to create the animation., Comment: Link with videos https://arviolin.github.io/AudioBodyDynamics/
Published: 2017
Full Text: View/download PDF

32. Functional Connectomics from Data: Probabilistic Graphical Models for Neuronal Network of C. elegans

Author: Liu, Hexuan, Kim, Jimin, and Shlizerman, Eli
Subjects: Quantitative Biology - Neurons and Cognition
Abstract: We propose a data-driven approach to represent neuronal network dynamics as a Probabilistic Graphical Model (PGM). Our approach learns the PGM structure by employing dimension reduction to network response dynamics evoked by stimuli applied to each neuron separately. The outcome model captures how stimuli propagate through the network and thus represents functional dependencies between neurons, i.e., the functional connectome. The benefit of using a PGM as the functional connectome is that posterior inference can be done efficiently and circumvents the complexities in direct inference of response pathways in dynamic neuronal networks. In particular, posterior inference reveals the relations between known stimuli and downstream neurons or allows querying which stimuli are associated with downstream neurons. For validation and as an example of our approach, we apply our methodology to a model of the Caenorhabditis elegans nervous system, whose structure and dynamics are well-studied. From its dynamical model we collect time series of the network response and use singular value decomposition to obtain a low-dimensional projection of the time series data. We then extract dominant patterns in each data matrix to get pairwise dependency information and create a graphical model for the full somatic nervous system. The PGM enables us to obtain and verify underlying neuronal pathways dominant for known behavioral scenarios and to detect possible pathways for novel scenarios.
Published: 2017

33. Classification of Fixed Point Network Dynamics From Multiple Node Timeseries Data

Author: Blaszka, David, Sanders, Elischa, Riffell, Jeffrey, and Shlizerman, Eli
Subjects: Quantitative Biology - Neurons and Cognition
Abstract: Fixed point networks are dynamic networks that encode stimuli via distinct output patterns. Although such networks are omnipresent in neural systems, their structures are typically unknown or poorly characterized. It is therefore valuable to use a supervised approach for resolving how a network encodes distinct inputs of interest, and the superposition of those inputs from sampled multiple node time series. In this paper we show that accomplishing such a task involves finding a low-dimensional state space from supervised recordings. We demonstrate that standard methods for dimension reduction are unable to provide the desired functionality of optimal separation of the fixed points and transient trajectories to them. However, the combination of dimension reduction with selection and optimization can successfully provide such functionality. Specifically, we propose two methods: Exclusive Threshold Reduction (ETR) and Optimal Exclusive Threshold Reduction (OETR) for finding a basis for the classification state space. We show that the classification space constructed upon combination of dimension reduction and optimal separation can directly facilitate recognition of stimuli, and classify complex inputs (mixtures) into similarity classes. We test our methodology and compare it to standard state-of-the-art methods on a benchmark dataset: an experimental neuronal network (the olfactory system) that we recorded from to test these methods. We show that our methods are capable of providing a basis for the classification space in such a network, and of performing recognition at a significantly better rate than previously proposed approaches., Comment: submitted for publication
Published: 2017

34. Fokas's Unified Transform Method for linear systems

Author: Deconinck, Bernard, Guo, Qi, Shlizerman, Eli, and Vasan, Vishal
Subjects: Mathematics - Analysis of PDEs
Abstract: We demonstrate the use of the Unified Transform Method or Method of Fokas for boundary value problems for systems of constant-coefficient linear partial differential equations. We discuss how the apparent branch singularities typically appearing in the global relation are removable, allowing the method to proceed, in essence, as for scalar problems. We illustrate the use of the method with boundary value problems for the Klein-Gordon equation and the linearized Fitzhugh-Nagumo system. The case of wave equations is treated separately in an appendix.
Published: 2017

35. Symmetries constrain dynamics in a family of balanced neural networks

Author: Barreiro, Andrea K., Kutz, J. Nathan, and Shlizerman, Eli
Subjects: Quantitative Biology - Neurons and Cognition, 15B52, 34C14, 34C23, 37G40, 92B20
Abstract: We examine a family of random firing-rate neural networks in which we enforce the neurobiological constraint of Dale's Law: each neuron makes either excitatory or inhibitory connections onto its post-synaptic targets. We find that this constrained system may be described as a perturbation from a system with non-trivial symmetries. We analyze the symmetric system using the tools of equivariant bifurcation theory, and demonstrate that the symmetry-implied structures remain evident in the perturbed system. In comparison, spectral characteristics of the network coupling matrix are relatively uninformative about the behavior of the constrained system., Comment: In review; submitted 1/21/2016 Revision submitted 9/24/16
Published: 2016

36. AL-SAR: Active Learning for Skeleton-Based Action Recognition

Author: Li, Jingyuan, Le, Trung, and Shlizerman, Eli
Abstract: Action recognition from temporal multivariate sequences of features, such as identifying human actions, is typically approached by supervised training as it requires many ground truth annotations to reach high recognition accuracy. Unsupervised methods for the organization of sequences into clusters have been introduced; however, such methods continue to require annotations to associate clusters with actions. The challenges in annotation necessitate an effective classification methodology that minimizes the required number of labels. Active learning (AL) approaches have been proposed to address these challenges and were able to establish robust results on image classification. Such approaches are not directly applicable to sequences, since for sequences, the variations are in both spatial and temporal domains. In this brief, we introduce a novel method for AL for sequences, called “AL-SAR,” which combines unsupervised training with sparsely supervised annotation. In particular, AL-SAR employs a multi-head mechanism for robust uncertainty evaluation of the latent space learned by an encoder-decoder framework. It aims to iteratively select a sparse set of samples whose annotation contributes the most to the disentanglement of the latent space. We evaluate our system on common benchmark datasets with multiple sequences and actions, such as NW-UCLA, NTU RGB+D 60, and UWA3D. Our results indicate that AL-SAR coupled with an encoder-decoder network outperforms other AL methods coupled with the same network structure.
Published: 2024
Full Text: View/download PDF

37. The role of multistability and transient trajectories in networked dynamical systems: Connectomic dynamics of C. elegans and behavioral assays

Author: Kunert, James, Shlizerman, Eli, Walker, Andrew, and Kutz, J. Nathan
Subjects: Quantitative Biology - Neurons and Cognition
Abstract: The neural dynamics of the nematode C. elegans are experimentally low-dimensional and correspond to discrete behavioral states, where previous modeling work has found neural proxies for some of these states. Experimental results further suggest that dynamics may be understood as long-timescale transitions between multiple low-dimensional attractors. To identify multistable regimes of our model, we develop a method for systematic generation of bifurcation diagrams and their analysis in an interpretable low-dimensional subspace, showing the existence and nature of multistable input responses at a glance. Stimulation of the PLM neuron pair, experimentally associated with forward movement and shown in simulation to drive a limit cycle, defines our low-dimensional projection space. We then obtain bifurcation diagrams for single-neuron excitation over a range of amplitudes, which classify whether the dynamics in this projection space are associated with a limit cycle, fixed point, or multiple states. In the specific case of compound input into both the PLM pair and ASK pair we discover bistability of a limit cycle and a fixed point, with transitional timescales between different states being much longer than other timescales in the system. This suggests consistency of our model with the characterization of dynamics in neural systems as long-timescale transitions between discrete, low-dimensional attractors corresponding to behavioral states. Our methodology thus prescribes a method for identifying these states and transitions in response to arbitrary input., Comment: 8 Pages, 5 Figures
Published: 2015

38. Let the Beat Follow You - Creating Interactive Drum Sounds From Body Rhythm

Author: Liu, Xiulong, Su, Kun, and Shlizerman, Eli
Published: 2024
Full Text: View/download PDF

39. ElectroPhysiomeGAN: Generation of Biophysical Neuron Model Parameters from Recorded Electrophysiological Responses

Author: Kim, Jimin, Liu, Qiang, and Shlizerman, Eli
Published: 2023
Full Text: View/download PDF

40. Data-driven modeling of the olfactory neural codes and their dynamics in the insect antennal lobe

Author: Shlizerman, Eli, Riffell, Jeffrey A., and Kutz, J. Nathan
Subjects: Quantitative Biology - Neurons and Cognition
Abstract: Recordings from neurons in the insects' olfactory primary processing center, the antennal lobe (AL), reveal that the AL is able to process the input from chemical receptors into distinct neural activity patterns, called olfactory neural codes. These exciting results show the importance of neural codes and their relation to perception. The next challenge is to model the dynamics of neural codes. In our study, we perform multichannel recordings from the projection neurons in the AL driven by different odorants. We then derive a neural network from the electrophysiological data. The network consists of lateral-inhibitory neurons and excitatory neurons, and is capable of producing unique olfactory neural codes for the tested odorants. Specifically, we (i) design a projection, an odor space, for the neural recording from the AL, which discriminates between distinct odorant trajectories, (ii) characterize scent recognition, i.e., decision-making based on olfactory signals, and (iii) infer the wiring of the neural circuit, the connectome of the AL. We show that the constructed model is consistent with biological observations, such as contrast enhancement and robustness to noise. The study answers a key biological question in identifying how lateral inhibitory neurons can be wired to excitatory neurons to permit robust activity patterns.
Published: 2013
Full Text: View/download PDF

41. Low-dimensional functionality of complex network dynamics: Neuro-sensory integration in the Caenorhabditis elegans connectome

Author: Kunert, James, Shlizerman, Eli, and Kutz, J. Nathan
Subjects: Quantitative Biology - Neurons and Cognition
Abstract: We develop a biophysical model of neuro-sensory integration in the model organism Caenorhabditis elegans. Building on recent experimental findings of the neuron conductances and their resolved connectome, we posit the first full dynamic model of the neural voltage excitations that allows for a characterization of input stimuli to behavioral responses. Thus a clear connection between receptory cell inputs and downstream motor responses is illustrated, showing that robust, low-dimensional bifurcation structures dominate neural pathways of activity. The underlying bifurcation structures discovered, i.e., an induced Hopf bifurcation, are critical in explaining behavioral responses such as swimming and crawling. More broadly, we demonstrate that complex dynamical networks can produce robust functionality from underlying low-dimensional bifurcations.
Published: 2013
Full Text: View/download PDF

42. TKIL: Tangent Kernel Optimization for Class Balanced Incremental Learning

Author: Xiang, Jinlin, primary and Shlizerman, Eli, additional
Published: 2023
Full Text: View/download PDF

43. Functional connectomics from neural dynamics: probabilistic graphical models for neuronal network of Caenorhabditis elegans

Author: Liu, Hexuan, Kim, Jimin, and Shlizerman, Eli
Published: 2018

44. Fokas's Unified Transform Method for Linear Systems

Author: Deconinck, Bernard, Guo, Qi, Shlizerman, Eli, and Vasan, Vishal
Published: 2018

45. Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos

Author: Su, Kun, primary, Qian, Kaizhi, additional, Shlizerman, Eli, additional, Torralba, Antonio, additional, and Gan, Chuang, additional
Published: 2023
Full Text: View/download PDF

46. AL-SAR: Active Learning for Skeleton-Based Action Recognition

Author: Li, Jingyuan, primary, Le, Trung, additional, and Shlizerman, Eli, additional
Published: 2023
Full Text: View/download PDF

47. Statistical perspective on functional and causal neural connectomics: The Time-Aware PC algorithm

Author: Biswas, Rahul, primary and Shlizerman, Eli, additional
Published: 2022
Full Text: View/download PDF

48. OpenLabCluster: Active Learning Based Clustering and Classification of Animal Behaviors in Videos Based on Automatically Extracted Kinematic Body Keypoints

Author: Li, Jingyuan, primary, Keselman, Moishe, additional, and Shlizerman, Eli, additional
Published: 2022
Full Text: View/download PDF

49. Multi-block RNN Autoencoders Enable Broadband ECoG Signal Reconstruction

Author: Nolan, Michael, primary, Pesaran, Bijan, additional, Shlizerman, Eli, additional, and Orsborn, Amy, additional
Published: 2022
Full Text: View/download PDF

50. Flower discrimination by pollinators in a dynamic chemical environment

Author: Riffell, Jeffrey A., Shlizerman, Eli, Sanders, Elischa, Abrell, Leif, Medina, Billie, Hinterwirth, Armin J., and Kutz, J. Nathan
Published: 2014
