Author: "Moreno-Noguer, Francesc" / Topic: artificial intelligence - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Moreno-Noguer, Francesc"' showing total 15 results

Start Over Author "Moreno-Noguer, Francesc" Topic artificial intelligence

15 results on '"Moreno-Noguer, Francesc"'

1. Stochastic exploration of ambiguities for nonrigid shape recovery.

Author: Moreno-Noguer F and Fua P
Subjects: Data Interpretation, Statistical, Image Enhancement methods, Models, Biological, Models, Statistical, Reproducibility of Results, Sensitivity and Specificity, Algorithms, Artificial Intelligence, Image Interpretation, Computer-Assisted methods, Imaging, Three-Dimensional methods, Pattern Recognition, Automated methods
Abstract: Recovering the 3D shape of deformable surfaces from single images is known to be a highly ambiguous problem because many different shapes may have very similar projections. This is commonly addressed by restricting the set of possible shapes to linear combinations of deformation modes and by imposing additional geometric constraints. Unfortunately, because image measurements are noisy, such constraints do not always guarantee that the correct shape will be recovered. To overcome this limitation, we introduce a stochastic sampling approach to efficiently explore the set of solutions of an objective function based on point correspondences. This allows us to propose a small set of ambiguous candidate 3D shapes and then use additional image information to choose the best one. As a proof of concept, we use either motion or shading cues to this end and show that we can handle a complex objective function without having to solve a difficult nonlinear minimization problem. The advantages of our method are demonstrated on a variety of problems including both real and synthetic data.
Published: 2013
Full Text: View/download PDF

2. Active testing search for point cloud matching.

Author: Pinheiro MA, Sznitman R, Serradell E, Kybic J, Moreno-Noguer F, and Fua P
Subjects: Bayes Theorem, Humans, Information Storage and Retrieval methods, Models, Biological, Models, Statistical, Reproducibility of Results, Sensitivity and Specificity, Algorithms, Artificial Intelligence, Image Enhancement methods, Image Interpretation, Computer-Assisted methods, Pattern Recognition, Automated methods, Subtraction Technique
Abstract: We present a general approach for solving the point-cloud matching problem for the case of mildly nonlinear transformations. Our method quickly finds a coarse approximation of the solution by exploring a reduced set of partial matches using an approach to which we refer to as Active Testing Search (ATS). We apply the method to registration of graph structures by branching point matching. It is based solely on the geometric position of the points, no additional information is used nor the knowledge of an initial alignment. In the second stage, we use dynamic programming to refine the solution. We tested our algorithm on angiography, retinal fundus, and neuronal data gathered using electron and light microscopy. We show that our method solves cases not solved by most approaches, and is faster than the remaining ones.
Published: 2013
Full Text: View/download PDF

3. Dependent multiple cue integration for robust tracking.

Author: Moreno-Noguer F, Sanfeliu A, and Samaras D
Subjects: Motion, Reproducibility of Results, Sensitivity and Specificity, Algorithms, Artificial Intelligence, Cues, Image Enhancement methods, Image Interpretation, Computer-Assisted methods, Pattern Recognition, Automated methods, Subtraction Technique
Abstract: We propose a new technique for fusing multiple cues to robustly segment an object from its background in video sequences that suffer from abrupt changes of both illumination and position of the target. Robustness is achieved by the integration of appearance and geometric object features and by their estimation using Bayesian filters, such as Kalman or particle filters. In particular, each filter estimates the state of a specific object feature, conditionally dependent on another feature estimated by a distinct filter. This dependence provides improved target representations, permitting to segment it out from the background even in non-stationary sequences. Considering that the procedure of the Bayesian filters may be described by a "hypotheses generation--hypotheses correction" strategy, the major novelty of our methodology compared to previous approaches is that the mutual dependence between filters is considered during the feature observation, i.e, into the "hypotheses correction" stage,instead of considering it when generating the hypotheses. This proves to be much more effective in terms of accuracy and reliability. The proposed method is analytically justified and applied to develop a robust tracking system that adapts online and simultaneously the color space where the image points are represented, the color distributions, the contour of the object and its bounding box. Results with synthetic data and real video sequences demonstrate the robustness and versatility of our method.
Published: 2008
Full Text: View/download PDF

4. Challenge 4: Intelligent robotics

Author: Alenyà Ribas, Guillem|||0000-0002-6018-154X, Villagrá Serrano, Jorge, Fernández Saavedra, Maria Belén, González de Santos, Pablo, Haber Guerra, Rodolfo E., Jiménez Ruiz, Antonio Ramón, Ribeiro, Angela, Rocón de Lima, Eduardo, Borràs Sol, Júlia, Moreno-Noguer, Francesc, Torras, Carme|||0000-0002-2933-398X, Institut de Robòtica i Informàtica Industrial, and Universitat Politècnica de Catalunya. ROBiri - Grup de Robòtica de l'IRI
Subjects: Artificial intelligence, Robòtica, Social aspects of automation, Robotics, Artificial intelligence--Engineering applications, Intel·ligència artificial--Aplicacions a l'enginyeria, Informàtica::Robòtica [Àrees temàtiques de la UPC], Intelligent robots
Abstract: Accés lliure al text del llibre a la web de l'editor Intelligent robotics are called to be the next revolution by providing AI with the capability of interacting with the physical world. Robots are overpassing their cages in the industry to become intelligent machines that can live among us, helping in the service sector, as tools in rehabilitation and assistive tasks, and also as companions. Robotics poses especial problems and AI research must be reshaped and redefined to meet robotics special needs in areas like perception and scene understanding, decision making and learning, and actuation. Besides these classical robotics areas, modern robots need to take into account the central role of human-robot interaction : unstructured environments, unforeseen situations, user preferences, and safety. The challenges to frame this revolution are multiple. We highlight the seven where we identify CSIC has a strategic advantage and thus can cause a better impact. Modern robotics implies robots in human environments, what we called here robots for everyone : easy reprogramming and continuous learning. Deployment can include big-scale mobile robots and cars for autonomous navigation for cities, or small-scale robots for intelligent manipulation for new applications, possibly making use of effective and adaptive coordination of robot fleets. Robots in human environments require safe and ethical human-robot interaction, that can take advantage of seamless cooperative and everywhere localization solutions and dexterity and efficiency through bio-inspired and parallel mechanisms. Advances on intelligent robotics will have a great impact on science, industry, and society in general. Robots have the potential to change people’s lifestyle and thus, require special attention from rule bodies and policymakers. However, robotics is highly experimental and requires special efforts in physically building the prototypes. To make this possible, we believe a new joint lab or infrastructure must be established to facilitate research and testing, foster collaboration and involve industry and policy-makers.
Published: 2021

5. Modeling robot's world with minimal effort

Author: Villamizar, Michael, Garrell, Anaís, Sanfeliu, Alberto, Moreno-Noguer, Francesc, Institut de Robòtica i Informàtica Industrial, Universitat Politècnica de Catalunya. Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya. VIS - Visió Artificial i Sistemes Intel·ligents, Universitat Politècnica de Catalunya. ROBiri - Grup de Robòtica de l'IRI, Ministerio de Ciencia e Innovación (España), Ministerio de Economía y Competitividad (España), European Commission, and Universitat Politècnica de Catalunya. VIS - Visió Artificial i Sistemes Intel.ligents
Subjects: Engineering, business.industry, Small number, Human robot interaction, Mobile robot, Machine learning, computer.software_genre, Automation::Robots::Humanoid robots [Classificació INSPEC], Synthetic data, Object detection, Human–robot interaction, Scalability, Robot, Computer vision, Artificial intelligence, Informàtica::Robòtica [Àrees temàtiques de la UPC], business, Classifier (UML), computer
Abstract: Trabajo presentado al ICRA celebrado en Seattle (US) del 26 al 30 de mayo de 2015., We propose an efficient Human Robot Interaction approach to efficiently model the appearance of all relevant objects in robot's environment. Given an input video stream recorded while the robot is navigating, the user just needs to annotate a very small number of frames to build specific classifiers for each of the objects of interest. At the core of the method, there are several random ferns classifiers that share the same features and are updated online. The resulting methodology is fast (runs at 8 fps), versatile (it can be applied to unconstrained scenarios), scalable (real experiments show we can model up to 30 different object classes), and minimizes the amount of human intervention by leveraging the uncertainty measures associated to each classifier. We thoroughly validate the approach on synthetic data and on real sequences acquired with a mobile platform in outdoor and challenging scenarios containing a multitude of different objects. We show that the human can, with minimal effort, provide the robot with a detailed model of the objects in the scene., Work partially supported by the Spanish Ministry of Science and Innovation under project DPI2013-42458-P, ERA-Net Chistera project ViSen PCIN-2013-047, and by the EU project ARCAS FP7-ICT-2011-28761.
Published: 2015

6. Learning RGB-D descriptors of garment parts for informed robot grasping

Author: Ramisa, Arnau, Alenyà, Guillem, Moreno-Noguer, Francesc, Torras, Carme, Institut de Robòtica i Informàtica Industrial, Universitat Politècnica de Catalunya. ROBiri - Grup de Robòtica de l'IRI, Universitat Politècnica de Catalunya. VIS - Visió Artificial i Sistemes Intel·ligents, Universitat Politècnica de Catalunya. VIS - Visió Artificial i Sistemes Intel.ligents, Ministerio de Ciencia e Innovación (España), European Commission, and Consejo Superior de Investigaciones Científicas (España)
Subjects: 0209 industrial biotechnology, Computer science, 02 engineering and technology, Machine learning, computer.software_genre, garment part detection, computer vision, Task (project management), 020901 industrial engineering & automation, Artificial Intelligence, Pattern recognition, 0202 electrical engineering, electronic engineering, information engineering, Segmentation, Computer vision, Electrical and Electronic Engineering, Bag of visual words, manipulators, business.industry, pattern recognition, object detection, Classification, Clothing, robot vision, Object detection, machine learning, classification, Control and Systems Engineering, Bag-of-words model in computer vision, Pattern recognition (psychology), Robot, 020201 artificial intelligence & image processing, Artificial intelligence, Garment part detection, business, Informàtica::Robòtica [Àrees temàtiques de la UPC], bag-of-visual-words, computer, Pattern recognition::Computer vision [Classificació INSPEC]
Abstract: Robotic handling of textile objects in household environments is an emerging application that has recently received considerable attention thanks to the development of domestic robots. Most current approaches follow a multiple re-grasp strategy for this purpose, in which clothes are sequentially grasped from different points until one of them yields a desired configuration. In this work we propose a vision-based method, built on the Bag of Visual Words approach, that combines appearance and 3D information to detect parts suitable for grasping in clothes, even when they are highly wrinkled. We also contribute a new, annotated, garment part dataset that can be used for benchmarking classification, part detection, and segmentation algorithms. The dataset is used to evaluate our approach and several state-of-the-art 3D descriptors for the task of garment part detection. Results indicate that appearance is a reliable source of information, but that augmenting it with 3D information can help the method perform better with new clothing items., This research is partially funded by the Spanish Ministry of Science and Innovation under Project PAU+ DPI2011-2751, the EU Project IntellAct FP7-ICT2009-6-269959 and the ERA-Net Chistera Project ViSen PCIN-2013-047. A. Ramisa worked under the JAE-Doc grant from CSIC and FSE.
Published: 2014

7. Integration of conditionally dependent object features for robust figure-background segmentation

Author: Moreno-Noguer, Francesc, Sanfeliu, Alberto, and Samaras, Dimitris
Subjects: Object detection [Pattern recognition], Bayesian methods, business.industry, Object detection, Feature extraction, Pattern recognition: Computer vision, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Probabilistic logic, Pattern recognition, ComputingMilieux_LEGALASPECTSOFCOMPUTING, Pattern recognition systems, Image segmentation, Computer vision [Pattern recognition], Color space, ComputingMilieux_GENERAL, Robustness (computer science), Computer vision, Segmentation, Artificial intelligence, business, Particle filter, Pattern recognition: Object detection, Mathematics
Abstract: IEEE International Conference on Computer Vision (ICCV) 2005, Beijing (China), We propose a new technique for focusing multiple cues to robustly segment an object from its background in video sequences that suffer from abrupt changes of both illumination and position of the target. Robustness is achieved by tile integration of appearance and geometric object features and by their description using particle filters. Previous approaches assume independence of the object cues or apply the particle filter formulation to only one of the features, and assume a smooth change in the rest, which can prove is very limiting, especially when the state of some features needs to be updated using other cues or when their dynamics follow non-linear and unpredictable paths. Our technique offers a general framework to model the probabilistic relationship between features. The proposed method is analytically justified and applied to develop a robust tracking system that adapts online and simultaneously the color space where the image points are represented, the color distributions, and the contour of the object. Results with synthetic data and real video sequences demonstrate the robustness and versatility of our method., This work was supported by projects: 'Navegación autónoma de robots guiados por objetivos visuales' (070-720), 'Supervised learning of industrial scenes by means of an active vision equipped mobile robot.' (J-00063).
Published: 2005

8. Non-Rigid Graph Registration Using Active Testing Search.

Author: Serradell, Eduard, Pinheiro, Miguel Amavel, Sznitman, Raphael, Kybic, Jan, Moreno-Noguer, Francesc, and Fua, Pascal
Subjects: IMAGE registration, PATTERN matching, DIGITAL image processing, PATTERN recognition systems, GAUSSIAN processes, THREE-dimensional imaging, MEDICAL imaging systems, ARTIFICIAL intelligence
Abstract: We present a new approach for matching sets of branching curvilinear structures that form graphs embedded in \mathbb R^2 or \mathbb R^3 and may be subject to deformations. Unlike earlier methods, ours does not rely on local appearance similarity nor does require a good initial alignment. Furthermore, it can cope with non-linear deformations, topological differences, and partial graphs. To handle arbitrary non-linear deformations, we use Gaussian process regressions to represent the geometrical mapping relating the two graphs. In the absence of appearance information, we iteratively establish correspondences between points, update the mapping accordingly, and use it to estimate where to find the most likely correspondences that will be used in the next step. To make the computation tractable for large graphs, the set of new potential matches considered at each iteration is not selected at random as with many RANSAC-based algorithms. Instead, we introduce a so-called Active Testing Search strategy that performs a priority search to favor the most likely matches and speed-up the process. We demonstrate the effectiveness of our approach first on synthetic cases and then on angiography data, retinal fundus images, and microscopy image stacks acquired at very different resolutions. [ABSTRACT FROM PUBLISHER]
Published: 2015
Full Text: View/download PDF

9. Deformable motion 3D reconstruction by union of regularized subspaces

Author: Francese Moreno-Noguer, Antonio Agudo, Institut de Robòtica i Informàtica Industrial, Universitat Politècnica de Catalunya. ROBiri - Grup de Robòtica de l'IRI, Agudo, Antonio, Moreno-Noguer, Francesc, Agudo, Antonio [0000-0001-6845-4998], and Moreno-Noguer, Francesc [0000-0002-8640-684X]
Subjects: Union of Regularized Subspaces, Informàtica::Automàtica i control [Àrees temàtiques de la UPC], Computer science, optimisation, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Non-Rigid Structure from Motion, 010103 numerical & computational mathematics, 02 engineering and technology, 01 natural sciences, Regularization (mathematics), computer vision, 0202 electrical engineering, electronic engineering, information engineering, Computer vision, 0101 mathematics, Cluster analysis, ComputingMethodologies_COMPUTERGRAPHICS, Sparse matrix, business.industry, 3D reconstruction, Order-Varying Regularization, Linear subspace, 020201 artificial intelligence & image processing, Artificial intelligence, business, Pattern recognition::Computer vision [Classificació INSPEC]
Abstract: Trabajo presentado en la 25th IEEE International Conference on Image Processing (ICIP), celebrada en Atenas del 7 al 10 de octubre de 2018, This paper presents an approach to jointly retrieve camera pose, time-varying 3D shape, and automatic clustering based on motion primitives, from incomplete 2D trajectories in a monocular video. We introduce the concept of order-varying temporal regularization in order to exploit video data, that can be indistinctly applied to the 3D shape evolution as well as to the similarities between images. This results in a union of regularized subspaces which effectively encodes the 3D shape deformation. All parameters are learned via augmented Lagrange multipliers, in a unified and unsupervised manner that does not assume any training data at all. Experimental validation is reported on human motion from sparse to dense shapes, providing more robust and accurate solutions than state-of-the-art approaches in terms of 3D reconstruction, while also obtaining motion grouping results.
Published: 2018

10. Image collection pop-up: 3D reconstruction and clustering of rigid and non-rigid categories

Author: Antonio Agudo, Francesc Moreno-Noguer, Melcior Pijoan, Google, Ministerio de Economía y Competitividad (España), Agudo, Antonio [0000-0001-6845-4998], Moreno-Noguer, Francesc [0000-0002-8640-684X], Agudo, Antonio, Moreno-Noguer, Francesc, Institut de Robòtica i Informàtica Industrial, and Universitat Politècnica de Catalunya. ROBiri - Grup de Robòtica de l'IRI
Subjects: Informàtica::Automàtica i control [Àrees temàtiques de la UPC], Computer science, optimisation, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, pattern clustering, 010103 numerical & computational mathematics, 02 engineering and technology, Iterative reconstruction, 01 natural sciences, computer vision, Strain, 0202 electrical engineering, electronic engineering, information engineering, Computer vision, 0101 mathematics, Cluster analysis, ComputingMethodologies_COMPUTERGRAPHICS, business.industry, 3D reconstruction, Process (computing), Shape, Two dimensional displays, Object (computer science), Cameras, Linear subspace, Control theory [Classificació INSPEC], Image reconstruction, Three-dimensional displays, 020201 artificial intelligence & image processing, Rigid and Non-Rigid Categories, Deformable models, Artificial intelligence, business, 3D Reconstruction
Abstract: Trabajo presentado en la IEEE/CVF Conference on Computer Vision and Pattern Recognition, celebrada en Salt Lake City (UT, USA), del 18 al 23 de junio de 2018, This paper introduces an approach to simultaneously estimate 3D shape, camera pose, and object and type of deformation clustering, from partial 2D annotations in a multi-instance collection of images. Furthermore, we can indistinctly process rigid and non-rigid categories. This advances existing work, which only addresses the problem for one single object or, if multiple objects are considered, they are assumed to be clustered a priori. To handle this broader version of the problem, we model object deformation using a formulation based on multiple unions of subspaces, able to span from small rigid motion to complex deformations. The parameters of this model are learned via Augmented Lagrange Multipliers, in a completely unsupervised manner that does not require any training data at all. Extensive validation is provided in a wide variety of synthetic and real scenarios, including rigid and non-rigid categories with small and large deformations. In all cases our approach outperforms state-of-the-art in terms of 3D reconstruction accuracy, while also providing clustering results that allow segmenting the images into object instances and their associated type of deformation (or action the object is performing)., This work is supported in part by a Google Faculty Research Award, by the Spanish Ministry of Science and Innovation under projects HuMoUR TIN2017- 90086-R, and Mar´ıa de Maeztu Seal of Excellence MDM2016-0656.
Published: 2018

11. Geometry-aware network for non-rigid shape prediction from a single view

Author: Antonio Agudo, Francesc Moreno-Noguer, Alberto Sanfeliu, Albert Pumarola, Vincent Lepetit, Lorenzo Porzi, Google, Ministerio de Economía y Competitividad (España), European Commission, Agudo, Antonio [0000-0001-6845-4998], Moreno-Noguer, Francesc [0000-0002-8640-684X], Institut de Robòtica i Informàtica Industrial (IRI), Universitat Politècnica de Catalunya [Barcelona] (UPC)-Consejo Superior de Investigaciones Científicas [Spain] (CSIC), Institut de Recherche Interdisciplinaire [Villeneuve d'Ascq] (IRI), Université de Lille, Sciences et Technologies-Université de Lille, Droit et Santé-Centre National de la Recherche Scientifique (CNRS), Laboratoire Bordelais de Recherche en Informatique (LaBRI), Centre National de la Recherche Scientifique (CNRS)-École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB)-Université Sciences et Technologies - Bordeaux 1-Université Bordeaux Segalen - Bordeaux 2, Institut de Robòtica i Informàtica Industrial, Universitat Politècnica de Catalunya. Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya. VIS - Visió Artificial i Sistemes Intel.ligents, Universitat Politècnica de Catalunya. ROBiri - Grup de Robòtica de l'IRI, Consejo Superior de Investigaciones Científicas [Madrid] (CSIC)-Universitat Politècnica de Catalunya [Barcelona] (UPC), Centre National de la Recherche Scientifique (CNRS)-Université de Lille, Droit et Santé-Université de Lille, Sciences et Technologies, Université de Bordeaux (UB)-Centre National de la Recherche Scientifique (CNRS)-École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB), Lepetit, Vincent, Agudo, Antonio, Moreno-Noguer, Francesc, and Universitat Politècnica de Catalunya. VIS - Visió Artificial i Sistemes Intel·ligents
Subjects: [INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], FOS: Computer and information sciences, Surface (mathematics), Computer science, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 02 engineering and technology, Surface finish, Iterative reconstruction, 010501 environmental sciences, Deformation (meteorology), 01 natural sciences, Texture (geology), computer vision, Rendering (computer graphics), Image (mathematics), [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], Strain, Informàtica [Àrees temàtiques de la UPC], 0202 electrical engineering, electronic engineering, information engineering, Reconeixement de formes (Informàtica), Surface texture, 0105 earth and related environmental sciences, ComputingMethodologies_COMPUTERGRAPHICS, Infografia tridimensional, business.industry, Contrast (statistics), Shape, Two dimensional displays, Pattern recognition systems, Image reconstruction, Three-dimensional displays, 020201 artificial intelligence & image processing, Artificial intelligence, business, Three-dimensional modeling, Surface reconstruction, Algorithm, 3D Reconstruction [optimisation. Author keywords]
Abstract: Trabajo presentado en la IEEE/CVF Conference on Computer Vision and Pattern Recognition, celebrada en Salt Lake City (UT, USA), del 18 al 23 de junio de 2018, We propose a method for predicting the 3D shape of a deformable surface from a single view. By contrast with previous approaches, we do not need a pre-registered template of the surface, and our method is robust to the lack of texture and partial occlusions. At the core of our approach is a geometry-aware deep architecture that tackles the problem as usually done in analytic solutions: first perform 2D detection of the mesh and then estimate a 3D shape that is geometrically consistent with the image. We train this architecture in an end-to-end manner using a large dataset of synthetic renderings of shapes under different levels of deformation, material properties, textures and lighting conditions. We evaluate our approach on a test split of this dataset and available real benchmarks, consistently improving state-of-the-art solutions with a significantly lower computational time., This work is supported in part by a Google Faculty Research Award, by the Spanish Ministry of Science and Innovation under projects HuMoUR TIN2017- 90086-R, ColRobTransp DPI2016-78957 and Mar´ıa de Maeztu Seal of Excellence MDM-2016-0656; and by the EU project AEROARMS ICT-2014-1-644271. We also thank Nvidia for hardware donation.
Published: 2018

12. Force-based representation for non-rigid Shape and elastic model estimation

Author: Antonio Agudo, Francesc Moreno-Noguer, Ministerio de Economía y Competitividad (España), Google, Agudo, Antonio [0000-0001-6845-4998], Moreno-Noguer, Francesc [0000-0002-8640-684X], Institut de Robòtica i Informàtica Industrial, Universitat Politècnica de Catalunya. ROBiri - Grup de Robòtica de l'IRI, Agudo, Antonio, and Moreno-Noguer, Francesc
Subjects: Non-rigid structure from motion, 0209 industrial biotechnology, Informàtica::Automàtica i control [Àrees temàtiques de la UPC], Computer science, Trajectory, Elastic model, 02 engineering and technology, Solid modeling, 020901 industrial engineering & automation, Artificial Intelligence, Active shape model, Force space, 0202 electrical engineering, electronic engineering, information engineering, Force, ComputingMethodologies_COMPUTERGRAPHICS, 3D-reconstruction, business.industry, Applied Mathematics, Expectation maximization, 3D reconstruction, Shape, Two dimensional displays, Modelling [Classificació INSPEC], Missing data, Computational Theory and Mathematics, Three-dimensional displays, 020201 artificial intelligence & image processing, Deformable models, Computer Vision and Pattern Recognition, Artificial intelligence, business, 3D Reconstruction, Algorithm, Software
Abstract: This paper addresses the problem of simultaneously recovering 3D shape, pose and the elastic model of a deformable object from only 2D point tracks in a monocular video. This is a severely under-constrained problem that has been typically addressed by enforcing the shape or the point trajectories to lie on low-rank dimensional spaces. We show that formulating the problem in terms of a low-rank force space that induces the deformation and introducing the elastic model as an additional unknown, allows for a better physical interpretation of the resulting priors and a more accurate representation of the actual object's behavior. In order to simultaneously estimate force, pose, and the elastic model of the object we use an expectation maximization strategy, where each of these parameters are successively learned by partial M-steps. Once the elastic model is learned, it can be transfered to similar objects to code its 3D deformation. Moreover, our approach can robustly deal with missing data, and encode both rigid and non-rigid points under the same formalism. We thoroughly validate the approach on Mocap and real sequences, showing more accurate 3D reconstructions than state-of-the-art, and additionally providing an estimate of the full elastic model with no a priori information., This work has been partially supported by the Spanish Ministry of Science and Innovation under project RobInstruct TIN2014-58178-R; and by a Google Faculty Award. This work is also supported by the Spanish State Research Agency through the María de Maeztu Seal of Excellence to IRI MDM-2016-0656.
Published: 2018

13. Deep lidar CNN to understand the dynamics of moving vehicles

Author: Alberto Sanfeliu, Victor Vaquero, Francesc Moreno-Noguer, Ministerio de Economía y Competitividad (España), Moreno-Noguer, Francesc, Institut de Robòtica i Informàtica Industrial, Universitat Politècnica de Catalunya. Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya. VIS - Visió Artificial i Sistemes Intel.ligents, Universitat Politècnica de Catalunya. ROBiri - Grup de Robòtica de l'IRI, Moreno-Noguer, Francesc [https://orcid.org/0000-0002-8640-684X], and Universitat Politècnica de Catalunya. VIS - Visió Artificial i Sistemes Intel·ligents
Subjects: FOS: Computer and information sciences, lidar-flow, Informàtica::Automàtica i control [Àrees temàtiques de la UPC], Computer science, Computer Vision and Pattern Recognition (cs.CV), Feature extraction, Optical flow, Computer Science - Computer Vision and Pattern Recognition, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Machine Learning (stat.ML), Vehicle dynamics, motion features, computer vision, Statistics - Machine Learning, Machine learning, Computer vision, lidar, business.industry, Deep learning, feature extraction, deep learning, optical-flow, Dynamics, Semantics, Laser radar, Lidar, Task analysis, RGB color model, Three-dimensional displays, Artificial intelligence, Automation [Classificació INSPEC], business
Abstract: Trabajo presentado en la IEEE International Conference on Robotics and Automation (ICRA), celebrado en Brisbane (Australia), del 21 al 25 de mayo de 2018, Perception technologies in Autonomous Driving are experiencing their golden age due to the advances in Deep Learning. Yet, most of these systems rely on the semantically rich information of RGB images. Deep Learning solutions applied to the data of other sensors typically mounted on autonomous cars (e.g. lidars or radars) are not explored much. In this paper we propose a novel solution to understand the dynamics of moving vehicles of the scene from only lidar information. The main challenge of this problem stems from the fact that we need to disambiguate the proprio-motion of the “observer” vehicle from that of the external “observed” vehicles. For this purpose, we devise a CNN architecture which at testing time is fed with pairs of consecutive lidar scans. However, in order to properly learn the parameters of this network, during training we introduce a series of so-called pretext tasks which also leverage on image data. These tasks include semantic information about vehicleness and a novel lidar-flow feature which combines standard image-based optical flow with lidar scans. We obtain very promising results and show that including distilled image information only during training, allows improving the inference results of the network at test time, even when image data is no longer used., This work has been supported by the Spanish Ministry of Economy and Competitiveness projects HuMoUR (TIN2017-90086-R) and COLROBTRANSP (DPI2016-78957-R) and the Spanish State Research Agency through the Mar´ıa de Maeztu Seal of Excellence to IRI (MDM-2016-0656). The authors also thank Nvidia for hardware donation under the GPU grant program.
Published: 2018

14. Hallucinating dense optical flow from sparse lidar for autonomous vehicles

Author: Francesc Moreno-Noguer, Alberto Sanfeliu, Victor Vaquero, Institut de Robòtica i Informàtica Industrial, Universitat Politècnica de Catalunya. Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya. VIS - Visió Artificial i Sistemes Intel.ligents, Universitat Politècnica de Catalunya. ROBiri - Grup de Robòtica de l'IRI, Ministerio de Economía y Competitividad (España), European Commission, Moreno-Noguer, Francesc, Universitat Politècnica de Catalunya. VIS - Visió Artificial i Sistemes Intel·ligents, and Moreno-Noguer, Francesc [0000-0002-8640-684X]
Subjects: FOS: Computer and information sciences, Informàtica::Automàtica i control [Àrees temàtiques de la UPC], Computer science, Deep Lidar, Computer Vision and Pattern Recognition (cs.CV), Feature extraction, Optical flow, Computer Science - Computer Vision and Pattern Recognition, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 02 engineering and technology, Optical imaging, computer vision, Image (mathematics), Autonomous Driving, Image resolution, 11. Sustainability, 0202 electrical engineering, electronic engineering, information engineering, Range (statistics), Training, Computer vision, business.industry, feature extraction, pattern recognition, Laser radar, Lidar, Lidar-flow, Hallucinating, Optical sensors, Pattern recognition (psychology), Three-dimensional displays, 020201 artificial intelligence & image processing, Artificial intelligence, Optical variables measurement, business, Pattern recognition::Computer vision [Classificació INSPEC]
Abstract: Trabajo presentado en la 24th International Conference on Pattern Recognition (ICPR), celebrada en Beijing (China), del 20 al 24 de agosto de 2018, In this paper we propose a novel approach to estimate dense optical flow from sparse lidar data acquired on an autonomous vehicle. This is intended to be used as a drop-in replacement of any image-based optical flow system when images are not reliable due to e.g. adverse weather conditions or at night. In order to infer high resolution 2D flows from discrete range data we devise a three-block architecture of multiscale filters that combines multiple intermediate objectives, both in the lidar and image domain. To train this network we introduce a dataset with approximately 20K lidar samples of the Kitti dataset which we have augmented with a pseudo ground-truth image-based optical flow computed using FlowNet2. We demonstrate the effectiveness of our approach on Kitti, and show that despite using the low-resolution and sparse measurements of the lidar, we can regress dense optical flow maps which are at par with those estimated with image-based methods., This work has been supported by the Spanish Ministry of Economy, Industry and Competitiveness projects COLROBTRANSP (DPI2016-78957-R), HuMoUR (TIN2017-90086-R), the Spanish State Research Agency through the Maria de Maeztu Seal of Excellence (MDM-2016- 0656), and the EU project LOGIMATIC (H2020-Galileo2015-1-687534). We also thank Nvidia for hardware donation under the GPU Grant Program.

15. Unsupervised Person Image Synthesis in Arbitrary Poses

Author: Albert Pumarola, Alberto Sanfeliu, Francesc Moreno-Noguer, Antonio Agudo, Google, Ministerio de Economía y Competitividad (España), European Commission, Agudo, Antonio, Moreno-Noguer, Francesc, Institut de Robòtica i Informàtica Industrial, Universitat Politècnica de Catalunya. Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya. VIS - Visió Artificial i Sistemes Intel·ligents, Universitat Politècnica de Catalunya. ROBiri - Grup de Robòtica de l'IRI, Agudo, Antonio [0000-0001-6845-4998], Moreno-Noguer, Francesc [0000-0002-8640-684X], and Universitat Politècnica de Catalunya. VIS - Visió Artificial i Sistemes Intel.ligents
Subjects: FOS: Computer and information sciences, Mecànica humana, Computer science, Computer Vision and Pattern Recognition (cs.CV), Conditioned Image Generation, Computer Science - Computer Vision and Pattern Recognition, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 02 engineering and technology, 010501 environmental sciences, 01 natural sciences, computer vision, Geometria computacional, Image (mathematics), Deep Learning, Informàtica [Àrees temàtiques de la UPC], 0202 electrical engineering, electronic engineering, information engineering, Training, Computer vision, Reconeixement de formes (Informàtica), Pose, Skeleton, 0105 earth and related environmental sciences, Ground truth, business.industry, Deep learning, Data models, Human mechanics, Two dimensional displays, Pattern recognition systems, Gallium nitride, Generators, Hallucinating, Solid modeling, GANs [optimisation. Author keywords], 020201 artificial intelligence & image processing, Artificial intelligence, business, Generator (mathematics)
Abstract: Trabajo presentado en la IEEE/CVF Conference on Computer Vision and Pattern Recognition, celebrada en Salt Lake City (UT, USA), del 18 al 23 de junio de 2018, We present a novel approach for synthesizing photorealistic images of people in arbitrary poses using generative adversarial learning. Given an input image of a person and a desired pose represented by a 2D skeleton, our model renders the image of the same person under the new pose, synthesizing novel views of the parts visible in the input image and hallucinating those that are not seen. This problem has recently been addressed in a supervised manner [16, 35], i.e., during training the ground truth images under the new poses are given to the network. We go beyond these approaches by proposing a fully unsupervised strategy. We tackle this challenging scenario by splitting the problem into two principal subtasks. First, we consider a pose conditioned bidirectional generator that maps back the initially rendered image to the original pose, hence being directly comparable to the input image without the need to resort to any training image. Second, we devise a novel loss function that incorporates content and style terms, and aims at producing images of high perceptual quality. Extensive experiments conducted on the DeepFashion dataset demonstrate that the images rendered by our model are very close in appearance to those obtained by fully supervised approaches., This work is supported in part by a Google Faculty Research Award, by the Spanish Ministry of Science and Innovation under projects HuMoUR TIN2017- 90086-R, ColRobTransp DPI2016-78957 and Mar´ıa de Maeztu Seal of Excellence MDM-2016-0656; and by the EU project AEROARMS ICT-2014-1-644271. We also thank Nvidia for hardware donation under the GPU Grant Program.
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

15 results on '"Moreno-Noguer, Francesc"'

1. Stochastic exploration of ambiguities for nonrigid shape recovery.

2. Active testing search for point cloud matching.

3. Dependent multiple cue integration for robust tracking.

4. Challenge 4: Intelligent robotics

5. Modeling robot's world with minimal effort

6. Learning RGB-D descriptors of garment parts for informed robot grasping

7. Integration of conditionally dependent object features for robust figure-background segmentation

8. Non-Rigid Graph Registration Using Active Testing Search.

9. Deformable motion 3D reconstruction by union of regularized subspaces

10. Image collection pop-up: 3D reconstruction and clustering of rigid and non-rigid categories

11. Geometry-aware network for non-rigid shape prediction from a single view

12. Force-based representation for non-rigid Shape and elastic model estimation

13. Deep lidar CNN to understand the dynamics of moving vehicles

14. Hallucinating dense optical flow from sparse lidar for autonomous vehicles

15. Unsupervised Person Image Synthesis in Arbitrary Poses

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

15 results on '"Moreno-Noguer, Francesc"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources