Author: "Moschoglou, Stylianos" / Publication Year Range: Last 50 years - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Moschoglou, Stylianos"' showing total 39 results

1. Improving face generation quality and prompt following with synthetic captions

Author: Tarasiou, Michail, Moschoglou, Stylianos, Deng, Jiankang, and Zafeiriou, Stefanos
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Recent advancements in text-to-image generation using diffusion models have significantly improved the quality of generated images and expanded the ability to depict a wide range of objects. However, ensuring that these models adhere closely to the text prompts remains a considerable challenge. This issue is particularly pronounced when trying to generate photorealistic images of humans. Without significant prompt engineering efforts models often produce unrealistic images and typically fail to incorporate the full extent of the prompt information. This limitation can be largely attributed to the nature of captions accompanying the images used in training large scale diffusion models, which typically prioritize contextual information over details related to the person's appearance. In this paper we address this issue by introducing a training-free pipeline designed to generate accurate appearance descriptions from images of people. We apply this method to create approximately 250,000 captions for publicly available face datasets. We then use these synthetic captions to fine-tune a text-to-image diffusion model. Our results demonstrate that this approach significantly improves the model's ability to generate high-quality, realistic human faces and enhances adherence to the given prompts, compared to the baseline model. We share our synthetic captions, pretrained checkpoints and training code.
Published: 2024

2. AnimateMe: 4D Facial Expressions via Diffusion Models

Author: Gerogiannis, Dimitrios, Papantoniou, Foivos Paraperas, Potamias, Rolandos Alexandros, Lattas, Alexandros, Moschoglou, Stylianos, Ploumpis, Stylianos, and Zafeiriou, Stefanos
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The field of photorealistic 3D avatar reconstruction and generation has garnered significant attention in recent years; however, animating such avatars remains challenging. Recent advances in diffusion models have notably enhanced the capabilities of generative models in 2D animation. In this work, we directly utilize these models within the 3D domain to achieve controllable and high-fidelity 4D facial animation. By integrating the strengths of diffusion processes and geometric deep learning, we employ Graph Neural Networks (GNNs) as denoising diffusion models in a novel approach, formulating the diffusion process directly on the mesh space and enabling the generation of 3D facial expressions. This facilitates the generation of facial deformations through a mesh-diffusion-based model. Additionally, to ensure temporal coherence in our animations, we propose a consistent noise sampling method. Under a series of both quantitative and qualitative experiments, we showcase that the proposed method outperforms prior work in 4D expression synthesis by generating high-fidelity extreme expressions. Furthermore, we applied our method to textured 4D facial expression generation, implementing a straightforward extension that involves training on a large-scale textured 4D facial expression database.
Published: 2024

3. Arc2Face: A Foundation Model for ID-Consistent Human Faces

Author: Papantoniou, Foivos Paraperas, Lattas, Alexandros, Moschoglou, Stylianos, Deng, Jiankang, Kainz, Bernhard, and Zafeiriou, Stefanos
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper presents Arc2Face, an identity-conditioned face foundation model, which, given the ArcFace embedding of a person, can generate diverse photo-realistic images with an unparalleled degree of face similarity than existing models. Despite previous attempts to decode face recognition features into detailed images, we find that common high-resolution datasets (e.g. FFHQ) lack sufficient identities to reconstruct any subject. To that end, we meticulously upsample a significant portion of the WebFace42M database, the largest public dataset for face recognition (FR). Arc2Face builds upon a pretrained Stable Diffusion model, yet adapts it to the task of ID-to-face generation, conditioned solely on ID vectors. Deviating from recent works that combine ID with text embeddings for zero-shot personalization of text-to-image models, we emphasize on the compactness of FR features, which can fully capture the essence of the human face, as opposed to hand-crafted prompts. Crucially, text-augmented models struggle to decouple identity and text, usually necessitating some description of the given face to achieve satisfactory similarity. Arc2Face, however, only needs the discriminative features of ArcFace to guide the generation, offering a robust prior for a plethora of tasks where ID consistency is of paramount importance. As an example, we train a FR model on synthetic images from our model and achieve superior performance to existing synthetic datasets.
Comment: ECCV 2024 (Oral), 29 pages, 20 figures. Project page: https://arc2face.github.io/
Published: 2024

4. FitDiff: Robust monocular 3D facial shape and reflectance estimation using Diffusion Models

Author: Galanakis, Stathis, Lattas, Alexandros, Moschoglou, Stylianos, and Zafeiriou, Stefanos
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The remarkable progress in 3D face reconstruction has resulted in high-detail and photorealistic facial representations. Recently, Diffusion Models have revolutionized the capabilities of generative methods by surpassing the performance of GANs. In this work, we present FitDiff, a diffusion-based 3D facial avatar generative model. Leveraging diffusion principles, our model accurately generates relightable facial avatars, utilizing an identity embedding extracted from an "in-the-wild" 2D facial image. The introduced multi-modal diffusion model is the first to concurrently output facial reflectance maps (diffuse and specular albedo and normals) and shapes, showcasing great generalization capabilities. It is solely trained on an annotated subset of a public facial dataset, paired with 3D reconstructions. We revisit the typical 3D facial fitting approach by guiding a reverse diffusion process using perceptual and face recognition losses. Being the first 3D LDM conditioned on face recognition embeddings, FitDiff reconstructs relightable human avatars, that can be used as-is in common rendering engines, starting only from an unconstrained facial image, and achieving state-of-the-art performance.
Published: 2023

5. FitMe: Deep Photorealistic 3D Morphable Model Avatars

Author: Lattas, Alexandros, Moschoglou, Stylianos, Ploumpis, Stylianos, Gecer, Baris, Deng, Jiankang, and Zafeiriou, Stefanos
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, Computer Science - Machine Learning, I.2.10, I.3.7, I.4.1
Abstract: In this paper, we introduce FitMe, a facial reflectance model and a differentiable rendering optimization pipeline, that can be used to acquire high-fidelity renderable human avatars from single or multiple images. The model consists of a multi-modal style-based generator, that captures facial appearance in terms of diffuse and specular reflectance, and a PCA-based shape model. We employ a fast differentiable rendering process that can be used in an optimization pipeline, while also achieving photorealistic facial shading. Our optimization process accurately captures both the facial reflectance and shape in high-detail, by exploiting the expressivity of the style-based latent representation and of our shape model. FitMe achieves state-of-the-art reflectance acquisition and identity preservation on single "in-the-wild" facial images, while it produces impressive scan-like results, when given multiple unconstrained facial images pertaining to the same identity. In contrast with recent implicit avatar reconstructions, FitMe requires only one minute and produces relightable mesh and texture-based avatars, that can be used by end-user applications.
Comment: Accepted at CVPR 2023, 17 pages including supplementary material. Project page: https://lattas.github.io/fitme
Published: 2023

6. Relightify: Relightable 3D Faces from a Single Image via Diffusion Models

Author: Papantoniou, Foivos Paraperas, Lattas, Alexandros, Moschoglou, Stylianos, and Zafeiriou, Stefanos
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Following the remarkable success of diffusion models on image generation, recent works have also demonstrated their impressive ability to address a number of inverse problems in an unsupervised way, by properly constraining the sampling process based on a conditioning input. Motivated by this, in this paper, we present the first approach to use diffusion models as a prior for highly accurate 3D facial BRDF reconstruction from a single image. We start by leveraging a high-quality UV dataset of facial reflectance (diffuse and specular albedo and normals), which we render under varying illumination settings to simulate natural RGB textures and, then, train an unconditional diffusion model on concatenated pairs of rendered textures and reflectance components. At test time, we fit a 3D morphable model to the given image and unwrap the face in a partial UV texture. By sampling from the diffusion model, while retaining the observed texture part intact, the model inpaints not only the self-occluded areas but also the unknown reflectance components, in a single sequence of denoising steps. In contrast to existing methods, we directly acquire the observed texture from the input image, thus, resulting in more faithful and consistent reflectance estimation. Through a series of qualitative and quantitative comparisons, we demonstrate superior performance in both texture completion as well as reflectance reconstruction tasks.
Comment: ICCV 2023, 15 pages, 14 figures. Project page: https://foivospar.github.io/Relightify/
Published: 2023

7. AvatarMe++: Facial Shape and BRDF Inference with Photorealistic Rendering-Aware GANs

Author: Lattas, Alexandros, Moschoglou, Stylianos, Ploumpis, Stylianos, Gecer, Baris, Ghosh, Abhijeet, and Zafeiriou, Stefanos
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, I.4.1, I.3.7, I.2.10
Abstract: Over the last years, many face analysis tasks have accomplished astounding performance, with applications including face generation and 3D face reconstruction from a single "in-the-wild" image. Nevertheless, to the best of our knowledge, there is no method which can produce render-ready high-resolution 3D faces from "in-the-wild" images and this can be attributed to the: (a) scarcity of available data for training, and (b) lack of robust methodologies that can successfully be applied on very high-resolution data. In this work, we introduce the first method that is able to reconstruct photorealistic render-ready 3D facial geometry and BRDF from a single "in-the-wild" image. We capture a large dataset of facial shape and reflectance, which we have made public. We define a fast facial photorealistic differentiable rendering methodology with accurate facial skin diffuse and specular reflection, self-occlusion and subsurface scattering approximation. With this, we train a network that disentangles the facial diffuse and specular BRDF components from a shape and texture with baked illumination, reconstructed with a state-of-the-art 3DMM fitting method. Our method outperforms the existing arts by a significant margin and reconstructs high-resolution 3D faces from a single low-resolution image, that can be rendered in various applications, and bridge the uncanny valley.
Comment: Project and Dataset page: (https://github.com/lattas/AvatarMe). 20 pages, including supplemental materials. Accepted for publishing at IEEE Transactions on Pattern Analysis and Machine Intelligence on 13 November 2021. Copyright 2021 IEEE. Personal use of this material is permitted
Published: 2021

8. 3D human tongue reconstruction from single 'in-the-wild' images

Author: Ploumpis, Stylianos, Moschoglou, Stylianos, Triantafyllou, Vasileios, and Zafeiriou, Stefanos
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Graphics
Abstract: 3D face reconstruction from a single image is a task that has garnered increased interest in the Computer Vision community, especially due to its broad use in a number of applications such as realistic 3D avatar creation, pose invariant face recognition and face hallucination. Since the introduction of the 3D Morphable Model in the late 90's, we witnessed an explosion of research aiming at particularly tackling this task. Nevertheless, despite the increasing level of detail in the 3D face reconstructions from single images mainly attributed to deep learning advances, finer and highly deformable components of the face such as the tongue are still absent from all 3D face models in the literature, although being very important for the realness of the 3D avatar representations. In this work we present the first, to the best of our knowledge, end-to-end trainable pipeline that accurately reconstructs the 3D face together with the tongue. Moreover, we make this pipeline robust in "in-the-wild" images by introducing a novel GAN method tailored for 3D tongue surface generation. Finally, we make publicly available to the community the first diverse tongue dataset, consisting of 1,800 raw scans of 700 individuals varying in gender, age, and ethnicity backgrounds. As we demonstrate in an extensive series of quantitative as well as qualitative experiments, our model proves to be robust and realistically captures the 3D tongue structure, even in adverse "in-the-wild" conditions.
Comment: 10 pages, 9 figures
Published: 2021

9. Deep Polynomial Neural Networks

Author: Chrysos, Grigorios, Moschoglou, Stylianos, Bouritsas, Giorgos, Deng, Jiankang, Panagakis, Yannis, and Zafeiriou, Stefanos
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition, Statistics - Machine Learning
Abstract: Deep Convolutional Neural Networks (DCNNs) are currently the method of choice both for generative, as well as for discriminative learning in computer vision and machine learning. The success of DCNNs can be attributed to the careful selection of their building blocks (e.g., residual blocks, rectifiers, sophisticated normalization schemes, to mention but a few). In this paper, we propose $\Pi$-Nets, a new class of function approximators based on polynomial expansions. $\Pi$-Nets are polynomial neural networks, i.e., the output is a high-order polynomial of the input. The unknown parameters, which are naturally represented by high-order tensors, are estimated through a collective tensor factorization with factors sharing. We introduce three tensor decompositions that significantly reduce the number of parameters and show how they can be efficiently implemented by hierarchical neural networks. We empirically demonstrate that $\Pi$-Nets are very expressive and they even produce good results without the use of non-linear activation functions in a large battery of tasks and signals, i.e., images, graphs, and audio. When used in conjunction with activation functions, $\Pi$-Nets produce state-of-the-art results in three challenging tasks, i.e. image generation, face verification and 3D mesh representation learning. The source code is available at https://github.com/grigorisg9gr/polynomial_nets.
Comment: Published in IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI). Code: https://github.com/grigorisg9gr/polynomial_nets. arXiv admin note: substantial text overlap with arXiv:2003.03828
Published: 2020

10. AvatarMe: Realistically Renderable 3D Facial Reconstruction 'in-the-wild'

Author: Lattas, Alexandros, Moschoglou, Stylianos, Gecer, Baris, Ploumpis, Stylianos, Triantafyllou, Vasileios, Ghosh, Abhijeet, and Zafeiriou, Stefanos
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, I.2.10, I.3.7, I.4.1
Abstract: Over the last years, with the advent of Generative Adversarial Networks (GANs), many face analysis tasks have accomplished astounding performance, with applications including, but not limited to, face generation and 3D face reconstruction from a single "in-the-wild" image. Nevertheless, to the best of our knowledge, there is no method which can produce high-resolution photorealistic 3D faces from "in-the-wild" images and this can be attributed to the: (a) scarcity of available data for training, and (b) lack of robust methodologies that can successfully be applied on very high-resolution data. In this paper, we introduce AvatarMe, the first method that is able to reconstruct photorealistic 3D faces from a single "in-the-wild" image with an increasing level of detail. To achieve this, we capture a large dataset of facial shape and reflectance and build on a state-of-the-art 3D texture and shape reconstruction method and successively refine its results, while generating the per-pixel diffuse and specular components that are required for realistic rendering. As we demonstrate in a series of qualitative and quantitative experiments, AvatarMe outperforms the existing arts by a significant margin and reconstructs authentic, 4K by 6K-resolution 3D faces from a single low-resolution image that, for the first time, bridges the uncanny valley.
Comment: Accepted to CVPR 2020. Project page: github.com/lattas/AvatarMe with high resolution results, data and more. 10 pages, 9 figures
Published: 2020

11. $\Pi-$nets: Deep Polynomial Neural Networks

Author: Chrysos, Grigorios G., Moschoglou, Stylianos, Bouritsas, Giorgos, Panagakis, Yannis, Deng, Jiankang, and Zafeiriou, Stefanos
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition, Statistics - Machine Learning
Abstract: Deep Convolutional Neural Networks (DCNNs) are currently the method of choice both for generative, as well as for discriminative learning in computer vision and machine learning. The success of DCNNs can be attributed to the careful selection of their building blocks (e.g., residual blocks, rectifiers, sophisticated normalization schemes, to mention but a few). In this paper, we propose $\Pi$-Nets, a new class of DCNNs. $\Pi$-Nets are polynomial neural networks, i.e., the output is a high-order polynomial of the input. $\Pi$-Nets can be implemented using a special kind of skip connections and their parameters can be represented via high-order tensors. We empirically demonstrate that $\Pi$-Nets have better representation power than standard DCNNs and they even produce good results without the use of non-linear activation functions in a large battery of tasks and signals, i.e., images, graphs, and audio. When used in conjunction with activation functions, $\Pi$-Nets produce state-of-the-art results in challenging tasks, such as image generation. Lastly, our framework elucidates why recent generative models, such as StyleGAN, improve upon their predecessors, e.g., ProGAN.
Comment: Accepted in CVPR 2020
Published: 2020

12. Advances in generative modelling : from component analysis to generative adversarial networks

Author: Moschoglou, Stylianos and Zafeiriou, Stefanos
Abstract: This Thesis revolves around datasets and algorithms, with a focus on generative modelling. In particular, we first turn our attention to a novel, multi-attribute, 2D facial dataset. We then present deterministic as well as probabilistic Component Analysis (CA) techniques which can be applied to multi-attribute 2D as well as 3D data. We finally present deep learning generative approaches specially designed to manipulate 3D facial data. Most 2D facial datasets that are available in the literature are automatically or semi-automatically collected and thus contain noisy labels, hindering the benchmarking and comparisons between algorithms. Moreover, they are not annotated for multiple attributes. In the first part of the Thesis, we present the first manually collected and annotated database, which contains labels for multiple attributes. As we demonstrate in a series of experiments, it can be used in a number of applications ranging from image translation to age-invariant face recognition. Moving on, we turn our attention to CA methodologies. CA approaches, although being able to only capture linear relationships between data, can still be proven to be efficient in data such as UV maps or 3D data registered in a common template, since they are well aligned. The introduction of more complex datasets in the literature, which contain labels for multiple attributes, naturally brought the need for novel algorithms that can simultaneously handle multiple attributes. In this Thesis, we cover novel CA approaches which are specifically designed to be utilised in datasets annotated with respect to multiple attributes and can be used in a variety of tasks, such as 2D image denoising and translation, as well as 3D data generation and identification. Nevertheless, while CA methods are indeed efficient when handling registered 3D facial data, linear 3D generative models lack details when it comes to reconstructing or generating finer facial characteristics. To alleviate this, in the final part of this Thesis we propose a novel generative framework harnessing the power of Generative Adversarial Networks.
Published: 2021

13. Towards a complete 3D morphable model of the human head

Author: Ploumpis, Stylianos, Ververas, Evangelos, O'Sullivan, Eimear, Moschoglou, Stylianos, Wang, Haoyang, Pears, Nick, Smith, William A. P., Gecer, Baris, and Zafeiriou, Stefanos
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Three-dimensional Morphable Models (3DMMs) are powerful statistical tools for representing the 3D shapes and textures of an object class. Here we present the most complete 3DMM of the human head to date that includes face, cranium, ears, eyes, teeth and tongue. To achieve this, we propose two methods for combining existing 3DMMs of different overlapping head parts: i. use a regressor to complete missing parts of one model using the other, ii. use the Gaussian Process framework to blend covariance matrices from multiple models. Thus we build a new combined face-and-head shape model that blends the variability and facial detail of an existing face model (the LSFM) with the full head modelling capability of an existing head model (the LYHM). Then we construct and fuse a highly-detailed ear model to extend the variation of the ear shape. Eye and eye region models are incorporated into the head model, along with basic models of the teeth, tongue and inner mouth cavity. The new model achieves state-of-the-art performance. We use our model to reconstruct full head representations from single, unconstrained images allowing us to parameterize craniofacial shape and texture, along with the ear shape, eye gaze and eye color.
Comment: 18 pages, 18 figures, submitted to Transactions on Pattern Analysis and Machine Intelligence (TPAMI) on the 9th of October as an extension paper of the original oral CVPR paper: arXiv:1903.03785
Published: 2019

14. Synthesizing Coupled 3D Face Modalities by Trunk-Branch Generative Adversarial Networks

Author: Gecer, Baris, Lattas, Alexander, Ploumpis, Stylianos, Deng, Jiankang, Papaioannou, Athanasios, Moschoglou, Stylianos, and Zafeiriou, Stefanos
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics
Abstract: Generating realistic 3D faces is of high importance for computer graphics and computer vision applications. Generally, research on 3D face generation revolves around linear statistical models of the facial surface. Nevertheless, these models cannot represent faithfully either the facial texture or the normals of the face, which are very crucial for photo-realistic face synthesis. Recently, it was demonstrated that Generative Adversarial Networks (GANs) can be used for generating high-quality textures of faces. Nevertheless, the generation process either omits the geometry and normals, or independent processes are used to produce 3D shape information. In this paper, we present the first methodology that generates high-quality texture, shape, and normals jointly, which can be used for photo-realistic synthesis. To do so, we propose a novel GAN that can generate data from different modalities while exploiting their correlations. Furthermore, we demonstrate how we can condition the generation on the expression and create faces with various facial expressions. The qualitative results shown in this paper are compressed due to size limitations, full-resolution results and the accompanying video can be found in the supplementary documents. The code and models are available at the project page: https://github.com/barisgecer/TBGAN.
Comment: Check project page: https://github.com/barisgecer/TBGAN for the full resolution results and the accompanying video
Published: 2019

15. PolyGAN: High-Order Polynomial Generators

Author: Chrysos, Grigorios, Moschoglou, Stylianos, Panagakis, Yannis, and Zafeiriou, Stefanos
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Generative Adversarial Networks (GANs) have become the gold standard when it comes to learning generative models for high-dimensional distributions. Since their advent, numerous variations of GANs have been introduced in the literature, primarily focusing on utilization of novel loss functions, optimization/regularization strategies and network architectures. In this paper, we turn our attention to the generator and investigate the use of high-order polynomials as an alternative class of universal function approximators. Concretely, we propose PolyGAN, where we model the data generator by means of a high-order polynomial whose unknown parameters are naturally represented by high-order tensors. We introduce two tensor decompositions that significantly reduce the number of parameters and show how they can be efficiently implemented by hierarchical neural networks that only employ linear/convolutional blocks. We exhibit for the first time that by using our approach a GAN generator can approximate the data distribution without using any activation functions. Thorough experimental evaluation on both synthetic and real data (images and 3D point clouds) demonstrates the merits of PolyGAN against the state of the art.
Published: 2019

16. 3DFaceGAN: Adversarial Nets for 3D Face Representation, Generation, and Translation

Author: Moschoglou, Stylianos, Ploumpis, Stylianos, Nicolaou, Mihalis, Papaioannou, Athanasios, and Zafeiriou, Stefanos
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Over the past few years, Generative Adversarial Networks (GANs) have garnered increased interest among researchers in Computer Vision, with applications including, but not limited to, image generation, translation, imputation, and super-resolution. Nevertheless, no GAN-based method has been proposed in the literature that can successfully represent, generate or translate 3D facial shapes (meshes). This can be primarily attributed to two facts, namely that (a) publicly available 3D face databases are scarce as well as limited in terms of sample size and variability (e.g., few subjects, little diversity in race and gender), and (b) mesh convolutions for deep networks present several challenges that are not entirely tackled in the literature, leading to operator approximations and model instability, often failing to preserve high-frequency components of the distribution. As a result, linear methods such as Principal Component Analysis (PCA) have been mainly utilized towards 3D shape analysis, despite being unable to capture non-linearities and high frequency details of the 3D face - such as eyelid and lip variations. In this work, we present 3DFaceGAN, the first GAN tailored towards modeling the distribution of 3D facial surfaces, while retaining the high frequency details of 3D face shapes. We conduct an extensive series of both qualitative and quantitative experiments, where the merits of 3DFaceGAN are clearly demonstrated against other, state-of-the-art methods in tasks such as 3D shape representation, generation, and translation.
Comment: 15 pages, 12 figures. Submitted to International Journal of Computer Vision (IJCV), special issue: Generative Adversarial Networks for Computer Vision
Published: 2019

17. MimicME: A Large Scale Diverse 4D Database for Facial Expression Analysis

Author: Papaioannou, Athanasios, Gecer, Baris, Cheng, Shiyang, Chrysos, Grigorios, Deng, Jiankang, Fotiadou, Eftychia, Kampouris, Christos, Kollias, Dimitrios, Moschoglou, Stylianos, Songsri-In, Kritaphat, Ploumpis, Stylianos, Trigeorgis, George, Tzirakis, Panagiotis, Ververas, Evangelos, Zhou, Yuxiang, Ponniah, Allan, Roussos, Anastasios, and Zafeiriou, Stefanos
Founding Editor: Goos, Gerhard, and Hartmanis, Juris
Editorial Board Member: Bertino, Elisa, Gao, Wen, Steffen, Bernhard, and Yung, Moti
Editor: Avidan, Shai, Brostow, Gabriel, Cissé, Moustapha, Farinella, Giovanni Maria, and Hassner, Tal
Published: 2022

18. Multi-Attribute Robust Component Analysis for Facial UV Maps

Author: Moschoglou, Stylianos, Ververas, Evangelos, Panagakis, Yannis, Nicolaou, Mihalis, and Zafeiriou, Stefanos
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recently, due to the collection of large scale 3D face models, as well as the advent of deep learning, a significant progress has been made in the field of 3D face alignment "in-the-wild". That is, many methods have been proposed that establish sparse or dense 3D correspondences between a 2D facial image and a 3D face model. The utilization of 3D face alignment introduces new challenges and research directions, especially on the analysis of facial texture images. In particular, texture does not suffer any more from warping effects (that occurred when 2D face alignment methods were used). Nevertheless, since facial images are commonly captured in arbitrary recording conditions, a considerable amount of missing information and gross outliers is observed (e.g., due to self-occlusion, or subjects wearing eye-glasses). Given that many annotated databases have been developed for face analysis tasks, it is evident that component analysis techniques need to be developed in order to alleviate issues arising from the aforementioned challenges. In this paper, we propose a novel component analysis technique that is suitable for facial UV maps containing a considerable amount of missing information and outliers, while additionally, incorporates knowledge from various attributes (such as age and identity). We evaluate the proposed Multi-Attribute Robust Component Analysis (MA-RCA) on problems such as UV completion and age progression, where the proposed method outperforms compared techniques. Finally, we demonstrate that MA-RCA method is powerful enough to provide weak annotations for training deep learning systems for various applications, such as illumination transfer.
Published: 2017
Full Text: View/download PDF

19. Synthesizing Coupled 3D Face Modalities by Trunk-Branch Generative Adversarial Networks

Author: Gecer, Baris, Lattas, Alexandros, Ploumpis, Stylianos, Deng, Jiankang, Papaioannou, Athanasios, Moschoglou, Stylianos, Zafeiriou, Stefanos, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Vedaldi, Andrea, editor, Bischof, Horst, editor, Brox, Thomas, editor, and Frahm, Jan-Michael, editor
Published: 2020
Full Text: View/download PDF

20. Arc2Face: A Foundation Model of Human Faces

Author: Papantoniou, Foivos Paraperas, Lattas, Alexandros, Moschoglou, Stylianos, Deng, Jiankang, Kainz, Bernhard, and Zafeiriou, Stefanos
Abstract: This paper presents Arc2Face, an identity-conditioned face foundation model, which, given the ArcFace embedding of a person, can generate diverse photo-realistic images with a greater degree of face similarity than existing models. Despite previous attempts to decode face recognition features into detailed images, we find that common high-resolution datasets (e.g. FFHQ) lack sufficient identities to reconstruct any subject. To that end, we meticulously upsample a significant portion of the WebFace42M database, the largest public dataset for face recognition (FR). Arc2Face builds upon a pretrained Stable Diffusion model, yet adapts it to the task of ID-to-face generation, conditioned solely on ID vectors. Deviating from recent works that combine ID with text embeddings for zero-shot personalization of text-to-image models, we emphasize the compactness of FR features, which can fully capture the essence of the human face, as opposed to hand-crafted prompts. Crucially, text-augmented models struggle to decouple identity and text, usually necessitating some description of the given face to achieve satisfactory similarity. Arc2Face, however, only needs the discriminative features of ArcFace to guide the generation, offering a robust prior for a plethora of tasks where ID consistency is of paramount importance. As an example, we train an FR model on synthetic images from our model and achieve superior performance to existing synthetic datasets. Comment: 29 pages, 20 figures. Project page: https://arc2face.github.io
Published: 2024

21. Multi-Attribute Probabilistic Linear Discriminant Analysis for 3D Facial Shapes

Author: Moschoglou, Stylianos, Ploumpis, Stylianos, Nicolaou, Mihalis A., Zafeiriou, Stefanos, Hutchison, David, Editorial Board Member, Kanade, Takeo, Editorial Board Member, Kittler, Josef, Editorial Board Member, Kleinberg, Jon M., Editorial Board Member, Mattern, Friedemann, Editorial Board Member, Mitchell, John C., Editorial Board Member, Naor, Moni, Editorial Board Member, Pandu Rangan, C., Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Terzopoulos, Demetri, Editorial Board Member, Tygar, Doug, Editorial Board Member, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Jawahar, C. V., editor, Li, Hongdong, editor, Mori, Greg, editor, and Schindler, Konrad, editor
Published: 2019
Full Text: View/download PDF

22. 3DFaceGAN: Adversarial Nets for 3D Face Representation, Generation, and Translation

Author: Moschoglou, Stylianos, Ploumpis, Stylianos, Nicolaou, Mihalis A., Papaioannou, Athanasios, and Zafeiriou, Stefanos
Published: 2020
Full Text: View/download PDF

23. Synthesizing Coupled 3D Face Modalities by Trunk-Branch Generative Adversarial Networks

Author: Gecer, Baris, primary, Lattas, Alexandros, additional, Ploumpis, Stylianos, additional, Deng, Jiankang, additional, Papaioannou, Athanasios, additional, Moschoglou, Stylianos, additional, and Zafeiriou, Stefanos, additional
Published: 2020
Full Text: View/download PDF

24. Handy: Towards a High Fidelity 3D Hand Shape and Appearance Model

Author: Potamias, Rolandos Alexandros, primary, Ploumpis, Stylianos, additional, Moschoglou, Stylianos, additional, Triantafyllou, Vasileios, additional, and Zafeiriou, Stefanos, additional
Published: 2023
Full Text: View/download PDF

25. FitMe: Deep Photorealistic 3D Morphable Model Avatars

Author: Lattas, Alexandros, primary, Moschoglou, Stylianos, additional, Ploumpis, Stylianos, additional, Gecer, Baris, additional, Deng, Jiankang, additional, and Zafeiriou, Stefanos, additional
Published: 2023
Full Text: View/download PDF

26. Multi-Attribute Probabilistic Linear Discriminant Analysis for 3D Facial Shapes

Author: Moschoglou, Stylianos, primary, Ploumpis, Stylianos, additional, Nicolaou, Mihalis A., additional, and Zafeiriou, Stefanos, additional
Published: 2019
Full Text: View/download PDF

27. AvatarMe++: Facial Shape and BRDF Inference With Photorealistic Rendering-Aware GANs

Author: Lattas, Alexandros, primary, Moschoglou, Stylianos, additional, Ploumpis, Stylianos, additional, Gecer, Baris, additional, Ghosh, Abhijeet, additional, and Zafeiriou, Stefanos, additional
Published: 2022
Full Text: View/download PDF

28. 3D human tongue reconstruction from single 'in-the-wild' images

Author: Ploumpis, Stylianos, Moschoglou, Stylianos, Triantafyllou, Vasileios, and Zafeiriou, Stefanos
Subjects: FOS: Computer and information sciences, Computer Science - Graphics, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Graphics (cs.GR)
Abstract: 3D face reconstruction from a single image is a task that has garnered increased interest in the Computer Vision community, especially due to its broad use in a number of applications such as realistic 3D avatar creation, pose invariant face recognition and face hallucination. Since the introduction of the 3D Morphable Model in the late 90's, we witnessed an explosion of research aiming at particularly tackling this task. Nevertheless, despite the increasing level of detail in the 3D face reconstructions from single images mainly attributed to deep learning advances, finer and highly deformable components of the face such as the tongue are still absent from all 3D face models in the literature, although being very important for the realness of the 3D avatar representations. In this work we present the first, to the best of our knowledge, end-to-end trainable pipeline that accurately reconstructs the 3D face together with the tongue. Moreover, we make this pipeline robust in "in-the-wild" images by introducing a novel GAN method tailored for 3D tongue surface generation. Finally, we make publicly available to the community the first diverse tongue dataset, consisting of 1,800 raw scans of 700 individuals varying in gender, age, and ethnicity backgrounds. As we demonstrate in an extensive series of quantitative as well as qualitative experiments, our model proves to be robust and realistically captures the 3D tongue structure, even in adverse "in-the-wild" conditions. Comment: 10 pages, 9 figures
Published: 2022

29. AvatarMe++: Facial Shape and BRDF Inference With Photorealistic Rendering-Aware GANs.

Author: Lattas, Alexandros, Moschoglou, Stylianos, Ploumpis, Stylianos, Gecer, Baris, Ghosh, Abhijeet, and Zafeiriou, Stefanos
Subjects: *GENERATIVE adversarial networks, *FACE, *TASK analysis
Abstract: Over the last years, with the advent of Generative Adversarial Networks (GANs), many face analysis tasks have accomplished astounding performance, with applications including, but not limited to, face generation and 3D face reconstruction from a single “in-the-wild” image. Nevertheless, to the best of our knowledge, there is no method which can produce render-ready high-resolution 3D faces from “in-the-wild” images and this can be attributed to the: (a) scarcity of available data for training, and (b) lack of robust methodologies that can successfully be applied on very high-resolution data. In this paper, we introduce the first method that is able to reconstruct photorealistic render-ready 3D facial geometry and BRDF from a single “in-the-wild” image. To achieve this, we capture a large dataset of facial shape and reflectance, which we have made public. Moreover, we define a fast and photorealistic differentiable rendering methodology with accurate facial skin diffuse and specular reflection, self-occlusion and subsurface scattering approximation. With this, we train a network that disentangles the facial diffuse and specular reflectance components from a mesh and texture with baked illumination, scanned or reconstructed with a 3DMM fitting method. As we demonstrate in a series of qualitative and quantitative experiments, our method outperforms the existing arts by a significant margin and reconstructs authentic, 4K by 6K-resolution 3D faces from a single low-resolution image, that are ready to be rendered in various applications and bridge the uncanny valley. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

30. Towards a Complete 3D Morphable Model of the Human Head

Author: Ploumpis, Stylianos, primary, Ververas, Evangelos, additional, Sullivan, Eimear Oa, additional, Moschoglou, Stylianos, additional, Wang, Haoyang, additional, Pears, Nick, additional, Smith, William A. P., additional, Gecer, Baris, additional, and Zafeiriou, Stefanos, additional
Published: 2021
Full Text: View/download PDF

31. Deep Polynomial Neural Networks.

Author: Chrysos, Grigorios G., Moschoglou, Stylianos, Bouritsas, Giorgos, Deng, Jiankang, Panagakis, Yannis, and Zafeiriou, Stefanos
Subjects: *ARTIFICIAL neural networks, *CONVOLUTIONAL neural networks, *PETRI nets, *COMPUTER vision, *DEEP learning, *HOPFIELD networks, *NONLINEAR functions, *MACHINE learning
Abstract: Deep convolutional neural networks (DCNNs) are currently the method of choice both for generative, as well as for discriminative learning in computer vision and machine learning. The success of DCNNs can be attributed to the careful selection of their building blocks (e.g., residual blocks, rectifiers, sophisticated normalization schemes, to mention but a few). In this paper, we propose Π-Nets, a new class of function approximators based on polynomial expansions. Π-Nets are polynomial neural networks, i.e., the output is a high-order polynomial of the input. The unknown parameters, which are naturally represented by high-order tensors, are estimated through a collective tensor factorization with factors sharing. We introduce three tensor decompositions that significantly reduce the number of parameters and show how they can be efficiently implemented by hierarchical neural networks. We empirically demonstrate that Π-Nets are very expressive and they even produce good results without the use of non-linear activation functions in a large battery of tasks and signals, i.e., images, graphs, and audio. When used in conjunction with activation functions, Π-Nets produce state-of-the-art results in three challenging tasks, i.e., image generation, face verification and 3D mesh representation learning. The source code is available at https://github.com/grigorisg9gr/polynomial_nets. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

32. Advances in generative modelling: from component analysis to generative adversarial networks

Author: Moschoglou, Stylianos, Zafeiriou, Stefanos, and Imperial College of London
Abstract: This Thesis revolves around datasets and algorithms, with a focus on generative modelling. In particular, we first turn our attention to a novel, multi-attribute, 2D facial dataset. We then present deterministic as well as probabilistic Component Analysis (CA) techniques which can be applied to multi-attribute 2D as well as 3D data. We finally present deep learning generative approaches specially designed to manipulate 3D facial data. Most 2D facial datasets that are available in the literature, are: a) automatically or semi-automatically collected and thus contain noisy labels, hindering the benchmarking and comparisons between algorithms. Moreover, they are not annotated for multiple attributes. In the first part of the Thesis, we present the first manually collected and annotated database, which contains labels for multiple attributes. As we demonstrate in a series of experiments, it can be used in a number of applications ranging from image translation to age-invariant face recognition. Moving on, we turn our attention to CA methodologies. CA approaches, although being able to only capture linear relationships between data, can still be proven to be efficient in data such as UV maps or 3D data registered in a common template, since they are well aligned. The introduction of more complex datasets in the literature, which contain labels for multiple attributes, naturally brought the need for novel algorithms that can simultaneously handle multiple attributes. In this Thesis, we cover novel CA approaches which are specifically designed to be utilised in datasets annotated with respect to multiple attributes and can be used in a variety of tasks, such as 2D image denoising and translation, as well as 3D data generation and identification. Nevertheless, while CA methods are indeed efficient when handling registered 3D facial data, linear 3D generative models lack details when it comes to reconstructing or generating finer facial characteristics. 
To alleviate this, in the final part of this Thesis we propose a novel generative framework harnessing the power of Generative Adversarial Networks. Open Access
Published: 2020

33. Pi-nets: Deep Polynomial Neural Networks

Author: Chrysos, Grigorios G., Moschoglou, Stylianos, Bouritsas, Giorgos, Panagakis, Yannis, Deng, Jiankang, and Zafeiriou, Stefanos
Abstract: Deep Convolutional Neural Networks (DCNNs) are currently the method of choice both for generative, as well as for discriminative learning in computer vision and machine learning. The success of DCNNs can be attributed to the careful selection of their building blocks (e.g., residual blocks, rectifiers, sophisticated normalization schemes, to mention but a few). In this paper, we propose Pi-Nets, a new class of DCNNs. Pi-Nets are polynomial neural networks, i.e., the output is a high-order polynomial of the input. Pi-Nets can be implemented using a special kind of skip connection and their parameters can be represented via high-order tensors. We empirically demonstrate that Pi-Nets have better representation power than standard DCNNs and they even produce good results without the use of non-linear activation functions in a large battery of tasks and signals, i.e., images, graphs, and audio. When used in conjunction with activation functions, Pi-Nets produce state-of-the-art results in challenging tasks, such as image generation. Lastly, our framework elucidates why recent generative models, such as StyleGAN, improve upon their predecessors, e.g., ProGAN.
Published: 2020

34. Deep Polynomial Neural Networks

Author: Chrysos, Grigorios G., primary, Moschoglou, Stylianos, additional, Bouritsas, Giorgos, additional, Deng, Jiankang, additional, Panagakis, Yannis, additional, and Zafeiriou, Stefanos P, additional
Published: 2021
Full Text: View/download PDF

35. Π-nets: Deep Polynomial Neural Networks

Author: Chrysos, Grigorios G., primary, Moschoglou, Stylianos, additional, Bouritsas, Giorgos, additional, Panagakis, Yannis, additional, Deng, Jiankang, additional, and Zafeiriou, Stefanos, additional
Published: 2020
Full Text: View/download PDF

36. AvatarMe: Realistically Renderable 3D Facial Reconstruction “In-the-Wild”

Author: Lattas, Alexandros, primary, Moschoglou, Stylianos, additional, Gecer, Baris, additional, Ploumpis, Stylianos, additional, Triantafyllou, Vasileios, additional, Ghosh, Abhijeet, additional, and Zafeiriou, Stefanos, additional
Published: 2020
Full Text: View/download PDF

37. Multi-Attribute Robust Component Analysis for Facial UV Maps

Author: Moschoglou, Stylianos, primary, Ververas, Evangelos, additional, Panagakis, Yannis, additional, Nicolaou, Mihalis A., additional, and Zafeiriou, Stefanos, additional
Published: 2018
Full Text: View/download PDF

38. Initializing probabilistic linear discriminant analysis

Author: Moschoglou, Stylianos, primary, Nicolaou, Mihalis, additional, Panagakis, Yannis, additional, and Zafeiriou, Stefanos, additional
Published: 2017
Full Text: View/download PDF

39. AgeDB: The First Manually Collected, In-the-Wild Age Database

Author: Moschoglou, Stylianos, primary, Papaioannou, Athanasios, additional, Sagonas, Christos, additional, Deng, Jiankang, additional, Kotsia, Irene, additional, and Zafeiriou, Stefanos, additional
Published: 2017
Full Text: View/download PDF
