Author: "Benes, Bedrich" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Benes, Bedrich"' showing total 558 results

Start Over Author "Benes, Bedrich"

558 results on '"Benes, Bedrich"'

1. RGB2Point: 3D Point Cloud Generation from Single RGB Images

Author: Lee, Jae Joong and Benes, Bedrich
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We introduce RGB2Point, an unposed single-view RGB image to a 3D point cloud generation based on Transformer. RGB2Point takes an input image of an object and generates a dense 3D point cloud. Contrary to prior works based on CNN layers and diffusion denoising approaches, we use pre-trained Transformer layers that are fast and generate high-quality point clouds with consistent quality over available categories. Our generated point clouds demonstrate high quality on a real-world dataset, as evidenced by improved Chamfer distance (51.15%) and Earth Mover's distance (45.96%) metrics compared to the current state-of-the-art. Additionally, our approach shows a better quality on a synthetic dataset, achieving better Chamfer distance (39.26%), Earth Mover's distance (26.95%), and F-score (47.16%). Moreover, our method produces 63.1% more consistent high-quality results across various object categories compared to prior works. Furthermore, RGB2Point is computationally efficient, requiring only 2.3GB of VRAM to reconstruct a 3D point cloud from a single RGB image, and our implementation generates the results 15,133x faster than a SOTA diffusion-based model., Comment: Accepted to IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2025
Published: 2024

2. Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors

Author: Lee, Jae Joong, Li, Bosheng, Beery, Sara, Huang, Jonathan, Fei, Songlin, Yeh, Raymond A., and Benes, Bedrich
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We introduce Tree D-fusion, featuring the first collection of 600,000 environmentally aware, 3D simulation-ready tree models generated through Diffusion priors. Each reconstructed 3D tree model corresponds to an image from Google's Auto Arborist Dataset, comprising street view images and associated genus labels of trees across North America. Our method distills the scores of two tree-adapted diffusion models by utilizing text prompts to specify a tree genus, thus facilitating shape reconstruction. This process involves reconstructing a 3D tree envelope filled with point markers, which are subsequently utilized to estimate the tree's branching structure using the space colonization algorithm conditioned on a specified genus., Comment: Accepted to ECCV24
Published: 2024

3. Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors

Author: Lee, Jae Joong, Li, Bosheng, Beery, Sara, Huang, Jonathan, Fei, Songlin, Yeh, Raymond A., Benes, Bedrich, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Leonardis, Aleš, editor, Ricci, Elisa, editor, Roth, Stefan, editor, Russakovsky, Olga, editor, Sattler, Torsten, editor, and Varol, Gül, editor
Published: 2025
Full Text: View/download PDF

4. LAESI: Leaf Area Estimation with Synthetic Imagery

Author: Kałużny, Jacek, Schreckenberg, Yannik, Cyganik, Karol, Annighöfer, Peter, Pirk, Sören, Michels, Dominik L., Cieslak, Mikolaj, Assaad-Gerbert, Farhah, Benes, Bedrich, and Pałubicki, Wojciech
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Graphics, Computer Science - Machine Learning, 68T07, 68T45, I.2.10, I.4.6
Abstract: We introduce LAESI, a Synthetic Leaf Dataset of 100,000 synthetic leaf images on millimeter paper, each with semantic masks and surface area labels. This dataset provides a resource for leaf morphology analysis primarily aimed at beech and oak leaves. We evaluate the applicability of the dataset by training machine learning models for leaf surface area prediction and semantic segmentation, using real images for validation. Our validation shows that these models can be trained to predict leaf surface area with a relative error not greater than an average human annotator. LAESI also provides an efficient framework based on 3D procedural models and generative AI for the large-scale, controllable generation of data with potential further applications in agriculture and biology. We evaluate the inclusion of generative AI in our procedural data generation pipeline and show how data filtering based on annotation consistency results in datasets which allow training the highest performing vision models., Comment: 10 pages, 12 figures, 1 table
Published: 2024

5. Hands-Free VR

Author: Fernandez, Jorge Askur Vazquez, Lee, Jae Joong, Vacca, Santiago Andrés Serrano, Magana, Alejandra, Pesam, Radim, Benes, Bedrich, and Popescu, Voicu
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: The paper introduces Hands-Free VR, a voice-based natural-language interface for VR. The user gives a command using their voice, the speech audio data is converted to text using a speech-to-text deep learning model that is fine-tuned for robustness to word phonetic similarity and to spoken English accents, and the text is mapped to an executable VR command using a large language model that is robust to natural language diversity. Hands-Free VR was evaluated in a controlled within-subjects study (N = 22) that asked participants to find specific objects and to place them in various configurations. In the control condition participants used a conventional VR user interface to grab, carry, and position the objects using the handheld controllers. In the experimental condition participants used Hands-Free VR. The results confirm that: (1) Hands-Free VR is robust to spoken English accents, as for 20 of our participants English was not their first language, and to word phonetic similarity, correctly transcribing the voice command 96.71% of the time; (2) Hands-Free VR is robust to natural language diversity, correctly mapping the transcribed command to an executable command in 97.83% of the time; (3) Hands-Free VR had a significant efficiency advantage over the conventional VR interface in terms of task completion time, total viewpoint translation, total view direction rotation, and total left and right hand translations; (4) Hands-Free VR received high user preference ratings in terms of ease of use, intuitiveness, ergonomics, reliability, and desirability., Comment: The first two authors contributed equally. Accepted VISIGRAPP@HUCAPP 2025
Published: 2024

6. DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision

Author: Ling, Lu, Sheng, Yichen, Tu, Zhi, Zhao, Wentian, Xin, Cheng, Wan, Kun, Yu, Lantao, Guo, Qianyu, Yu, Zixun, Lu, Yawen, Li, Xuanmao, Sun, Xingpeng, Ashok, Rohan, Mukherjee, Aniruddha, Kang, Hao, Kong, Xiangrui, Hua, Gang, Zhang, Tianyi, Benes, Bedrich, and Bera, Aniket
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: We have witnessed significant progress in deep learning-based 3D vision, ranging from neural radiance field (NeRF) based 3D representation learning to applications in novel view synthesis (NVS). However, existing scene-level datasets for deep learning-based 3D vision, limited to either synthetic environments or a narrow selection of real-world scenes, are quite insufficient. This insufficiency not only hinders a comprehensive benchmark of existing methods but also caps what could be explored in deep learning-based 3D analysis. To address this critical gap, we present DL3DV-10K, a large-scale scene dataset, featuring 51.2 million frames from 10,510 videos captured from 65 types of point-of-interest (POI) locations, covering both bounded and unbounded scenes, with different levels of reflection, transparency, and lighting. We conducted a comprehensive benchmark of recent NVS methods on DL3DV-10K, which revealed valuable insights for future research in NVS. In addition, we have obtained encouraging results in a pilot study to learn generalizable NeRF from DL3DV-10K, which manifests the necessity of a large-scale scene-level dataset to forge a path toward a foundation model for learning 3D representation. Our DL3DV-10K dataset, benchmark results, and models will be publicly accessible at https://dl3dv-10k.github.io/DL3DV-10K/.
Published: 2023

7. Dr.Bokeh: DiffeRentiable Occlusion-aware Bokeh Rendering

Author: Sheng, Yichen, Yu, Zixun, Ling, Lu, Cao, Zhiwen, Zhang, Cecilia, Lu, Xin, Xian, Ke, Lin, Haiting, and Benes, Bedrich
Subjects: Computer Science - Graphics
Abstract: Bokeh is widely used in photography to draw attention to the subject while effectively isolating distractions in the background. Computational methods simulate bokeh effects without relying on a physical camera lens. However, in the realm of digital bokeh synthesis, the two main challenges for bokeh synthesis are color bleeding and partial occlusion at object boundaries. Our primary goal is to overcome these two major challenges using physics principles that define bokeh formation. To achieve this, we propose a novel and accurate filtering-based bokeh rendering equation and a physically-based occlusion-aware bokeh renderer, dubbed Dr.Bokeh, which addresses the aforementioned challenges during the rendering stage without the need of post-processing or data-driven approaches. Our rendering algorithm first preprocesses the input RGBD to obtain a layered scene representation. Dr.Bokeh then takes the layered representation and user-defined lens parameters to render photo-realistic lens blur. By softening non-differentiable operations, we make Dr.Bokeh differentiable such that it can be plugged into a machine-learning framework. We perform quantitative and qualitative evaluations on synthetic and real-world images to validate the effectiveness of the rendering quality and the differentiability of our method. We show Dr.Bokeh not only outperforms state-of-the-art bokeh rendering algorithms in terms of photo-realism but also improves the depth quality from depth-from-defocus.
Published: 2023

8. DeepTree: Modeling Trees with Situated Latents

Author: Zhou, Xiaochen, Li, Bosheng, Benes, Bedrich, Fei, Songlin, and Pirk, Sören
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics
Abstract: In this paper, we propose DeepTree, a novel method for modeling trees based on learning developmental rules for branching structures instead of manually defining them. We call our deep neural model situated latent because its behavior is determined by the intrinsic state -- encoded as a latent space of a deep neural model -- and by the extrinsic (environmental) data that is situated as the location in the 3D space and on the tree structure. We use a neural network pipeline to train a situated latent space that allows us to locally predict branch growth only based on a single node in the branch graph of a tree model. We use this representation to progressively develop new branch nodes, thereby mimicking the growth process of trees. Starting from a root node, a tree is generated by iteratively querying the neural network on the newly added nodes resulting in the branching structure of the whole tree. Our method enables generating a wide variety of tree shapes without the need to define intricate parameters that control their growth and behavior. Furthermore, we show that the situated latents can also be used to encode the environmental response of tree models, e.g., when trees grow next to obstacles. We validate the effectiveness of our method by measuring the similarity of our tree models and by procedurally generated ones based on a number of established metrics for tree form.
Published: 2023

9. StyleDEM: a Versatile Model for Authoring Terrains

Author: Perche, Simon, Peytavie, Adrien, Benes, Bedrich, Galin, Eric, and Guérin, Eric
Subjects: Computer Science - Graphics, Computer Science - Artificial Intelligence
Abstract: Many terrain modelling methods have been proposed for the past decades, providing efficient and often interactive authoring tools. However, they generally do not include any notion of style, which is a critical aspect for designers in the entertainment industry. We introduce StyleDEM, a new generative adversarial network method for terrain synthesis and authoring, with a versatile toolbox of authoring methods with style. This method starts from an input sketch or an existing terrain. It outputs a terrain with features that can be authored using interactive brushes and enhanced with additional tools such as style manipulation or super-resolution. The strength of our approach resides in the versatility and interoperability of the toolbox.
Published: 2023

10. SnakeVoxFormer: Transformer-based Single Image\\Voxel Reconstruction with Run Length Encoding

Author: Lee, Jae Joong and Benes, Bedrich
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Deep learning-based 3D object reconstruction has achieved unprecedented results. Among those, the transformer deep neural model showed outstanding performance in many applications of computer vision. We introduce SnakeVoxFormer, a novel, 3D object reconstruction in voxel space from a single image using the transformer. The input to SnakeVoxFormer is a 2D image, and the result is a 3D voxel model. The key novelty of our approach is in using the run-length encoding that traverses (like a snake) the voxel space and encodes wide spatial differences into a 1D structure that is suitable for transformer encoding. We then use dictionary encoding to convert the discovered RLE blocks into tokens that are used for the transformer. The 1D representation is a lossless 3D shape data compression method that converts to 1D data that use only about 1% of the original data size. We show how different voxel traversing strategies affect the effect of encoding and reconstruction. We compare our method with the state-of-the-art for 3D voxel reconstruction from images and our method improves the state-of-the-art methods by at least 2.8% and up to 19.8%.
Published: 2023

11. PixHt-Lab: Pixel Height Based Light Effect Generation for Image Compositing

Author: Sheng, Yichen, Zhang, Jianming, Philip, Julien, Hold-Geoffroy, Yannick, Sun, Xin, Zhang, HE, Ling, Lu, and Benes, Bedrich
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics
Abstract: Lighting effects such as shadows or reflections are key in making synthetic images realistic and visually appealing. To generate such effects, traditional computer graphics uses a physically-based renderer along with 3D geometry. To compensate for the lack of geometry in 2D Image compositing, recent deep learning-based approaches introduced a pixel height representation to generate soft shadows and reflections. However, the lack of geometry limits the quality of the generated soft shadows and constrain reflections to pure specular ones. We introduce PixHt-Lab, a system leveraging an explicit mapping from pixel height representation to 3D space. Using this mapping, PixHt-Lab reconstructs both the cutout and background geometry and renders realistic, diverse, lighting effects for image compositing. Given a surface with physically-based materials, we can render reflections with varying glossiness. To generate more realistic soft shadows, we further propose to use 3D-aware buffer channels to guide a neural renderer. Both quantitative and qualitative evaluations demonstrate that PixHt-Lab significantly improves soft shadow generation., Comment: 11 pages, 10 figures
Published: 2023

12. Controllable Shadow Generation Using Pixel Height Maps

Author: Sheng, Yichen, Liu, Yifan, Zhang, Jianming, Yin, Wei, Oztireli, A. Cengiz, Zhang, He, Lin, Zhe, Shechtman, Eli, and Benes, Bedrich
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics
Abstract: Shadows are essential for realistic image compositing. Physics-based shadow rendering methods require 3D geometries, which are not always available. Deep learning-based shadow synthesis methods learn a mapping from the light information to an object's shadow without explicitly modeling the shadow geometry. Still, they lack control and are prone to visual artifacts. We introduce pixel heigh, a novel geometry representation that encodes the correlations between objects, ground, and camera pose. The pixel height can be calculated from 3D geometries, manually annotated on 2D images, and can also be predicted from a single-view RGB image by a supervised approach. It can be used to calculate hard shadows in a 2D image based on the projective geometry, providing precise control of the shadows' direction and shape. Furthermore, we propose a data-driven soft shadow generator to apply softness to a hard shadow based on a softness input parameter. Qualitative and quantitative evaluations demonstrate that the proposed pixel height significantly improves the quality of the shadow generation while allowing for controllability., Comment: 15 pages, 11 figures
Published: 2022

13. Co-design of Embodied Neural Intelligence via Constrained Evolution

Author: Wang, Zhiquan, Benes, Bedrich, Qureshi, Ahmed H., and Mousas, Christos
Subjects: Computer Science - Artificial Intelligence, Computer Science - Graphics, Computer Science - Neural and Evolutionary Computing, Computer Science - Robotics
Abstract: We introduce a novel co-design method for autonomous moving agents' shape attributes and locomotion by combining deep reinforcement learning and evolution with user control. Our main inspiration comes from evolution, which has led to wide variability and adaptation in Nature and has the potential to significantly improve design and behavior simultaneously. Our method takes an input agent with optional simple constraints such as leg parts that should not evolve or allowed ranges of changes. It uses physics-based simulation to determine its locomotion and finds a behavior policy for the input design, later used as a baseline for comparison. The agent is then randomly modified within the allowed ranges creating a new generation of several hundred agents. The generation is trained by transferring the previous policy, which significantly speeds up the training. The best-performing agents are selected, and a new generation is formed using their crossover and mutations. The next generations are then trained until satisfactory results are reached. We show a wide variety of evolved agents, and our results show that even with only 10% of changes, the overall performance of the evolved agents improves 50%. If more significant changes to the initial design are allowed, our experiments' performance improves even more to 150%. Contrary to related work, our co-design works on a single GPU and provides satisfactory results by training thousands of agents within one hour.
Published: 2022

14. Embodied immersive virtual reality to enhance the conceptual understanding of charged particles: A qualitative study

Author: Acevedo, Pedro, Magana, Alejandra J., Walsh, Yoselyn, Will, Hector, Benes, Bedrich, and Mousas, Christos
Published: 2024
Full Text: View/download PDF

15. PlantSegNet: 3D point cloud instance segmentation of nearby plant organs with identical semantics

Author: Zarei, Ariyan, Li, Bosheng, Schnable, James C., Lyons, Eric, Pauli, Duke, Barnard, Kobus, and Benes, Bedrich
Published: 2024
Full Text: View/download PDF

16. Procedural Urban Forestry

Author: Niese, Till, Pirk, Sören, Albrecht, Matthias, Benes, Bedrich, and Deussen, Oliver
Subjects: Computer Science - Graphics, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: The placement of vegetation plays a central role in the realism of virtual scenes. We introduce procedural placement models (PPMs) for vegetation in urban layouts. PPMs are environmentally sensitive to city geometry and allow identifying plausible plant positions based on structural and functional zones in an urban layout. PPMs can either be directly used by defining their parameters or can be learned from satellite images and land register data. Together with approaches for generating buildings and trees, this allows us to populate urban landscapes with complex 3D vegetation. The effectiveness of our framework is shown through examples of large-scale city scenes and close-ups of individually grown tree models; we also validate it by a perceptual user study., Comment: 14 pages
Published: 2020

17. SSN: Soft Shadow Network for Image Compositing

Author: Sheng, Yichen, Zhang, Jianming, and Benes, Bedrich
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, Electrical Engineering and Systems Science - Image and Video Processing, 68T45, 68U05, I.3.3, I.2.10
Abstract: We introduce an interactive Soft Shadow Network (SSN) to generates controllable soft shadows for image compositing. SSN takes a 2D object mask as input and thus is agnostic to image types such as painting and vector art. An environment light map is used to control the shadow's characteristics, such as angle and softness. SSN employs an Ambient Occlusion Prediction module to predict an intermediate ambient occlusion map, which can be further refined by the user to provides geometric cues to modulate the shadow generation. To train our model, we design an efficient pipeline to produce diverse soft shadow training data using 3D object models. In addition, we propose an inverse shadow map representation to improve model training. We demonstrate that our model produces realistic soft shadows in real-time. Our user studies show that the generated shadows are often indistinguishable from shadows calculated by a physics-based renderer and users can easily use SSN through an interactive application to generate specific shadow effects in minutes., Comment: 11 pages, 10 figures
Published: 2020

18. LeRoP: A Learning-Based Modular Robot Photography Framework

Author: Kang, Hao, Zhang, Jianming, Li, Haoxiang, Lin, Zhe, Rhodes, TJ, and Benes, Bedrich
Subjects: Computer Science - Robotics
Abstract: We introduce a novel framework for automatic capturing of human portraits. The framework allows the robot to follow a person to the desired location using a Person Re-identification model. When composing is activated, the robot attempts to adjust its position to form the view that can best match the given template image, and finally takes a photograph. A template image can be predicted dynamically using an off-the-shelf photo evaluation model by the framework, or selected manually from a pre-defined set by the user. The template matching-based view adjustment is driven by a deep reinforcement learning network. Our framework lies on top of the Robot Operating System (ROS). The framework is designed to be modular so that all the models can be flexibly replaced based on needs. We show our framework on a variety of examples. In particular, we tested it in three indoor scenes and used it to take 20 photos of each scene: ten for the pre-defined template, ten for the dynamically generated ones. The average number of adjustment was $11.20$ for pre-defined templates and $12.76$ for dynamically generated ones; the average time spent was $22.11$ and $24.10$ seconds respectively.
Published: 2019

19. PTRM: Perceived Terrain Realism Metrics

Author: Rajasekaran, Suren Deepak, Kang, Hao, Benes, Bedrich, Čadík, Martin, Galin, Eric, Guérin, Eric, Peytavie, Adrien, and Slavík, Pavel
Subjects: Computer Science - Graphics
Abstract: Terrains are visually important and commonly used in computer graphics. While many algorithms for their generation exist, it is difficult to assess the realism of a generated terrain. This paper presents a first step in the direction of perceptual evaluation of terrain models. We gathered and categorized several classes of real terrains and we generated synthetic terrains by using methods from computer graphics. We then conducted two large studies ranking the terrains perceptually and showing that the synthetic terrains are perceived as lacking realism as compared to the real ones. Then we provide insight into the features that affect the perceived realism by a quantitative evaluation based on localized geomorphology-based landform features (geomorphons) that categorize terrain structures such as valleys, ridges, hollows, etc. We show that the presence or absence of certain features have a significant perceptual effect. We then introduce Perceived Terrain Realism Metrics (PTRM); a perceptual metrics that estimates perceived realism of a terrain represented as a digital elevation map by relating distribution of terrain features with their perceived realism. We validated PTRM on real and synthetic data and compared it to the perceptual studies. To confirm the importance of the presence of these features, we used a generative deep neural network to transfer them between real terrains and synthetic ones and we performed another perceptual experiment that further confirmed their importance for perceived realism., Comment: 13 pages, 12 figures
Published: 2019

20. Efficient Project Gradient Descent for Ensemble Adversarial Attack

Author: Wu, Fanyou, Gazo, Rado, Haviarova, Eva, and Benes, Bedrich
Subjects: Computer Science - Machine Learning, Computer Science - Cryptography and Security, Statistics - Machine Learning
Abstract: Recent advances show that deep neural networks are not robust to deliberately crafted adversarial examples which many are generated by adding human imperceptible perturbation to clear input. Consider $l_2$ norms attacks, Project Gradient Descent (PGD) and the Carlini and Wagner (C\&W) attacks are the two main methods, where PGD control max perturbation for adversarial examples while C\&W approach treats perturbation as a regularization term optimized it with loss function together. If we carefully set parameters for any individual input, both methods become similar. In general, PGD attacks perform faster but obtains larger perturbation to find adversarial examples than the C\&W when fixing the parameters for all inputs. In this report, we propose an efficient modified PGD method for attacking ensemble models by automatically changing ensemble weights and step size per iteration per input. This method generates smaller perturbation adversarial examples than PGD method while remains efficient as compared to C\&W method. Our method won the first place in IJCAI19 Targeted Adversarial Attack competition., Comment: 6 pages, 2 figures, submit to IJCAI 19 AIBS workshop
Published: 2019

21. The Positivity Principle: Do Positive Instructors Improve Learning from Video Lectures?

Author: Lawson, Alyssa P., Mayer, Richard E., Adamo-Villani, Nicoletta, Benes, Bedrich, Lei, Xingyu, and Cheng, Justin
Abstract: The positivity principle states that people learn better from instructors who display positive emotions rather than negative emotions. In two experiments, students viewed a short video lecture on a statistics topic in which an instructor stood next to a series of slides as she lectured and then they took either an immediate test (Experiment 1) or a delayed test (Experiment 2). In a between-subjects design, students saw an instructor who used her voice, body movement, gesture, facial expression, and eye gaze to display one of four emotions while lecturing: happy (positive/active), content (positive/passive), frustrated (negative/active), or bored (negative/passive). First, learners were able to recognize the emotional tone of the instructor in an instructional video lecture, particularly by more strongly rating a positive instructor as displaying positive emotions and a negative instructor as displaying negative emotions (in Experiments 1 and 2). Second, concerning building a social connection during learning, learners rated a positive instructor as more likely to facilitate learning, more credible, and more engaging than a negative instructor (in Experiments 1 and 2). Third, concerning cognitive engagement during learning, learners reported paying more attention during learning for a positive instructor than a negative instructor (in Experiments 1 and 2). Finally, concerning learning outcome, learners who had a positive instructor scored higher than learners who had a negative instructor on a delayed posttest (Experiment 2) but not an immediate posttest (Experiment 1). Overall, there is evidence for the positivity principle and the cognitive-affective model of e-learning from which it is derived.
Published: 2021
Full Text: View/download PDF

22. Vision UFormer: Long-range monocular absolute depth estimation

Author: Polasek, Tomas, Čadík, Martin, Keller, Yosi, and Benes, Bedrich
Published: 2023
Full Text: View/download PDF

23. Multi-view triangulation without correspondences

Author: Gaillard, Mathieu, Benes, Bedrich, Tross, Michael C., and Schnable, James C.
Published: 2023
Full Text: View/download PDF

24. Automated tree ring detection of common Indiana hardwood species through deep learning: Introducing a new dataset of annotated images

Author: Wu, Fanyou, Huang, Yunmei, Benes, Bedrich, Warner, Charles C., and Gazo, Rado
Published: 2023
Full Text: View/download PDF

25. Urban tree generator: spatio-temporal and generative deep learning for urban tree localization and modeling

Author: Firoze, Adnan, Benes, Bedrich, and Aliaga, Daniel
Published: 2022
Full Text: View/download PDF

26. Do Learners Recognize and Relate to the Emotions Displayed by Virtual Instructors?

Author: Lawson, Alyssa P., Mayer, Richard E., Adamo-Villani, Nicoletta, Benes, Bedrich, Lei, Xingyu, and Cheng, Justin
Abstract: There has been much research on the effectiveness of animated pedagogical agents in an educational context, however there is little research about how the emotions they display contribute to a learner's understanding of the lesson. The positivity principle suggests that learners should learn better from instructors with positive emotions compared to those with negative emotions. Additionally, the media equation theory (Reeves and Nass 1996) would suggest this principle should be true for animated instructors as well. In an experiment, students viewed a lesson on binomial probability taught by an animated instructor who was happy (positive/active), content (positive/passive), frustrated (negative/active), or bored (negative/passive). Learners were able to recognize positive from negative emotions, rated the positive instructors as better at facilitating learning, more credible, more human-like, and more engaging. Additionally, learners who saw positive instructors indicated they tried to pay attention to the lesson and enjoyed the lesson more than those who saw negative instructors. However, learners who saw positive instructors did not perform better on a delayed test than those who saw negative instructors. This suggests that learners recognize and react to the emotions of the virtual instructors, but research is needed to determine how the emotions displayed by virtual instructors can promote better learning outcomes.
Published: 2021
Full Text: View/download PDF

27. Unerosion: Simulating Terrain Evolution Back in Time.

Author: Yang, Zhanyu, Cordonnier, Guillaume, Cani, Marie‐Paule, Perrenoud, Christian, and Benes, Bedrich
Subjects: RIVER channels, FLUVIAL geomorphology, CLIMATE change, SEDIMENTATION & deposition, PALEOGEOGRAPHY
Abstract: While the past of terrain cannot be known precisely because an effect can result from many different causes, exploring these possible pasts opens the way to numerous applications ranging from movies and games to paleogeography. We introduce unerosion, an attempt to recover plausible past topographies from an input terrain represented as a height field. Our solution relies on novel algorithms for the backward simulation of different processes: fluvial erosion, sedimentation, and thermal erosion. This is achieved by re‐formulating the equations of erosion and sedimentation so that they can be simulated back in time. These algorithms can be combined to account for a succession of climate changes backward in time, while the possible ambiguities provide editing options to the user. Results show that our solution can approximately reverse different types of erosion while enabling users to explore a variety of alternative pasts. Using a chronology of climatic periods to inform us about the main erosion phenomena, we also went back in time using real measured terrain data. We checked the consistency with geological findings, namely the height of river beds hundreds of thousands of years ago. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

28. Perceived Naturalness of Interpolation Methods for Character Upper Body Animation

Author: Lei, Xingyu, Adamo-Villani, Nicoletta, Benes, Bedrich, Wang, Zhiquan, Meyer, Zachary, Mayer, Richard, Lawson, Alyssa, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Bebis, George, editor, Athitsos, Vassilis, editor, Yan, Tong, editor, Lau, Manfred, editor, Li, Frederick, editor, Shi, Conglei, editor, Yuan, Xiaoru, editor, Mousas, Christos, editor, and Bruder, Gerd, editor
Published: 2021
Full Text: View/download PDF

29. Bodily Expression of Emotions in Animated Agents

Author: Meyer, Zachary, Adamo, Nicoletta, Benes, Bedrich, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Bebis, George, editor, Athitsos, Vassilis, editor, Yan, Tong, editor, Lau, Manfred, editor, Li, Frederick, editor, Shi, Conglei, editor, Yuan, Xiaoru, editor, Mousas, Christos, editor, and Bruder, Gerd, editor
Published: 2021
Full Text: View/download PDF

30. Multimodal Affective Pedagogical Agents for Different Types of Learners

Author: Adamo, Nicoletta, Benes, Bedrich, Mayer, Richard E., Lei, Xingyu, Wang, Zhiquan, Meyer, Zachary, Lawson, Alyssa, Kacprzyk, Janusz, Series Editor, Pal, Nikhil R., Advisory Editor, Bello Perez, Rafael, Advisory Editor, Corchado, Emilio S., Advisory Editor, Hagras, Hani, Advisory Editor, Kóczy, László T., Advisory Editor, Kreinovich, Vladik, Advisory Editor, Lin, Chin-Teng, Advisory Editor, Lu, Jie, Advisory Editor, Melin, Patricia, Advisory Editor, Nedjah, Nadia, Advisory Editor, Nguyen, Ngoc Thanh, Advisory Editor, Wang, Jun, Advisory Editor, Russo, Dario, editor, Ahram, Tareq, editor, Karwowski, Waldemar, editor, Di Bucchianico, Giuseppe, editor, and Taiar, Redha, editor
Published: 2021
Full Text: View/download PDF

31. Deep BarkID: a portable tree bark identification system by knowledge distillation

Author: Wu, Fanyou, Gazo, Rado, Benes, Bedrich, and Haviarova, Eva
Published: 2021
Full Text: View/download PDF

32. Error-Bounded and Feature Preserving Surface Remeshing with Minimal Angle Improvement

Author: Hu, Kaimo, Yan, Dong-Ming, Bommes, David, Alliez, Pierre, and Benes, Bedrich
Subjects: Computer Science - Graphics, Computer Science - Computational Geometry
Abstract: The typical goal of surface remeshing consists in finding a mesh that is (1) geometrically faithful to the original geometry, (2) as coarse as possible to obtain a low-complexity representation and (3) free of bad elements that would hamper the desired application. In this paper, we design an algorithm to address all three optimization goals simultaneously. The user specifies desired bounds on approximation error {\delta}, minimal interior angle {\theta} and maximum mesh complexity N (number of vertices). Since such a desired mesh might not even exist, our optimization framework treats only the approximation error bound {\delta} as a hard constraint and the other two criteria as optimization goals. More specifically, we iteratively perform carefully prioritized local operators, whenever they do not violate the approximation error bound and improve the mesh otherwise. In this way our optimization framework greedily searches for the coarsest mesh with minimal interior angle above {\theta} and approximation error bounded by {\delta}. Fast runtime is enabled by a local approximation error estimation, while implicit feature preservation is obtained by specifically designed vertex relocation operators. Experiments show that our approach delivers high-quality meshes with implicitly preserved features and better balances between geometric fidelity, mesh complexity and element quality than the state-of-the-art., Comment: 14 pages, 20 figures. Submitted to IEEE Transactions on Visualization and Computer Graphics
Published: 2016

33. Understanding and Exploiting Object Interaction Landscapes

Author: Pirk, Sören, Krs, Vojtech, Hu, Kaimo, Rajasekaran, Suren Deepak, Kang, Hao, Benes, Bedrich, Yoshiyasu, Yusuke, and Guibas, Leonidas J.
Subjects: Computer Science - Graphics, Computer Science - Computational Geometry, Computer Science - Computer Vision and Pattern Recognition
Abstract: Interactions play a key role in understanding objects and scenes, for both virtual and real world agents. We introduce a new general representation for proximal interactions among physical objects that is agnostic to the type of objects or interaction involved. The representation is based on tracking particles on one of the participating objects and then observing them with sensors appropriately placed in the interaction volume or on the interaction surfaces. We show how to factorize these interaction descriptors and project them into a particular participating object so as to obtain a new functional descriptor for that object, its interaction landscape, capturing its observed use in a spatio-temporal framework. Interaction landscapes are independent of the particular interaction and capture subtle dynamic effects in how objects move and behave when in functional use. Our method relates objects based on their function, establishes correspondences between shapes based on functional key points and regions, and retrieves peer and partner objects with respect to an interaction., Comment: 14 pages, 19 figures
Published: 2016

34. Evolution-Based Shape and Behavior Co-Design of Virtual Agents

Author: Wang, Zhiquan, Benes, Bedrich, Qureshi, Ahmed H., and Mousas, Christos
Abstract: We introduce a novel co-design method for autonomous moving agents’ shape attributes and locomotion by combining deep reinforcement learning and evolution with user control. Our main inspiration comes from evolution, which has led to wide variability and adaptation in Nature and has significantly improved design and behavior simultaneously. Our method takes an input agent with optional user-defined constraints, such as leg parts that should not evolve or are only within the allowed ranges of changes. It uses physics-based simulation to determine its locomotion and finds a behavior policy for the input design that is used as a baseline for comparison. The agent is randomly modified within the allowed ranges, creating a new generation of several hundred agents. The generation is trained by transferring the previous policy, which significantly speeds up the training. The best-performing agents are selected, and a new generation is formed using their crossover and mutations. The next generations are then trained until satisfactory results are reached. We show a wide variety of evolved agents, and our results show that even with only 10% of allowed changes, the overall performance of the evolved agents improves by 50%. If more significant changes to the initial design are allowed, our experiments’ performance will improve even more to 150%. Our method significantly improved motion tasks without changing body structures, and it does not require considerable computation resources as it works on a single GPU and provides results by training thousands of agents within 30 minutes.
Published: 2024
Full Text: View/download PDF

35. Edge-based procedural textures

Author: Kim, Hansoo, Dischler, Jean-Michel, Rushmeier, Holly, and Benes, Bedrich
Published: 2021
Full Text: View/download PDF

36. An algorithm for automatic dormant tree pruning

Author: Kolmanič, Simon, Strnad, Damjan, Kohek, Štefan, Benes, Bedrich, Hirst, Peter, and Žalik, Borut
Published: 2021
Full Text: View/download PDF

37. Sorghum Segmentation by Skeleton Extraction

Author: Gaillard, Mathieu, Miao, Chenyong, Schnable, James, Benes, Bedrich, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Bartoli, Adrien, editor, and Fusiello, Andrea, editor
Published: 2020
Full Text: View/download PDF

38. Deep Learning-Based Emotion Recognition from Real-Time Videos

Author: Zhou, Wenbin, Cheng, Justin, Lei, Xingyu, Benes, Bedrich, Adamo, Nicoletta, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, and Kurosu, Masaaki, editor
Published: 2020
Full Text: View/download PDF

39. The Effects of Body Gestures and Gender on Viewer’s Perception of Animated Pedagogical Agent’s Emotions

Author: Cheng, Justin, Zhou, Wenbin, Lei, Xingyu, Adamo, Nicoletta, Benes, Bedrich, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, and Kurosu, Masaaki, editor
Published: 2020
Full Text: View/download PDF

40. Virtual and Augmented Reality in Science, Technology, Engineering, and Mathematics (STEM) Education: An Umbrella Review.

Author: Zhang, Yiqun, Feijoo-Garcia, Miguel A., Gu, Yiyin, Popescu, Voicu, Benes, Bedrich, and Magana, Alejandra J.
Subjects: STEM education, EDUCATIONAL technology, EDUCATIONAL benefits, AUGMENTED reality, ACADEMIC motivation
Abstract: The application of extended reality (XR) technology in education has been growing for the last two decades. XR offers immersive and interactive visualization experiences that can enhance learning by making it engaging. Recent technological advances have led to the availability of high-quality and affordable XR headsets. These advancements have spurred a wave of research focused on designing, implementing, and validating XR educational interventions. Limited literature focuses on the recent trends of XR within science, technology, engineering, and mathematics (STEM) education. Thus, this paper presents an umbrella review that explores the exploding field of XR and its transformative potential in STEM education. Using six online databases, the review zoomed in on 17 out of 1972 papers on XR for STEM education, published between 2020 and 2023, following the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) guidelines. The results highlighted the types of XR technology applied (i.e., virtual reality and augmented reality), the specific STEM disciplines involved, the focus of each study reviewed, and the major findings from recent reviews. Overall, the educational benefits of using XR technology in STEM education are apparent: XR boosts student motivation, facilitates learning engagement, and improves skills, for example. However, using XR in education still has challenges that must be addressed, such as the physical discomfort of the learner wearing the XR headset and technical glitches. Besides revealing trends of using XR in STEM education, this umbrella review encourages reflection on current practices and suggests ways to apply XR to STEM education effectively. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

41. A framework for multi-objective optimization of virtual tree pruning based on growth simulation

Author: Strnad, Damjan, Kohek, Štefan, Benes, Bedrich, Kolmanič, Simon, and Žalik, Borut
Published: 2020
Full Text: View/download PDF

42. An output-driven approach to design a swarming model for architectural indoor environments

Author: Mathew, C. D. Tharindu, Benes, Bedrich, and Aliaga, Daniel G.
Published: 2020
Full Text: View/download PDF

43. A fast pith detection for computed tomography scanned hardwood logs

Author: Gazo, Rado, Vanek, Juraj, Abdul_Massih, Michel, and Benes, Bedrich
Published: 2020
Full Text: View/download PDF

44. CentralBark Image Dataset and Tree Species Classification Using Deep Learning

Author: Warner, Charles, primary, Wu, Fanyou, additional, Gazo, Rado, additional, Benes, Bedrich, additional, Kong, Nicole, additional, and Fei, Songlin, additional
Published: 2024
Full Text: View/download PDF

45. An XR Environment for AI Education: Design and First Implementation

Author: Gu, Yiyin, primary, Garcia, Miguel, additional, Zhang, Yiqun, additional, Magana, Alejandra J., additional, Benes, Bedrich, additional, and Popescu, Voicu, additional
Published: 2024
Full Text: View/download PDF

46. Character motion in function space

Author: Yoo, Innfarn, Fišer, Marek, Hu, Kaimo, and Benes, Bedrich
Published: 2021
Full Text: View/download PDF

47. Wood identification based on longitudinal section images by using deep learning

Author: Wu, Fanyou, Gazo, Rado, Haviarova, Eva, and Benes, Bedrich
Published: 2021
Full Text: View/download PDF

48. DeepTree: Modeling Trees With Situated Latents

Author: Zhou, Xiaochen, Li, Bosheng, Benes, Bedrich, Fei, Songlin, and Pirk, Soren
Abstract: In this article, we propose DeepTree, a novel method for modeling trees based on learning developmental rules for branching structures instead of manually defining them. We call our deep neural model “situated latent” because its behavior is determined by the intrinsic state -encoded as a latent space of a deep neural model- and by the extrinsic (environmental) data that is “situated” as the location in the 3D space and on the tree structure. We use a neural network pipeline to train a situated latent space that allows us to locally predict branch growth only based on a single node in the branch graph of a tree model. We use this representation to progressively develop new branch nodes, thereby mimicking the growth process of trees. Starting from a root node, a tree is generated by iteratively querying the neural network on the newly added nodes resulting in the branching structure of the whole tree. Our method enables generating a wide variety of tree shapes without the need to define intricate parameters that control their growth and behavior. Furthermore, we show that the situated latents can also be used to encode the environmental response of tree models, e.g., when trees grow next to obstacles. We validate the effectiveness of our method by measuring the similarity of our tree models and by procedurally generated ones based on a number of established metrics for tree form.
Published: 2024
Full Text: View/download PDF

49. Driving Interdisciplinary Collaboration through Adapted Conjecture Mapping: A Case Study with the PECAS Mediator

Author: Chang, Michael Alan, primary, Magana, Alejandra, additional, Benes, Bedrich, additional, Kao, Dominic, additional, and Fusco, Judith, additional
Published: 2022
Full Text: View/download PDF

50. Enhancing Buoyant force learning through a visuo-haptic environment: a case study.

Author: Neri, Luis, Noguez, Julieta, Escobar-Castillejos, David, Robledo-Rella, Víctor, García-Castelán, Rosa María Guadalupe, Gonzalez-Nucamendi, Andres, Magana, Alejandra J., Benes, Bedrich, Shiakolas, Panos, Vadcard, Lucile, and Osgouei, Reza Haghighi
Subjects: CONCEPT mapping, LEARNING, ENGINEERING students, EXPERIMENTAL groups, PRE-tests & post-tests, HAPTIC devices
Abstract: Introduction: This study aimed to develop, implement, and test a visuohaptic simulator designed to explore the buoyancy phenomenon for freshman engineering students enrolled in physics courses. The primary goal was to enhance students' understanding of physical concepts through an immersive learning tool. Methods: The visuo-haptic simulator was created using the VIS-HAPT methodology, which provides high-quality visualization and reduces development time. A total of 182 undergraduate students were randomly assigned to either an experimental group that used the simulator or a control group that received an equivalent learning experience in terms of duration and content. Data were collected through pre- and post-tests and an exit-perception questionnaire. Results: Data analysis revealed that the experimental group achieved higher learning gains than the control group (p = 0.079). Additionally, students in the experimental group expressed strong enthusiasm for the simulator, noting its positive impact on their understanding of physical concepts. The VIS-HAPT methodology also reduced the average development time compared to similar visuo-haptic simulators. Discussion: The results demonstrate the efficacy of the buoyancy visuo-haptic simulator in improving students' learning experiences and validate the utility of the VIS-HAPT method for creating immersive educational tools in physics. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

558 results on '"Benes, Bedrich"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources