Author: "Heibi, Ivan" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Heibi, Ivan"' showing total 124 results

1. The OpenCitations Index

Author: Heibi, Ivan, Moretti, Arianna, Peroni, Silvio, and Soricetti, Marta
Subjects: Computer Science - Digital Libraries
Abstract: This article presents the OpenCitations Index, a collection of open citation data maintained by OpenCitations, an independent, not-for-profit infrastructure organisation for open scholarship dedicated to publishing open bibliographic and citation data using Semantic Web and Linked Open Data technologies. The collection involves citation data harvested from multiple sources. Because different sources may provide citation data for bibliographic entities represented with different identifiers, and may therefore represent the same citation more than once, a deduplication mechanism has been implemented. This ensures that citations integrated into OpenCitations Index are uniquely identified, even when different identifiers are used. This mechanism follows a specific workflow, which encompasses preprocessing of the original source data, management of the provided bibliographic metadata, and generation of new citation data to be integrated into the OpenCitations Index. The process relies on another data collection, OpenCitations Meta, and on the use of a new globally persistent identifier, namely OMID (OpenCitations Meta Identifier). As of July 2024, OpenCitations Index stores over 2 billion unique citation links, harvested from Crossref, the National Institutes of Health Open Citation Collection (NIH-OCC), DataCite, OpenAIRE, and the Japan Link Center (JaLC). OpenCitations Index can be systematically accessed and queried through several services, including a SPARQL endpoint, REST APIs, and web interfaces. Additionally, dataset dumps are available for free download and reuse (under CC0 waiver) in various formats (CSV, N-Triples, and Scholix), including provenance and change tracking information.
Published: 2024

2. Why do you cite? An investigation on citation intents and decision-making classification processes

Author: Paolini, Lorenzo, Vahdati, Sahar, Di Iorio, Angelo, Wardenga, Robert, Heibi, Ivan, and Peroni, Silvio
Subjects: Computer Science - Computation and Language
Abstract: Identifying the reason for which an author cites another work is essential to understand the nature of scientific contributions and to assess their impact. Citations are one of the pillars of scholarly communication and most metrics employed to analyze these conceptual links are based on quantitative observations. Behind the act of referencing another scholarly work there is a whole world of meanings that needs to be proficiently and effectively revealed. This study emphasizes the importance of trustfully classifying citation intents to provide more comprehensive and insightful analyses in research assessment. We address this task by presenting a study utilizing advanced Ensemble Strategies for Citation Intent Classification (CIC) incorporating Language Models (LMs) and employing Explainable AI (XAI) techniques to enhance the interpretability and trustworthiness of models' predictions. Our approach involves two ensemble classifiers that utilize fine-tuned SciBERT and XLNet LMs as baselines. We further demonstrate the critical role of section titles as a feature in improving models' performances. The study also introduces a web application developed with Flask and currently available at http://137.204.64.4:81/cic/classifier, aimed at classifying citation intents. One of our models sets a new state of the art (SOTA) with an 89.46% Macro-F1 score on the SciCite benchmark. The integration of XAI techniques provides insights into the decision-making processes, highlighting the contributions of individual words for level-0 classifications, and of individual models for the metaclassification. The findings suggest that the inclusion of section titles significantly enhances classification performances in the CIC task. Our contributions provide useful insights for developing more robust datasets and methodologies, thus fostering a deeper understanding of scholarly communication.
Comment: 42 pages, 14 figures, 1 table, submitted to Scientometrics Journal
Published: 2024

3. A Proposal for a FAIR Management of 3D Data in Cultural Heritage: The Aldrovandi Digital Twin Case

Author: Barzaghi, Sebastian, Bordignon, Alice, Gualandi, Bianca, Heibi, Ivan, Massari, Arcangelo, Moretti, Arianna, Peroni, Silvio, and Renda, Giulia
Subjects: Computer Science - Digital Libraries, Computer Science - Social and Information Networks
Abstract: In this article we analyse 3D models of cultural heritage with the aim of answering three main questions: what processes can be put in place to create a FAIR-by-design digital twin of a temporary exhibition? What are the main challenges in applying FAIR principles to 3D data in cultural heritage studies and how are they different from other types of data (e.g. images) from a data management perspective? We begin with a comprehensive literature review touching on: FAIR principles applied to cultural heritage data; representation models; both Object Provenance Information (OPI) and Metadata Record Provenance Information (MRPI), respectively meant as, on the one hand, the detailed history and origin of an object, and - on the other hand - the detailed history and origin of the metadata itself, which describes the primary object (whether physical or digital); 3D models as cultural heritage research data and their creation, selection, publication, archival and preservation. We then describe the process of creating the Aldrovandi Digital Twin, by collecting, storing and modelling data about cultural heritage objects and processes. We detail the many steps from the acquisition of the Digital Cultural Heritage Objects (DCHO), through to the upload of the optimised DCHO onto a web-based framework (ATON), with a focus on open technologies and standards for interoperability and preservation. Using the FAIR Principles for Heritage Library, Archive and Museum Collections as a framework, we look in detail at how the Digital Twin implements FAIR principles at the object and metadata level. We then describe the main challenges we encountered and we summarise what seem to be the peculiarities of 3D cultural heritage data and the possible directions for further research in this field.
Published: 2024

4. A Workflow for GLAM Metadata Crosswalk

Author: Moretti, Arianna, Heibi, Ivan, and Peroni, Silvio
Subjects: Computer Science - Digital Libraries
Abstract: The acquisition of physical artifacts not only involves transferring existing information into the digital ecosystem but also generates information as a process itself, underscoring the importance of meticulous management of FAIR data and metadata. In addition, the diversity of objects within the cultural heritage domain is reflected in a multitude of descriptive models. The digitization process expands the opportunities for exchange and joint utilization, provided that the descriptive schemas are made interoperable in advance. To achieve this goal, we propose a replicable workflow for metadata schema crosswalks that facilitates the preservation and accessibility of cultural heritage in the digital ecosystem. This work presents a methodology for metadata generation and management in the case study of the digital twin of the temporary exhibition "The Other Renaissance - Ulisse Aldrovandi and the Wonders of the World". The workflow delineates a systematic, step-by-step transformation of tabular data into RDF format, to enhance Linked Open Data. The methodology adopts the RDF Mapping Language (RML) technology for converting data to RDF with human involvement. This last aspect entails an interaction between digital humanists and domain experts through surveys leading to the abstraction and reformulation of domain-specific knowledge, to be exploited in the process of formalizing and converting information.
Comment: Submitted to AIUCD conference 2024, 1 figure, 8 pages
Published: 2024

5. Developing Application Profiles for Enhancing Data and Workflows in Cultural Heritage Digitisation Processes

Author: Barzaghi, Sebastian, Heibi, Ivan, Moretti, Arianna, and Peroni, Silvio
Subjects: Computer Science - Digital Libraries
Abstract: As a result of the proliferation of 3D digitisation in the context of cultural heritage projects, digital assets and digitisation processes - being considered as proper research objects - must prioritise adherence to FAIR principles. Existing standards and ontologies, such as CIDOC CRM, play a crucial role in this regard, but they are often over-engineered for the need of a particular application context, thus making their understanding and adoption difficult. Application profiles of a given standard - defined as sets of ontological entities drawn from one or more semantic artefacts for a particular context or application - are usually proposed as tools for promoting interoperability and reuse while being tied entirely to the particular application context they refer to. In this paper, we present an adaptation and application of an ontology development methodology, i.e. SAMOD, to guide the creation of robust, semantically sound application profiles of large standard models. Using an existing pilot study we have developed in a project dedicated to leveraging virtual technologies to preserve and valorise cultural heritage, we introduce an application profile named CHAD-AP, that we have developed following our customised version of SAMOD. We reflect on the use of SAMOD and similar ontology development methodologies for this purpose, highlighting its strengths and current limitations, future developments, and possible adoption in other similar projects.
Published: 2024

6. Saving temporary exhibitions in virtual environments: the Digital Renaissance of Ulisse Aldrovandi -- acquisition and digitisation of cultural heritage objects

Author: Balzani, Roberto, Barzaghi, Sebastian, Bitelli, Gabriele, Bonifazi, Federica, Bordignon, Alice, Cipriani, Luca, Colitti, Simona, Collina, Federica, Daquino, Marilena, Fabbri, Francesca, Fanini, Bruno, Fantini, Filippo, Ferdani, Daniele, Fiorini, Giulia, Formia, Elena, Forte, Anna, Giacomini, Federica, Girelli, Valentina Alena, Gualandi, Bianca, Heibi, Ivan, Iannucci, Alessandro, Del Fà, Rachele Manganelli, Massari, Arcangelo, Moretti, Arianna, Peroni, Silvio, Pescarin, Sofia, Renda, Giulia, Ronchi, Diego, Sullini, Mattia, Tini, Maria Alessandra, Tomasi, Francesca, Travaglini, Laura, and Vittuari, Luca
Subjects: Computer Science - Graphics, Computer Science - Digital Libraries
Abstract: As per the objectives of Project CHANGES, particularly its thematic sub-project on the use of virtual technologies for museums and art collections, our goal was to obtain a digital twin of the temporary exhibition on Ulisse Aldrovandi called "The Other Renaissance", and make it accessible to users online. After a preliminary study of the exhibition, focussing on acquisition constraints and related solutions, we proceeded with the digital twin creation by acquiring, processing, modelling, optimising, exporting, and metadating the exhibition. We made hybrid use of two acquisition techniques to create new digital cultural heritage objects and environments, and we used open technologies, formats, and protocols to make available the final digital product. Here, we describe the process of collecting and curating bibliographical exhibition (meta)data and the beginning of the digital twin creation to foster its findability, accessibility, interoperability, and reusability. The creation of the digital twin is currently ongoing.
Published: 2023

7. Retractions in Arts and Humanities: an Analysis of the Retraction Notices

Author: Heibi, Ivan and Peroni, Silvio
Subjects: Computer Science - Digital Libraries
Abstract: The aim of this work is to understand the retraction phenomenon in the arts and humanities domain through an analysis of the retraction notices: formal documents stating and describing the retraction of a particular publication. The retractions and the corresponding notices are identified using the data provided by Retraction Watch. Our methodology for the analysis combines a metadata analysis and a content analysis (mainly performed using a topic modeling process) of the retraction notices. Considering 343 cases of retraction, we found that many retraction notices are neither identifiable nor findable. In addition, these were not always separated from the original papers, introducing ambiguity in understanding how these notices were perceived by the community (i.e., cited). Also, we noticed that there is no systematic way to write a retraction notice. Indeed, some retraction notices presented a complete discussion of the reasons for retraction, while others tended to be more direct and succinct. We have also reported many notices having similar text while addressing different retractions. We think a further study with a larger collection should be done using the same methodology to confirm and investigate our findings further.
Published: 2023

8. A Prototype for a Controlled and Valid RDF Data Production Using SHACL

Author: Rizzetto, Elia, Massari, Arcangelo, Heibi, Ivan, and Peroni, Silvio
Subjects: Computer Science - Databases, Computer Science - Digital Libraries
Abstract: The paper introduces a tool prototype that combines SHACL's capabilities with ad-hoc validation functions to create a controlled and user-friendly form interface for producing valid RDF data. The proposed tool is developed within the context of the OpenCitations Data Model (OCDM) use case. The paper discusses the current status of the tool, outlines the future steps required for achieving full functionality, and explores the potential applications and benefits of the tool.
Published: 2023

9. OpenCitations Meta

Author: Massari, Arcangelo, Mariani, Fabio, Heibi, Ivan, Peroni, Silvio, and Shotton, David
Subjects: Computer Science - Digital Libraries
Abstract: OpenCitations Meta is a new database that contains bibliographic metadata of scholarly publications involved in citations indexed by the OpenCitations infrastructure. It adheres to Open Science principles and provides data under a CC0 license for maximum reuse. The data can be accessed through a SPARQL endpoint, REST APIs, and dumps. OpenCitations Meta serves three important purposes. Firstly, it enables disambiguation of citations between publications described using different identifiers from various sources. For example, it can link publications identified by DOIs in Crossref and PMIDs in PubMed. Secondly, it assigns new globally persistent identifiers (PIDs), known as OpenCitations Meta Identifiers (OMIDs), to bibliographic resources without existing external persistent identifiers like DOIs. Lastly, by hosting the bibliographic metadata internally, OpenCitations Meta improves the speed of metadata retrieval for citing and cited documents. The database is populated through automated data curation, including deduplication, error correction, and metadata enrichment. The data is stored in RDF format following the OpenCitations Data Model, and changes and provenance information are tracked. OpenCitations Meta currently incorporates data from Crossref, DataCite, and the NIH Open Citation Collection. In terms of semantic publishing datasets, it is currently the first in data volume.
Comment: 26 pages, 7 figures
Published: 2023

10. A maturity model for catalogues of semantic artefacts

Author: Corcho, Oscar, Ekaputra, Fajar J., Heibi, Ivan, Jonquet, Clement, Micsik, Andras, Peroni, Silvio, and Storti, Emanuele
Published: 2024

11. Representing provenance and track changes of cultural heritage metadata in RDF: a survey of existing approaches

Author: Massari, Arcangelo, Peroni, Silvio, Tomasi, Francesca, and Heibi, Ivan
Subjects: Computer Science - Digital Libraries
Abstract: In the realm of Digital Humanities, the management of cultural heritage metadata is pivotal for ensuring data trustworthiness. Provenance information - contextual metadata detailing the origin and history of data - plays a crucial role in this process. However, tracking provenance and changes in metadata using the Resource Description Framework (RDF) presents significant challenges due to the limitations of foundational Semantic Web technologies. This article offers a comprehensive review of existing models and approaches for representing provenance and tracking changes in RDF, with a specific focus on cultural heritage metadata. It examines W3C standard proposals such as RDF Reification and n-ary relations, along with various alternative systems. Through an in-depth analysis, the study identifies Named Graphs, RDF*, the Provenance Ontology (PROV-O), Dublin Core (DC), Conjectural Graphs, and the OpenCitations Data Model (OCDM) as the most effective solutions. These models are evaluated based on their compliance with RDF standards, scalability, and applicability across different domains. The findings underscore the importance of selecting the appropriate model to ensure robust and reliable management of provenance in RDF datasets, thereby contributing to the ongoing discourse on provenance representation in the Digital Humanities.
Comment: 23 pages, 4 figures, submitted to Digital Scholarship in the Humanities
Published: 2023

12. A maturity model for catalogues of semantic artefacts

Author: Corcho, Oscar, Ekaputra, Fajar J., Heibi, Ivan, Jonquet, Clement, Micsik, Andras, Peroni, Silvio, and Storti, Emanuele
Subjects: Computer Science - Digital Libraries
Abstract: This work presents a maturity model for assessing catalogues of semantic artefacts, one of the keystones that permit semantic interoperability of systems. We defined the dimensions and related features to include in the maturity model by analysing the current literature and existing catalogues of semantic artefacts provided by experts. In addition, we assessed 26 different catalogues to demonstrate the effectiveness of the maturity model, which includes 12 different dimensions (Metadata, Openness, Quality, Availability, Statistics, PID, Governance, Community, Sustainability, Technology, Transparency, and Assessment) and 43 related features (or sub-criteria) associated with these dimensions. Such a maturity model is one of the first attempts to provide recommendations for governance and processes for preserving and maintaining semantic artefacts and helps assess/address interoperability challenges.
Published: 2023

13. OpenCitations, an open e-infrastructure to foster maximum reuse of citation data

Author: Di Giambattista, Chiara, Heibi, Ivan, Peroni, Silvio, and Shotton, David
Subjects: Computer Science - Digital Libraries
Abstract: OpenCitations is an independent not-for-profit infrastructure organization for open scholarship dedicated to the publication of open bibliographic and citation data by the use of Semantic Web (Linked Data) technologies. OpenCitations collaborates with projects that are part of the Open Science ecosystem and complies with the UNESCO founding principles of Open Science, the I4OC recommendations, and the FAIR data principles that data should be Findable, Accessible, Interoperable and Reusable. Since its data satisfies all the Reuse guidelines provided by FAIR in terms of richness, provenance, usage licenses and domain-relevant community standards, OpenCitations provides an example of a successful open e-infrastructure in which the reusability of data is integral to its mission.
Published: 2022

14. How to structure citations data and bibliographic metadata in the OpenCitations accepted format

Author: Massari, Arcangelo and Heibi, Ivan
Subjects: Computer Science - Digital Libraries
Abstract: The OpenCitations organization is working on ingesting citation data and bibliographic metadata directly provided by the community (e.g., scholars and publishers). The aim is to improve the general coverage of open citations, which is still far from being complete, and use the provided metadata to enrich the characterization of the citing and cited entities. This paper illustrates how the citation data and bibliographic metadata should be structured to comply with the OpenCitations accepted format.
Comment: 5 pages, submitted to JCDL 2022
Published: 2022

15. Enabling Portability and Reusability of Open Science Infrastructures

Author: Grieco, Giuseppe, Heibi, Ivan, Massari, Arcangelo, Moretti, Arianna, and Peroni, Silvio
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: This paper presents a methodology for designing a containerized and distributed open science infrastructure to simplify its reusability, replicability, and portability in different environments. The methodology is depicted in a step-by-step schema based on four main phases: (1) Analysis, (2) Design, (3) Definition, and (4) Managing and provisioning. We accompany the description of each step with existing technologies and concrete examples of application.
Comment: 8 pages, 1 PostScript figure, submitted to TPDL 2022
Published: 2022

16. A quantitative and qualitative open citation analysis of retracted articles in the humanities

Author: Heibi, Ivan and Peroni, Silvio
Subjects: Computer Science - Digital Libraries, Computer Science - Information Retrieval
Abstract: In this article, we show and discuss the results of a quantitative and qualitative analysis of open citations to retracted publications in the humanities domain. Our study was conducted by selecting retracted papers in the humanities domain and marking their main characteristics (e.g., retraction reason). Then, we gathered the citing entities and annotated their basic metadata (e.g., title, venue, subject, etc.) and the characteristics of their in-text citations (e.g., intent, sentiment, etc.). Using these data, we performed a quantitative and qualitative study of retractions in the humanities, presenting descriptive statistics and a topic modeling analysis of the citing entities' abstracts and the in-text citation contexts. As part of our main findings, we noticed that there was no drop in the overall number of citations after the year of retraction, and that few entities either mentioned the retraction or expressed a negative sentiment toward the cited publication. In addition, on several occasions, we noticed that citing entities belonging to the health sciences domain showed a higher concern/awareness about citing a retracted publication than those in the humanities and the social science domains. Philosophy, arts, and history are the humanities areas that showed the highest concern toward the retraction.
Published: 2021

17. A protocol to gather, characterize and analyze incoming citations of retracted articles

Author: Heibi, Ivan and Peroni, Silvio
Subjects: Computer Science - Digital Libraries
Abstract: In this article, we present a methodology which takes as input a collection of retracted articles, gathers the entities citing them, characterizes such entities according to multiple dimensions (disciplines, year of publication, sentiment, etc.), and applies a quantitative and qualitative analysis on the collected values. The methodology is composed of four phases: (1) identifying, retrieving, and extracting basic metadata of the entities which have cited a retracted article, (2) extracting and labeling additional features based on the textual content of the citing entities, (3) building a descriptive statistical summary based on the collected data, and finally (4) running a topic modeling analysis. The goal of the methodology is to generate data and visualizations that help in understanding possible behaviors related to retraction cases. We present the methodology in a structured step-by-step form following its four phases, discuss its limits and possible workarounds, and list the planned future improvements.
Published: 2021

18. Saving temporary exhibitions in virtual environments: The Digital Renaissance of Ulisse Aldrovandi – Acquisition and digitisation of cultural heritage objects

Author: Balzani, Roberto, Barzaghi, Sebastian, Bitelli, Gabriele, Bonifazi, Federica, Bordignon, Alice, Cipriani, Luca, Colitti, Simona, Collina, Federica, Daquino, Marilena, Fabbri, Francesca, Fanini, Bruno, Fantini, Filippo, Ferdani, Daniele, Fiorini, Giulia, Formia, Elena, Forte, Anna, Giacomini, Federica, Girelli, Valentina Alena, Gualandi, Bianca, Heibi, Ivan, Iannucci, Alessandro, Manganelli Del Fà, Rachele, Massari, Arcangelo, Moretti, Arianna, Peroni, Silvio, Pescarin, Sofia, Renda, Giulia, Ronchi, Diego, Sullini, Mattia, Tini, Maria Alessandra, Tomasi, Francesca, Travaglini, Laura, and Vittuari, Luca
Published: 2024

19. Knowledge Graphs Evolution and Preservation -- A Technical Report from ISWS 2019

Author: Abbas, Nacira, Alghamdi, Kholoud, Alinam, Mortaza, Alloatti, Francesca, Amaral, Glenda, d'Amato, Claudia, Asprino, Luigi, Beno, Martin, Bensmann, Felix, Biswas, Russa, Cai, Ling, Capshaw, Riley, Carriero, Valentina Anita, Celino, Irene, Dadoun, Amine, De Giorgis, Stefano, Delva, Harm, Domingue, John, Dumontier, Michel, Emonet, Vincent, van Erp, Marieke, Arias, Paola Espinoza, Fallatah, Omaima, Ferrada, Sebastián, Ocaña, Marc Gallofré, Georgiou, Michalis, Gesese, Genet Asefa, Gillis-Webber, Frances, Giovannetti, Francesca, Buey, Marìa Granados, Harrando, Ismail, Heibi, Ivan, Horta, Vitor, Huber, Laurine, Igne, Federico, Jaradeh, Mohamad Yaser, Keshan, Neha, Koleva, Aneta, Koteich, Bilal, Kurniawan, Kabul, Liu, Mengya, Ma, Chuangtao, Maas, Lientje, Mansfield, Martin, Mariani, Fabio, Marzi, Eleonora, Mesbah, Sepideh, Mistry, Maheshkumar, Tirado, Alba Catalina Morales, Nguyen, Anna, Nguyen, Viet Bach, Oelen, Allard, Pasqual, Valentina, Paulheim, Heiko, Polleres, Axel, Porena, Margherita, Portisch, Jan, Presutti, Valentina, Pustu-Iren, Kader, Mendez, Ariam Rivas, Roshankish, Soheil, Rudolph, Sebastian, Sack, Harald, Sakor, Ahmad, Salas, Jaime, Schleider, Thomas, Shi, Meilin, Spinaci, Gianmarco, Sun, Chang, Tietz, Tabea, Dhouib, Molka Tounsi, Umbrico, Alessandro, Berg, Wouter van den, and Xu, Weiqin
Subjects: Computer Science - Artificial Intelligence
Abstract: One of the grand challenges discussed during the Dagstuhl Seminar "Knowledge Graphs: New Directions for Knowledge Representation on the Semantic Web" and described in its report is that of a: "Public FAIR Knowledge Graph of Everything: We increasingly see the creation of knowledge graphs that capture information about the entirety of a class of entities. [...] This grand challenge extends this further by asking if we can create a knowledge graph of "everything" ranging from common sense concepts to location based entities. This knowledge graph should be "open to the public" in a FAIR manner democratizing this mass amount of knowledge." Although linked open data (LOD) is one knowledge graph, it is the closest realisation (and probably the only one) to a public FAIR Knowledge Graph (KG) of everything. Surely, LOD provides a unique testbed for experimenting and evaluating research hypotheses on open and FAIR KG. One of the most neglected FAIR issues about KGs is their ongoing evolution and long term preservation. We want to investigate this problem, that is to understand what preserving and supporting the evolution of KGs means and how these problems can be addressed. Clearly, the problem can be approached from different perspectives and may require the development of different approaches, including new theories, ontologies, metrics, strategies, procedures, etc. This document reports a collaborative effort performed by 9 teams of students, each guided by a senior researcher as their mentor, attending the International Semantic Web Research School (ISWS 2019). Each team provides a different perspective to the problem of knowledge graph evolution substantiated by a set of research questions as the main subject of their investigation. In addition, they provide their working definition for KG preservation and evolution.
Published: 2020

20. A qualitative and quantitative analysis of open citations to retracted articles: the Wakefield et al.'s case

Author: Heibi, Ivan and Peroni, Silvio
Subjects: Computer Science - Digital Libraries
Abstract: In this article, we show the results of a quantitative and qualitative analysis of open citations on a popular and highly cited retracted paper: "Ileal-lymphoid-nodular hyperplasia, non-specific colitis, and pervasive developmental disorder in children" by Wakefield et al., published in 1998. The main purpose of our study is to understand the behavior of the publications citing retracted articles and the characteristics of the citations the retracted articles accumulated over time. Our analysis is based on a methodology which illustrates how we gathered the data, extracted the topics of the citing articles, and visualized the results. The data and services used are all open and free to foster the reproducibility of the analysis. The outcomes concerned the analysis of the entities citing Wakefield et al.'s article and their related in-text citations. We observed a constantly increasing number of citations in the last 20 years, accompanied by a constant increment in the percentage of those acknowledging its retraction. Citing articles started either discussing or dealing with the retraction of Wakefield et al.'s article even before its full retraction, which happened in 2010. Articles in the social sciences domain citing Wakefield et al.'s article were among those that have mostly discussed its retraction. In addition, when observing the in-text citations, we noticed that a large part of the citations received by Wakefield et al.'s article has focused on general discussions without recalling strictly medical details, especially after the full retraction. Medical studies did not hesitate to acknowledge the retraction and often provided strong negative statements on it.
Published: 2020

21. MITAO: a tool for enabling scholars in the Humanities to use Topic Modelling in their studies

Author: Heibi, Ivan, Peroni, Silvio, Pareschi, Luca, and Ferri, Paolo
Subjects: Computer Science - Digital Libraries
Abstract: Automatic text analysis methods, such as Topic Modelling, are gaining much attention in the Humanities. However, scholars need to have extensive coding skills to use such methods appropriately. The need for this technical expertise prevents the broad adoption of these methods in Humanities research. In this paper, to help scholars in the Humanities with no or limited coding skills use Topic Modelling, we introduce MITAO, a web-based tool that allows the definition of a visual workflow which embeds various automatic text analysis operations and allows one to store and share both the workflow and the results of its execution with other researchers, which enables the reproducibility of the analysis. We present an example application of Topic Modelling with MITAO using a collection of English abstracts of the articles published in "Umanistica Digitale". The results returned by MITAO are shown with dynamic web-based visualizations, which allowed us to have preliminary insights about the evolution of the topics treated over time in the articles published in "Umanistica Digitale". All the results along with the defined workflows are published and accessible for further studies.
Published: 2020

22. Creating RESTful APIs over SPARQL endpoints using RAMOSE

Author: Daquino, Marilena, Heibi, Ivan, Peroni, Silvio, and Shotton, David
Subjects: Computer Science - Databases
Abstract: Semantic Web technologies are widely used for storing RDF data and making them available on the Web through SPARQL endpoints, queryable using the SPARQL query language. While the use of SPARQL endpoints is strongly supported by Semantic Web experts, it hinders broader use of RDF data by common Web users, engineers and developers unfamiliar with Semantic Web technologies, who normally rely on Web RESTful APIs for querying Web-available data and creating applications over them. To solve this problem, we have developed RAMOSE, a generic tool developed in Python to create REST APIs over SPARQL endpoints. Through the creation of source-specific textual configuration files, RAMOSE enables the querying of SPARQL endpoints via simple Web RESTful API calls that return either JSON or CSV-formatted data, thus hiding all the intrinsic complexities of SPARQL and RDF from common Web users. We provide evidence that the use of RAMOSE to provide REST API access to RDF data within OpenCitations triplestores is beneficial in terms of the number of queries made by external users to such RDF data using the RAMOSE API compared with the direct access via the SPARQL endpoint. Our findings show the importance for suppliers of RDF data of having an alternative API access service, which enables its use by those with no (or little) experience in Semantic Web technologies and the SPARQL query language. RAMOSE can be used both to query any SPARQL endpoint and to query any other Web API, and thus it represents an easy generic technical solution for service providers who wish to create an API service to access Linked Data stored as RDF in a conventional triplestore.
Published: 2020

23. COCI, the OpenCitations Index of Crossref open DOI-to-DOI citations

Author: Heibi, Ivan, Peroni, Silvio, and Shotton, David
Subjects: Computer Science - Digital Libraries
Abstract: In this paper, we present COCI, the OpenCitations Index of Crossref open DOI-to-DOI citations (http://opencitations.net/index/coci). COCI is the first open citation index created by OpenCitations, in which we have applied the concept of citations as first-class data entities, and it contains more than 445 million DOI-to-DOI citation links derived from the data available in Crossref. These citations are described in RDF by means of the newly extended version of the OpenCitations Data Model (OCDM). We introduce the workflow we have developed for creating these data, and also show the additional services that facilitate the access to and querying of these data via different access points: a SPARQL endpoint, a REST API, bulk downloads, Web interfaces, and direct access to the citations via HTTP content negotiation. Finally, we present statistics regarding the use of COCI citation data, and we introduce several projects that have already started to use COCI data for different purposes. Comment: Submitted to Scientometrics (https://link.springer.com/journal/11192)
Published: 2019
Full Text: View/download PDF

24. Crowdsourcing open citations with CROCI -- An analysis of the current status of open citations, and a proposal

Author: Heibi, Ivan, Peroni, Silvio, and Shotton, David
Subjects: Computer Science - Digital Libraries
Abstract: In this paper, we analyse the current availability of open citations data in one particular dataset, namely COCI (the OpenCitations Index of Crossref open DOI-to-DOI citations; http://opencitations.net/index/coci) provided by OpenCitations. The results of these analyses show a persistent gap in the coverage of the currently available open citation data. In order to address this specific issue, we propose a strategy whereby the community (e.g. scholars and publishers) can directly involve themselves in crowdsourcing open citations, by uploading their citation data via the OpenCitations infrastructure into our new index, CROCI, the Crowdsourced Open Citations Index. Comment: 7 pages, 3 figures, accepted to ISSI 2019 (https://www.issi2019.org/)
Published: 2019

25. Retractions in arts and humanities: an analysis of the retraction notices

Author: Heibi, Ivan, primary and Peroni, Silvio, additional
Published: 2024
Full Text: View/download PDF

26. OpenCitations Meta

Author: Massari, Arcangelo, primary, Mariani, Fabio, additional, Heibi, Ivan, additional, Peroni, Silvio, additional, and Shotton, David, additional
Published: 2024
Full Text: View/download PDF

27. A qualitative and quantitative analysis of open citations to retracted articles: the Wakefield 1998 et al.'s case

Author: Heibi, Ivan and Peroni, Silvio
Published: 2021
Full Text: View/download PDF

28. Enabling Portability and Reusability of Open Science Infrastructures

Author: Grieco, Giuseppe, primary, Heibi, Ivan, additional, Massari, Arcangelo, additional, Moretti, Arianna, additional, and Peroni, Silvio, additional
Published: 2022
Full Text: View/download PDF

29. Open Bibliographical Data Workflows and the Multilinguality Challenge

Author: Malínek, Vojtěch, primary, Umerle, Tomasz, additional, Gray, Edward, additional, Heibi, Ivan, additional, Király, Péter, additional, Klaes, Christiane, additional, Korytkowski, Przemysław, additional, Lindemann, David, additional, Moretti, Arianna, additional, Panušková, Charlotte, additional, Péter, Róbert, additional, Tolonen, Mikko, additional, Tomczyńska, Aldona, additional, and Vimr, Ondřej, additional
Published: 2024
Full Text: View/download PDF

30. The Integration of the Japan Link Center’s Bibliographic Data into OpenCitations

Author: Moretti, Arianna, primary, Soricetti, Marta, additional, Heibi, Ivan, additional, Massari, Arcangelo, additional, Peroni, Silvio, additional, and Rizzetto, Elia, additional
Published: 2024
Full Text: View/download PDF

31. Retractions in Arts and Humanities: an Analysis of the Retraction Notices

Author: Heibi, Ivan, primary and Peroni, Silvio, additional
Published: 2023
Full Text: View/download PDF

32. Saving temporary exhibitions in virtual environments: The Digital Renaissance of Ulisse Aldrovandi – Acquisition and digitisation of cultural heritage objects

Author: Balzani, Roberto, primary, Barzaghi, Sebastian, additional, Bitelli, Gabriele, additional, Bonifazi, Federica, additional, Bordignon, Alice, additional, Cipriani, Luca, additional, Colitti, Simona, additional, Collina, Federica, additional, Daquino, Marilena, additional, Fabbri, Francesca, additional, Fanini, Bruno, additional, Fantini, Filippo, additional, Ferdani, Daniele, additional, Fiorini, Giulia, additional, Formia, Elena, additional, Forte, Anna, additional, Giacomini, Federica, additional, Girelli, Valentina Alena, additional, Gualandi, Bianca, additional, Heibi, Ivan, additional, Iannucci, Alessandro, additional, Manganelli Del Fà, Rachele, additional, Massari, Arcangelo, additional, Moretti, Arianna, additional, Peroni, Silvio, additional, Pescarin, Sofia, additional, Renda, Giulia, additional, Ronchi, Diego, additional, Sullini, Mattia, additional, Tini, Maria Alessandra, additional, Tomasi, Francesca, additional, Travaglini, Laura, additional, and Vittuari, Luca, additional
Published: 2023
Full Text: View/download PDF

33. OSCAR: A Customisable Tool for Free-Text Search over SPARQL Endpoints

Author: Heibi, Ivan, Peroni, Silvio, Shotton, David, Hutchison, David, Series Editor, Kanade, Takeo, Series Editor, Kittler, Josef, Series Editor, Kleinberg, Jon M., Series Editor, Mattern, Friedemann, Series Editor, Mitchell, John C., Series Editor, Naor, Moni, Series Editor, Pandu Rangan, C., Series Editor, Steffen, Bernhard, Series Editor, Terzopoulos, Demetri, Series Editor, Tygar, Doug, Series Editor, Weikum, Gerhard, Series Editor, González-Beltrán, Alejandra, editor, Osborne, Francesco, editor, Peroni, Silvio, editor, and Vahdati, Sahar, editor
Published: 2018
Full Text: View/download PDF

34. Retractions in arts and humanities: an analysis of the retraction notices

Author: Heibi, Ivan and Peroni, Silvio
Abstract: The aim of this work is to understand the retraction phenomenon in the arts and humanities domain through an analysis of the retraction notices—formal documents stating and describing the retraction of a particular publication. The retractions and the corresponding notices are identified using the data provided by Retraction Watch. Our methodology for the analysis combines a metadata analysis and a content analysis (mainly performed using a topic modelling process) of the retraction notices. Considering 343 cases of retraction, we found that many retraction notices are neither identifiable nor findable. In addition, these were not always separated from the original papers, introducing ambiguity in understanding how these notices were perceived by the community (i.e. cited). Also, we noticed that there is no systematic way to write a retraction notice. Indeed, some retraction notices presented a complete discussion of the reasons for retraction, while others tended to be more direct and succinct. We have also reported many notices having similar text while addressing different retractions. We think a further study with a larger collection should be done using the same methodology to confirm and investigate our findings further.
Published: 2024
Full Text: View/download PDF

35. Software review: COCI, the OpenCitations Index of Crossref open DOI-to-DOI citations

Author: Heibi, Ivan, Peroni, Silvio, and Shotton, David
Published: 2019
Full Text: View/download PDF

36. Presentation of 'Representing provenance and track changes of cultural heritage metadata in RDF: a survey of existing approaches'

Author: Massari, Arcangelo, Peroni, Silvio, Tomasi, Francesca, and Heibi, Ivan
Subjects: Digital Humanities, change-tracking, provenance, RDF
Abstract: The data within collections from all Digital Humanities fields must be trustworthy. To this end, both provenance and change-tracking systems are needed. This contribution offers a systematic review of the metadata representation models for provenance in RDF, focusing on the problem of modelling humanistic data. This work, deposited on arXiv and Zenodo, was presented at the ADHO Digital Humanities Conference 2023 (DH2023) on 2023-07-13.
Published: 2023
Full Text: View/download PDF

37. EOSC-IF / Interoperability Guideline: Research Product Deposition

Author: Bardi, Alessia, Manghi, Paolo, Gonzalez Lopez, Jose Benito, Ariyo, Chris, Czerniak, Andreas, van Dongen, Paul Gondim, Kakaletris, Georgios, Palma, Raul, Peroni, Silvio, van Piggelen, Hans, van de Sanden, Mark, Scardaci, Diego, Schirrwagen, Jochen, Testi, Debora, Tournoy, Raphaël, Vipavc, Irena, Grbac, Deborah, Enell, Carl-Fredrik, Aben, Guido, Heibi, Ivan, and van Kemenade, Jorik
Subjects: EOSC, Open Science, Interoperability guidelines
Abstract: Open Science calls for researchers to publish as soon as possible any type of research product in such a way that their research activity can be transparently assessed, reviewed, reproduced, and rewarded in all its aspects. However, the publishing process has become more and more of a burden for scientists, who must, most of the time, spend time publishing their articles, data, software, and other products in the many institutional or thematic repositories of reference. Scenarios include first-time publishing of new resource products or double-publishing of research products, to satisfy institutional mandates and community practices. Such tedious work is often incomplete, with some products ending up unpublished and others showing incomplete or imprecise metadata. Some communities investigated and realised the integration of their research performing services, from research infrastructures and clusters, with repositories for research product deposition. The integration ensures that outcomes of such services are deposited automatically, upon prior authorization of the users, into a given repository, giving life to an end-to-end scientific workflow, from experimentation to publishing. The limit of existing approaches is that they are bound to a specific repository API and format; introducing multiple repositories as potential targets of deposition for the service multiplies the problem, as bilateral interactions with the respective repository API must be established. For example, the Zenodo deposition API and the B2SHARE API are similar but different in many ways; a service willing to automate publishing into either repository would require implementing and maintaining two different workflows. For the EOSC to act as an enabler for Open Science practices, its Interoperability Framework should guide services of research infrastructures and clusters of the EOSC on how to implement (semi-)automated workflows for the deposition and consumption of research products.
To support different integration options, two modalities are supported by these guidelines: SWORD protocol v3 for push mode and a combination of COAR Notify and Signposting for pull mode. The EOSC guidelines for research product onboarding are suggested as metadata exchange format. The guidelines are proposed by the EOSC Future Working Group on Research Product Publishing.
Published: 2023
Full Text: View/download PDF

38. Review of: "The Ethics of Retraction"

Author: Heibi, Ivan, primary
Published: 2023
Full Text: View/download PDF

39. OSCAR: A Customisable Tool for Free-Text Search over SPARQL Endpoints

Author: Heibi, Ivan, primary, Peroni, Silvio, additional, and Shotton, David, additional
Published: 2018
Full Text: View/download PDF

40. EOSC IF Interoperability Guideline: Access to content via PID

Author: Bardi, Alessia, Manghi, Paolo, Gonzalez Lopez, Jose Benito, Ariyo, Chris, Czerniak, Andreas, van Dongen, Paul Gondim, Kakaletris, Georgios, Palma, Raul, Peroni, Silvio, van Piggelen, Hans, van de Sanden, Mark, Scardaci, Diego, Schirrwagen, Jochen, Testi, Debora, Tournoy, Raphaël, Vipavc, Irena, Grbac, Deborah, Enell, Carl-Fredrik, Aben, Guido, Heibi, Ivan, and van Kemenade, Jorik
Subjects: EOSC, Interoperability guidelines, Open Science
Abstract: An important aspect of Open Science is the possibility to re-use existing research products (e.g. research data), deposited in repositories and accessible via their persistent identifiers (e.g. handle, doi, ark). However, there is no standard way a service can access the actual content behind persistent identifiers, as these typically resolve to the landing pages of the research products. The lack of a standard for accessing the actual content identified by persistent identifiers makes the automatic consumption of research products hardly implementable and, when possible, limited to the persistent identifiers issued by a specific repository (e.g. the first prototype of the EGI Data Transfer Service integrated in the EOSC EXPLORE portal supported only DOIs from Zenodo). The EOSC Future Working Group on Research Product Publishing proposes the adoption of the Publication Boundary Pattern of the SignPosting protocol and recommends it for inclusion as an interoperability guideline in the EOSC IF. The guidelines are proposed by the EOSC Future Working Group on "Research Product Publishing".
Published: 2023

41. OAWeek: OpenCitations an infrastructure for open bibliographical metadata

Author: Heibi, Ivan
Subjects: open science, open citations, bibliographical citations, citation data
Abstract: As per tradition, OpenAIRE will actively contribute to the International Open Access Week 2022 initiatives with interactive sessions and thought-provoking panel discussions connected to this year's theme, “Open for Climate Justice”. OpenAIRE prepared two series of webinars that will showcase the different ways in which we can all work together and make Open Science a means to tackle the challenges ahead of us. This session was dedicated to OpenCitations: an infrastructure for open bibliographical metadata. Speaker: Ivan Heibi, University of Bologna. A recording is also available on YouTube: https://youtu.be/zGDyEeh6XnM
Published: 2022
Full Text: View/download PDF

42. Enabling Portability and Reusability of Open Science Infrastructures

Author: Heibi, Ivan
Subjects: OpenCitations, POSI, Open Science Infrastructures, FAIR
Abstract: The slides used in the presentation held by Ivan Heibi in the context of the 26th International Conference on Theory and Practice of Digital Libraries (TPDL 2022). Abstract: This paper presents a methodology for designing a containerized and distributed open science infrastructure to simplify its reusability, replicability, and portability in different environments. The methodology is depicted in a step-by-step schema based on four main phases: (1) Analysis, (2) Design, (3) Definition, and (4) Managing and provisioning. We accompany the description of each step with existing technologies and concrete examples of application.
Published: 2022
Full Text: View/download PDF

43. Why arts and humanities publications get retracted: a topic modeling analysis of the retraction notices

Author: Heibi, Ivan and Peroni, Silvio
Subjects: topic modeling, retraction, humanities
Abstract: The slides used in the presentation held by Ivan Heibi in the context of the Digital Humanities 2022 (DH2022) Conference. Abstract: Considering the limited attention that has been given to the study of the retraction phenomenon and to the reasons for retraction in the arts and humanities domain, the aim of this work is to investigate the reasons for retraction in the arts and humanities through automatic textual analysis of the retraction notices and a comparison of these results with the data provided by other services which have worked on labeling such reasons, such as Retraction Watch.
Published: 2022
Full Text: View/download PDF

44. Science of Retracted Science: a Citation Analysis of the Arts and Humanities Domain

Author: Heibi, Ivan and Peroni, Silvio
Subjects: Humanities, Citation analysis, Topic Modeling, FOS: Humanities, INF/01 Informatica, Science of Science, Retraction
Abstract: The slides used in the presentation held by Ivan Heibi in the context of his PhD dissertation defense. Abstract of the PhD thesis: In the scholarly publishing domain, a retraction is raised when a specific publication is considered erroneous by the venue in which it appeared after it was published. The aim of this work is to uncover new insights and learn important new information to help us understand the retraction phenomenon in the arts and humanities domain. Our investigation is based on a methodology defined using quantitative and qualitative measures derived from previous studies in the transdisciplinary research field of “science of science” (SciSci). The designed methodology takes into account a general case of retraction and applies a citation analysis based on five phases. Citations to retracted publications (before and after their retraction) are gathered and characterized with a set of attributes, including general metadata and information extracted from citing entities’ full text. The annotated characteristics are further considered for a statistical and a textual analysis (i.e., a topic modeling analysis). The contribution of this thesis is grounded by addressing the following research questions: (RQ1) How did scholarly research cite retracted humanities publications before and after their retraction? (RQ2) Did all the humanities areas behave similarly concerning the retraction phenomenon? (RQ3) What are the main differences and similarities in the retraction dynamics between the humanities domain and the STEM disciplines? RQ1 and RQ2 are addressed by tuning and applying the methodology on the analysis of the retracted publications in the humanities domain. RQ3 is addressed on two levels, i.e., considering and comparing: (L1) the outcomes of the past studies on the retraction in STEM, and (L2) the results obtained from an analysis of a retraction case in STEM using the defined methodology.
Published: 2022
Full Text: View/download PDF

45. OpenCitations, an open e-infrastructure to foster maximum reuse of citation data (short abstract)

Author: Di Giambattista, Chiara, Heibi, Ivan, Peroni, Silvio, and Shotton, David
Subjects: open citations, open data, OpenCitations, FAIR
Abstract: A proposal abstract submitted to the 17th International Digital Curation Conference, which takes place on 13-16 June 2022 in Edinburgh, Scotland.
Published: 2022
Full Text: View/download PDF

46. OpenCitations: an Open e-Infrastructure to Foster Maximum Reuse of Citation Data

Author: Di Giambattista, Chiara, primary, Heibi, Ivan, additional, Peroni, Silvio, additional, and Shotton, David, additional
Published: 2022
Full Text: View/download PDF

47. A protocol to gather, characterize and analyze incoming citations of retracted articles

Author: Heibi, Ivan, primary and Peroni, Silvio, additional
Published: 2022
Full Text: View/download PDF

48. Science of retracted science: a citation analysis of the arts and humanities domain

Author: Peroni, Silvio and Heibi, Ivan <1989>
Abstract: In the scholarly publishing domain, a retraction is raised when a specific publication is considered erroneous by the venue in which it appeared after it was published. The aim of this work is to uncover new insights and learn important new information to help us understand the retraction phenomenon in the arts and humanities domain. Our investigation is based on a methodology defined using quantitative and qualitative measures derived from previous studies in the transdisciplinary research field of “science of science” (SciSci). The designed methodology takes into account a general case of retraction and applies a citation analysis based on five phases. Citations to retracted publications (before and after their retraction) are gathered and characterized with a set of attributes, including general metadata and information extracted from citing entities’ full text. The annotated characteristics are further considered for a statistical and a textual analysis (i.e., a topic modeling analysis). The contribution of this thesis is grounded by addressing the following research questions: (RQ1) How did scholarly research cite retracted humanities publications before and after their retraction? (RQ2) Did all the humanities areas behave similarly concerning the retraction phenomenon? (RQ3) What are the main differences and similarities in the retraction dynamics between the humanities domain and the STEM disciplines? RQ1 and RQ2 are addressed by tuning and applying the methodology on the analysis of the retracted publications in the humanities domain. RQ3 is addressed on two levels, i.e., considering and comparing: (L1) the outcomes of the past studies on the retraction in STEM, and (L2) the results obtained from an analysis of a retraction case in STEM using the defined methodology.
Published: 2022

49. MITAO: a tool for enabling scholars in the Humanities to use Topic Modelling in their studies

Author: Heibi, Ivan, Peroni, Silvio, Pareschi, Luca, Ferri, Paolo, Boschetti, Federico, Del Grosso, Angelo Mario, and Salvatori, Enrica
Subjects: FOS: Computer and information sciences, topic modelling, MITAO, tool, Computer Science - Digital Libraries, Digital Libraries (cs.DL)
Abstract: Automatic text analysis methods, such as Topic Modelling, are gaining much attention in the Humanities. However, scholars need to have extensive coding skills to use such methods appropriately. The need for this technical expertise prevents the broad adoption of these methods in Humanities research. In this paper, to help scholars in the Humanities with no or limited coding skills to use Topic Modelling, we introduce MITAO, a web-based tool that allows the definition of a visual workflow which embeds various automatic text analysis operations and allows one to store and share both the workflow and the results of its execution with other researchers, which enables the reproducibility of the analysis. We present an example application of Topic Modelling with MITAO using a collection of English abstracts of the articles published in "Umanistica Digitale". The results returned by MITAO are shown with dynamic web-based visualizations, which allowed us to gain preliminary insights about the evolution over time of the topics treated in the articles published in "Umanistica Digitale". All the results, along with the defined workflows, are published and accessible for further studies.
Published: 2021

50. Creating RESTful APIs over SPARQL endpoints using RAMOSE

Author: Daquino, Marilena, primary, Heibi, Ivan, additional, Peroni, Silvio, additional, and Shotton, David, additional
Published: 2022
Full Text: View/download PDF
