101 results
Search Results
2. The pedagogical value of papers: a collaborative-filtering based paper recommender.
- Author
-
Tang TY and McCalla G
- Abstract
In this paper we discuss the pedagogical features necessary to make appropriate recommendations of papers to students in an e-learning domain. Analyzing data collected in a human subject study several characteristics of learners and of papers are found that are important to making good recommendations. These pedagogical features distinguish e-learning domains from many commercial domains where the only key factor is a user's likes and dislikes. [ABSTRACT FROM AUTHOR]
- Published
- 2009
3. FISHNet: encouraging data sharing and reuse in the freshwater science community.
- Subjects
- *
DATABASE management , *INFORMATION retrieval , *RESEARCH , *SCIENCE , *WATER , *ACCESS to information - Abstract
This paper describes the FISHNet project, which developed a repository environment for the curation and sharing of data relating to freshwater science, a discipline whose research community is distributed thinly across a variety of institutions, and usually works in relative isolation as individual researchers or within small groups. As in other "small sciences", these datasets tend to be small and "hand-crafted", created to address particular research questions rather than with a view to reuse, so they are rarely curated effectively, and the potential for sharing and reusing them is limited. The paper addresses a variety of issues and concerns raised by freshwater researchers as regards data sharing, describes our approach to developing a repository environment that addresses these concerns, and identifies the potential impact within the research community of the system. [ABSTRACT FROM AUTHOR]
- Published
- 2012
4. Repository as a Service (RaaS).
- Author
-
Lewis, Stuart, Shepherd, Kim, Latt, Yin Yin, Schweer, Andrea, and Field, Adam
- Subjects
- *
DATABASE management , *INFORMATION retrieval , *INFORMATION services , *SCHOLARLY method , *UNIVERSITIES & colleges , *ELECTRONIC publications - Abstract
In his oft-quoted seminal paper 'Institutional Repositories: Essential Infrastructure For Scholarship In The Digital Age' Clifford Lynch (2003) described the Institutional Repository as "a set of services that a university offers to the members of its community for the management and dissemination of digital materials created by the institution and its community members." This paper seeks instead to define the repository service at a more primitive level, without the specialism of being an 'Institutional Repository', and looks at how it can viewed as providing a service within appropriate boundaries, and what that could mean for the future development of repositories, our expectations of what repositories should be, and how they could fit into the set of services required to deliver an Institutional Repository service as described by Lynch. [ABSTRACT FROM AUTHOR]
- Published
- 2012
5. Digital Whistleblowing in Restricted Environments.
- Author
-
Bell, Graeme B.
- Subjects
- *
FRAUD prevention , *ALGORITHMS , *ATTITUDE (Psychology) , *COMPUTER graphics , *GROUP identity , *WHISTLEBLOWING , *ELECTRONIC publications , *SEARCH engines , *DATA security - Abstract
The exposure of an organisation's illegal or unethical practices is often known as whistleblowing. It is currently a high-profile activity as a consequence of whistleblowing websites such as Wikileaks. However, modern digital fingerprinting technologies allow the identification of the human users associated with a particular copy of a leaked digital file. Fear of such discovery may discourage the public from exposing illegal or unethical practices. This paper therefore introduces the novel whistleblower-defending problem, a unique variant of the existing document marking and traitor-tracing problems. It is addressed here by outlining practical steps that real-world whistleblowers can take to improve their safety, using only standard desktop OS features. ZIP compression is found to be useful for indirect file comparison, n cases where direct file comparison or use of checksums is impossible, inconvenient or easily traceable. The methods of this paper are experimentally evaluated and found to be effective. [ABSTRACT FROM AUTHOR]
- Published
- 2011
6. Preserving repository content: practical tools for repository managers.
- Author
-
Pickton, Miggie, Morris, Debra, Meece, Stephanie, Coles, Simon, and Hitchcock, Steve
- Subjects
- *
UNIVERSITIES & colleges , *DECISION making , *INFORMATION retrieval , *MANAGEMENT , *METADATA , *ELECTRONIC publications , *DATA security - Abstract
The stated aim of many repositories is to provide permanent open access to their content. However, relatively few repositories have implemented practical action plans towards permanence. Repository managers often lack time and confidence to tackle the important but scary problem of preservation. Written by, and aimed at, institutional repository managers, this paper describes how the JISC--funded KeepIt project has been bringing together existing preservation tools and services with appropriate training and advice to enable repository managers to formulate practical and achievable preservation plans. Three elements of the KeepIt project are described: 1. The initial, exploratory phase in which repository managers and a preservation specialist established the current status of each repository and its preservation objectives; 2. The repository--specific KeepIt preservation training course which covered the organisational and financial framework of repository preservation; metadata; the new preservation tools; and issues of trust between repository, users and services; 3. The application of some of the tools and lessons learned from the training course to four exemplar repositories and the impact that this has made. The paper concludes by recommending practical steps that all repository managers may take to ensure their repositories are preservation--ready. [ABSTRACT FROM AUTHOR]
- Published
- 2011
7. AgEcon Search: A case study on the differences between operating a subject repository and an institutional repository.
- Author
-
Kelly, Julie and Letnes, Louise
- Subjects
- *
AGRICULTURAL research , *ECONOMIC research , *ORGANIZATION , *INSTITUTIONAL repositories , *INTELLECTUAL property , *INTERNET - Abstract
AgEcon Search is a subject repository containing the full text of working papers, conference papers and small press journals in agricultural and other areas of applied economics. In existence since 1995, it contains material from 215 organizations. Comparisons are made between the operations of a subject repository and those of an institutional repository, with each having easier and more challenging aspects. The field of economics has characteristics that contribute to the success of a subject repository, such as a pre-print culture and an interest in intellectual property and the economics of publishing. [ABSTRACT FROM AUTHOR]
- Published
- 2010
8. User needs in television archive access: Acquiring knowledge necessary for system design.
- Author
-
Lunn, Brian Kirkegaard
- Subjects
- *
INFORMATION resources management , *CULTURAL property , *TELEVISION broadcasting , *EMPIRICAL research , *INFORMATION retrieval , *KNOWLEDGE management - Abstract
This paper presents a methodical approach for generating deep knowledge about users, as a prerequisite for design and construction of digital information access to cultural heritage information objects. We exemplify this methodical approach by reporting on an explorative study of information need characteristics in a television broadcast context. The methodical approach is inspired by naturalistic research, and our main data is nine in-depth interviews conducted with scholars and students within the academic field of Media Studies. The analysis identifies four characteristics. Firstly, broadcasts are needed as objects of analysis in empirical research. Secondly, the needs are related to three broadcast dimensions: 1) Transmission; 2) Archive; and 3) Reception. Thirdly, four fundamental types of information needs are verified in a television broadcast context: 1) Known item; 2) Factual data; 3) Known topic or content; and 4) Muddled topic or content. Fourthly, the interviewees' needs consist of four phases: 1) Getting an overview of transmitted broadcasts; 2) Identification of borderline exemplars; 3) Selection of specific programmes; and 4) Verification of facts. The present paper presents novel research on characteristics of information needs in a television broadcast context. We demonstrate how one may go about generating knowledge which is imperative for the design and construction of future broadcast retrieval systems. [ABSTRACT FROM AUTHOR]
- Published
- 2009
9. Sheer Curation of Experiments: Data, Process, Provenance.
- Subjects
- *
COMPUTER software , *DATABASE management , *RESEARCH , *SCIENCE , *ACCESS to information - Abstract
This paper describes an environment for the "sheer curation" of the experimental data of a group of researchers in the fields of biophysics and structural biology. The approach involves embedding data capture and interpretation within researchers' working practices, so that it is automatic and invisible to the researcher. The environment does not capture just the individual datasets generated by an experiment, but the entire workflow that represent the "story" of the experiment, including intermediate files and provenance metadata, so as to support the verification and reproduction of published results. As the curation environment is decoupled from the researchers' processing environment, the provenance is inferred from a variety of domain-specific contextual information, using software that implements the knowledge and expertise of the researchers. We also present an approach to publishing the data files and their provenance according to linked data principles by using OAI-ORE (Open Archives Initiative Object Reuse and Exchange) and OPMV. [ABSTRACT FROM AUTHOR]
- Published
- 2012
10. Kindura: Repository services for researchers based on hybrid clouds.
- Author
-
Waddington, Simon, Jun Zhang, Knight, Gareth, Hedges, Mark, Jensen, Jens, and Downing, Roger
- Subjects
- *
COMPUTER software , *DATABASE management , *RESEARCH , *ACCESS to information , *CLOUD computing - Abstract
The paper describes the investigations and outcomes of the JISC-funded Kindura project, which is piloting the use of hybrid cloud infrastructure to provide repository-focused services to researchers. The hybrid cloud services integrate external commercial cloud services with internal IT infrastructure, which has been adapted to provide cloud-like interfaces. The system provides services to manage and process research outputs, primarily focusing on research data. These services include both repository services, based on use of the Fedora Commons repository, as well as common services such as preservation operations that are provided by cloud compute services. Kindura is piloting the use of the DuraCloud2, open source software developed by DuraSpace, to provide a common interface to interact with cloud storage and compute providers. A storage broker integrates with DuraCloud to optimise the usage of available resources, taking into account such factors as cost, reliability, security and performance. The development is focused on the requirements of target groups of researchers. [ABSTRACT FROM AUTHOR]
- Published
- 2012
11. DAR: A Modern Institutional Repository with a Scalability Twist.
- Author
-
Mikhail, Youssef, Adly, Noha, and Nagi, Magdy
- Subjects
- *
ARCHIVES , *CATALOGING , *INFORMATION resources management , *INFORMATION retrieval , *METADATA , *PUBLISHING , *ELECTRONIC publications - Abstract
The Digital Assets Repository (DAR) is an Institutional Repository developed at the Bibliotheca Alexandrina to manage the full lifecycle of a digital asset: its creation and ingestion, its metadata management, storage and archival in addition to the necessary mechanisms for publishing and dissemination. DAR was designed with a focus on integrating DAR with different sources of digital objects and metadata in addition to integration with applications built on top of the repository. As a modern repository, the system architecture demonstrates a modular design relying on components that are best of the breed, a flexible content model for digital objects based on current standards and heavily relying on RDF triples to define relations. In this paper we will demonstrate the building blocks of DAR as an example of a modern repository, discussing how the system addresses the challenges that face an institution in consolidating its assets and a focus on solving scalability issues. [ABSTRACT FROM AUTHOR]
- Published
- 2012
12. Redocumenting computer-mediated activity from its traces: a model-based approach for narrative construction.
- Author
-
Yahiaoui, Leila, Prié, Yannick, Boufaida, Zizette, and Champin, Pierre-Antoine
- Subjects
- *
COMPUTER software , *COMPUTERS , *DOCUMENTATION , *ELECTRONIC publications , *NARRATIVES - Abstract
Our activities are becoming more and more computer-mediated. For documenting these activities, it is no longer sufficient to automatically record their traces. In this paper we introduce the redocumentation process of computer-mediated activity as a narrative construction that ties together the content of activity traces and the users' knowledge in describing their activities in new easily exchangeable documents. We present a generic semi-automatic approach for this process, which is based on rhetorical structure theory. This approach uses formal models for process input and output, and handles the process through two main phases: an automatic phase to generate a fragmented document from traces as a first description of the activity and an interactive phase to allow the user to tailor this first description according to his particular needs and choices. We also present ActRedoc, a tool developed for text-based redocumentation, for which a first evaluation was conducted. [ABSTRACT FROM AUTHOR]
- Published
- 2011
13. Engineering an Open Web Syndication Interchange with Discovery and Recommender Capabilities.
- Author
-
O'Riordan, Adrian and O'Mahony, Aliver
- Subjects
- *
COMPUTER software , *CURRENT awareness services , *INTERNET , *MUSIC , *XML (Extensible Markup Language) , *SEARCH engines - Abstract
Web syndication has become a popular means of delivering relevant information to people online but the complexity of standards, algorithms and applications pose considerable challenges to engineers. This paper describes the design and development of a novel Web-based syndication intermediary called InterSynd and a simple Web client as a proof of concept. We developed format-neutral middleware that sits between content sources and the user. Additional objectives were to add feed discovery and recommendation components to the intermediary. A search-based feed discovery module helps users find relevant feed sources. Implicit collaborative recommendations of new feeds are also made to the user. The syndication software built uses open standard XML technologies and the free open source libraries. Extensibility and re-configurability were explicit goals. The experience shows that a modular architecture can combine open source modules to build state-of-the-art syndication middleware and applications. The data produced by software metrics indicate the high degree of modularity retained. [ABSTRACT FROM AUTHOR]
- Published
- 2011
14. Generic Adaptation Framework: a Process-Oriented Perspective.
- Author
-
Knutov, Evgeny, De Bra, Paul, and Pechenizkiy, Mykola
- Subjects
- *
INFORMATION retrieval , *INFORMATION storage & retrieval systems , *WORLD Wide Web , *SEARCH engines , *ACCESS to information - Abstract
Adaptive Hypermedia Systems (AHS) have long been mostly domain- or application-specific systems. Existing reference models provide domain-independent high-level descriptions of AHS. They focus on abstract data structures and only briefly describe the adaptation process in a generic way. In this paper we consider the process aspects of AHS from the very first classical user modelling-adaptation loop to a generic detailed flowchart of the adaptation in AHS.We introduce a Generic Adaptation Process (GAP) and by aligning it with a layered (data-oriented) AHS architecture we show that it can serve as the process part of a new reference model for AHS. [ABSTRACT FROM AUTHOR]
- Published
- 2011
15. ICE-Theorem - End to end semantically aware eResearch infrastructure for theses.
- Author
-
Sefton, Peter, Downing, Jim, and Day, Nick
- Subjects
- *
INSTITUTIONAL repositories , *WORKFLOW , *INFORMATION superhighway , *SEMANTICS , *DATABASES - Abstract
ICE-TheOREM was a project which made several important contributions to the repository domain, promoting deposit by integrating the repository with authoring workflows and enhancing open access by prototyping new infrastructure to allow fine-grained embargo management within an institution without impacting on existing open access repository infrastructure. In the area of scholarly communications workflows, the project produced a complete end-to-end demonstration of eScholarship for word processor users, with tools for authoring, managing and disseminating semantically-rich thesis documents fully integrated with supporting data. This work is focused on theses, as it is well understood that early career researchers are the most likely to lead the charge in new innovations in scholarly publishing and dissemination models. The authoring tools are built on the ICE content management system, which allows authors to work within a word processing system (as most authors do) with easy-to-use toolbars to structure and format their documents. The ICE system manages both small data files and links to larger data sets. The result is research publications which are available not just as paper-ready PDF files but as fully interactive semantically aware web documents which can be disseminated via repository software such as ePrints, DSpace and Fedora as complete supported web-native and PDF publications. ICE-TheOREM combined the Object Reuse and Exchange (OAI-ORE, IONSREPORT 2008) and SWORD-APP protocols to transfer content between a content management system, a thesis management system and multiple repository software packages and looked at ways to describe aggregate objects which include both data and documents, and to represent structure withing thesis documents. This can be generalized to domains other than chemistry. ICE-TheOREM has demonstrated how focusing on the use of the web architecture (including ORE) enables repository functions to be distributed between systems for complex, data-rich compound objects. [ABSTRACT FROM AUTHOR]
- Published
- 2010
16. Towards an Open Repository Environment.
- Author
-
Aschenbrenner, Andreas, Küster, Marc W., Blanke, Tobias, and Pempe, Wolfgang
- Subjects
- *
INSTITUTIONAL repositories , *DATABASES , *METADATA , *PUBLICATIONS , *ARCHIVES , *INTERNETWORKING - Abstract
Repositories used to be fairly monolithic systems, with a single object store, a tailored content model, and a dedicated application on top. First steps for searching across repositories (e.g. Z39.50, first drafted in 1988; OAI-PMH for content aggregation, first released 2001) are an exception to this. However, we are still far away from an "open repository environment", in which repositories interact on all levels with other agents (e.g. other repositories, added-value services, registries). This paper creates a more fine-grained view on repository federation and analyses existing approaches by decomposing them into a physical, a logical, and a conceptual layer for both the object and the system. Among these attributes, the most evident gap pertains to interaction "patterns" between agents. In particular, the notification pattern is more immediate and directed than existing query and harvesting mechanisms, enabling new federation scenarios and laying the grounds for open repository environments. Prototypes of the concepts presented here are being implemented in the scope of the project Dariah, which establishes an e-Infrastructure for the humanities. [ABSTRACT FROM AUTHOR]
- Published
- 2010
17. Authoring, Editing and Visualizing Compound Objects for Literary Scholarship.
- Author
-
Gerber, Anna and Hunter, Jane
- Subjects
- *
INFORMATION resources , *INFORMATION services , *BIBLIOGRAPHIC databases , *ONTOLOGY , *TEACHING , *CATALOGS - Abstract
This paper presents LORE (Literature Object Re-use and Exchange), a light-weight tool designed to enable scholars and teachers of literature to author, edit and publish OAI-ORE-compliant compound information objects that encapsulate related digital resources and bibliographic records. LORE provides a graphical user interface for creating, labelling and visualizing typed relationships between individual objects using terms from a bibliographic ontology based on the IFLA FRBR. After creating a compound object, users can attach metadata and publish it to a repository (as an RDF graph) where it can be searched, retrieved, edited and reused by others. LORE has been developed in the context of the Australian Literature Resource project (AustLit) and hence focuses on compound objects for teaching and research within the Australian literary studies community. However it can easily be tailored to support the creation of compound objects for literary and bibliographic research more generally. [ABSTRACT FROM AUTHOR]
- Published
- 2010
18. Adding OAI-ORE Support to Repository Platforms.
- Author
-
Maslov, Alexey, Creel, James, Mikeal, Adam, Phillips, Scott, Leggett, John, and McFarland, Mark
- Subjects
- *
DIGITAL libraries , *ELECTRONIC dissertations , *ARCHIVES , *DATA transmission systems , *INSTITUTIONAL repositories - Abstract
The Texas Digital Library (TDL) is a cooperative initiative of Texas universities. One of TDL's core services is a federated collection of electronic theses and dissertations (ETDs) from its member schools. As this collection grew, the need for tools to manage the content exchange from the local to the federated repository became evident. This paper presents our experiences in adding harvesting support to the DSpace repository platform using the ORE and PMH protocols from the Open Archives Initiative. We describe our use case for a statewide ETD repository and the mapping of the OAI-ORE data model to the DSpace architecture. We discuss our implementation that adds both dissemination and harvesting functionality to the repository. We conclude by discussing the architectural flexibility added to the TDL repository through this project. [ABSTRACT FROM AUTHOR]
- Published
- 2010
19. Towards Affordable Disclosure of Spoken Heritage Archives.
- Author
-
Ordelman, Roeland, Heeren, Willemijn, De Jong, Franciska, Huijbregts, Marijn, and Hiemstra, Djoerd
- Subjects
- *
ARCHIVES , *INTERVIEWING , *WORLD War II , *SPEECH perception , *WEB portals , *SEMANTICS - Abstract
This paper presents and discusses ongoing work aiming at affordable disclosure of real-world spoken heritage archives in general, and in particular of a collection of recorded interviews with Dutch survivors of World War II concentration camp Buchenwald. Given such collections, we at least want to provide search at different levels and a flexible way of presenting results. Strategies for automatic annotation based on speech recognition -- supporting e.g., within-document search--are outlined and discussed with respect to the Buchenwald interview collection. In addition, usability aspects of the spoken word search are discussed on the basis of our experiences with the online Buchenwald web portal. It is concluded that, although user feedback is generally fairly positive, automatic annotation performance is not yet satisfactory, and requires additional research. [ABSTRACT FROM AUTHOR]
- Published
- 2009
20. Advanced Information Access to Parliamentary Debates.
- Author
-
Marx, Maarten
- Subjects
- *
INFORMATION science , *POLITICIANS , *LEGISLATIVE bodies , *LEGAL rights , *INFORMATION retrieval , *PDF (Computer file format) , *XML (Extensible Markup Language) - Abstract
Parliamentary debates are highly structured transcripts of meetings of politicians in parliament. These debates are an important part of the cultural heritage of many countries; they are often free of copy-right; citizens often have a legal right to inspect them; and several countries make great effort to digitize their entire historical collection and make it available to the general public. This provides many opportunities for the Information Retrieval community. In this paper, we analyze the structure of parliamentary proceedings and sketch a widely applicable DTD. We show how proceedings in PDF format can be transformed into deeply nested XML. Having the proceedings in XML makes a wide range of applications possible. We elaborate on five applications: entry point retrieval, advanced content and structure search; automatic creation of tables of contents and hyperlinked navigation menus; graphical result aggregation; large savings on storage space and bandwidth for scanned documents. [ABSTRACT FROM AUTHOR]
- Published
- 2009
21. Designing Personal Information Management Systems for Creative Practitioners.
- Author
-
Coughlan, Tim and Johnson, Peter
- Subjects
- *
INFORMATION resources management , *SOCIETIES , *ECONOMICS , *WORK environment , *ECONOMIC structure , *PROFESSIONS - Abstract
This paper first explores information management by creative practitioners through a review of research related to the area. Two studies are then used to explore this area from different viewpoints. Topics of interest include the types of information that are commonly managed, the reasons for creating representations, the processes of finding and interacting with materials, and the tools used. This understanding is then applied and extended through the design and evaluation of an 'Associative Scrapbook' application inspired by the common use of scrapbooks by practitioners. Creative practice generally involves the retention, development and communication of ideas, inspirational materials and structures of associations represented in a range of media over long periods of time. Extensive variations in information management behaviour between individuals are seen as a natural part of creative processes. Through the studies and the example of the Associative Scrapbook, we argue that support for PIM in creative practice should better integrate work on specific task instances with the long-term collection and reuse of information related to the practitioners' interests. Whilst there are needs that are specific for particular types of creative tasks, the studies and initial evaluations of the prototype provide evidence to suggest that creative PIM needs and processes show similarities across domains. [ABSTRACT FROM AUTHOR]
- Published
- 2009
22. Text and Web Mining Approaches in Order to Build Specialized Ontologies.
- Author
-
Roche, Mathieu and Kodratoff, Yves
- Subjects
- *
TEXT mining , *DATA mining , *DATABASE searching , *INFORMATION retrieval , *PERSONNEL management , *WEBSITES - Abstract
This paper presents a text-mining approach in order to extract candidate terms from a corpus. The relevant candidates are selected using a web-mining approach. The terms (i.e. relevant candidate terms) we find are the instances of specialized ontologies built during this process. The experiments are based on real data -- Human Resources corpus -- and they show the quality of our text and web mining approaches. [ABSTRACT FROM AUTHOR]
- Published
- 2009
23. Supporting Visual Problem Solving in Spatial Hypertext.
- Author
-
Hsieh, Haowei and Shipman III, Frank M.
- Subjects
- *
HYPERTEXT systems , *INFORMATION resources management , *DATA visualization , *PROBLEM solving , *INTERACTIVE multimedia , *INFORMATION retrieval - Abstract
This paper describes the VITE system, a spatial hypertext system that supports two-way mapping for projecting structured information to a two-dimensional workspace and updating the structured information based on user interactions in the workspace. VITE uses information visualization techniques to render structured information in the workspace and provides users an environment to interact with digital information in a spatial hypertext setting. The two-way mapping connects the objects in the workspace to the structured information and provides users direct access to the information. The spatial hypertext environment encourages users to engage more directly with the information related to their tasks. An evaluation of VITE was conducted to study how people adapt to two-way mappings and how two-way mappings can help in problem solving tasks. The results show that users could quickly design visual mappings to help their problem-solving tasks and developed more sophisticated strategies for visual problem-solving over time. [ABSTRACT FROM AUTHOR]
- Published
- 2009
24. Tagging tagging. Analysing user keywords in scientific bibliography management systems.
- Author
-
Heckner M, Mühlbacher S, and Wolff C
- Abstract
In this paper, an empirical study of tagging behaviour in web-based bibliographic annotation systems is presented. Starting from an initial category finding phase in which tags attributed to selected articles from Connotea were classified we have set up a category model for linguistic and functional aspects of tag usage as well as for the relationship between tags and document full text. In a second phase this model is applied to approx. 500 tagged articles from the information and computer technology domain randomly selected from Connotea. Our findings show significant differences to other tagging research which was primarily conducted using popular (non-scientific) tagging platforms like Flickr or Delicious. We observe a great overlap of tag material and document text and rather few non-content related tags. The comparison of user tags with author keywords shows that users tend to use less and more general tags. Finally, system functionality seems to play a role for users' tagging behaviour. [ABSTRACT FROM AUTHOR]
- Published
- 2008
25. Adaptive personal information environment based on the semantic Web.
- Author
-
Maneewatthana T, Wills G, and Hall W
- Abstract
Personalised information systems aim to give the individual user support in accessing, retrieving and storing information. In order to support knowledge workers during their tasks of searching, locating and manipulating information, a system that provides information suitable for a particular user's needs, and that is also able to facilitate the sharing and reuse information, is essential. This paper presents Adaptive Personal Information Environment (a-PIE); a service-oriented framework using Open Hypermedia and Semantic Web technologies to provide a personalised web-based system. a-PIE models the information structures (data and links) and context as Fundamental Open Hypermedia Model (FOHM) structures which are manipulated by using the Auld Linky contextual link service. a-PIE provides an information environment that enables users to search an information space based on ontologically defined domain concepts. The users can add and manipulate (edit, delete, comment, etc.) interesting data, or parts of information structures, into their information space leaving the original published data, or information structures, unchanged. a-PIE facilitates the shareability and reusability of knowledge according to users' requirements. [ABSTRACT FROM AUTHOR]
- Published
- 2007
26. Automatic summary evaluation based on text grammars.
- Author
-
Branny E
- Abstract
In this paper, I describe a method for evaluating automatically generated text summaries. The method is inspired by research in text grammars by Teun Van Dijk. It addresses a text as a complex structure, the elements of which are interconnected both on the level of form and meaning, and the well-formedness of which should be described on both of these levels. The method addresses current problems of summary evaluation methods, especially the problem of quantifying informativity, as well as the problem of objective measurement of well-formedness of text. It is believed that the ideas from this research can contribute to evaluation methods for algorithms transforming complex meaningful entities into other complex meaningful entities (text, hypertext, sound, vision). [ABSTRACT FROM AUTHOR]
- Published
- 2007
27. A Web system trace model and its application to Web design.
- Author
-
Kong X, Liu L, and Lowe D
- Abstract
Traceability analysis is crucial to the development of web-centric systems, particularly those with frequent system changes, fine-grained evolution and maintenance, and high level of requirements uncertainty. A trace model at the level of the web system architecture is presented in this paper to address the specific challenges of developing web-centric systems. The trace model separates the concerns of different stakeholders in the web development life cycle into viewpoints; and classifies each viewpoint into structure and behaviour. Tracing relationships are presented along two dimensions: within viewpoints; and among viewpoints. Examples of tracing relationships are presented using UML. This trace model is demonstrated through its application to the design of a commercial web project using a web-design process. The design artifacts in each activity are transformed based on the artifacts tracing relationship in the trace model. The model provides mechanisms for verification of consistency, completeness and coverage within each viewpoint and the connectedness across viewpoints. [ABSTRACT FROM AUTHOR]
- Published
- 2007
28. Secure Embedded Data Schemes for User Adaptive Multimedia Presentation.
- Author
-
Sinha, Neelu
- Subjects
- *
DIGITAL media , *DIGITAL libraries , *WORLD Wide Web , *DATABASES , *INDEXING , *CUSTOMIZATION - Abstract
In this Digital/Internet Age, digital multimedia holds an unlimited potential, and virtually all forms of media content, including books, video games, music and software are now available for digital distribution. Digital multimedia libraries, comprising a large amount of such digital media (the so-called Digital Intellectual Property), in the form of images, video, audio and graphics are rapidly growing. Also, due to the unprecedented growth of the World Wide Web, vast amounts of multimedia data is readily available leading to an explosion of multimedia and hypermedia database creation and sharing. Digital information embedding techniques for various types of media, for a variety of applications including digital libraries, museum cataloging, medical and healthcare industries, digital preservation systems, educational systems and personalization systems, are of significant interest in two areas. Firstly, these are useful for the realization of efficient database indexing schemes and customization, which in turn lead to efficient tools for the organization, retrieval, adaptive presentation, and distribution of digital media content. Secondly, these are useful in developing tools to protect, detect and verify ownership and/or usage rights for the Digital Intellectual Property and also the tracking of these in the distribution medium. In either of these applications, information embedding schemes which allow for a detailed level of source description, and which are robust to some of the distortions encountered in the distribution medium (e.g., JPEG compression for images), are particularly attractive. This paper presents an overview of multimedia presentation adaptation through the use of robust information descriptors and a novel information embedding technique for digital images that allows for significantly higher information throughput and increased robustness compared to many of the existing techniques. [ABSTRACT FROM AUTHOR]
- Published
- 2005
29. Fault-tolerant fulltext search for large multilingual scientific text corpora.
- Author
-
Esser WM
- Abstract
In the work reported here, we present a new way of performing fault-tolerant fulltext retrieval on large text corpora, such as scientific encyclopedias. The weighted pattern morphing (WPM) technique introduced in this paper overcomes disadvantages of both the popular edit distance measure and the Soundex code approaches, yet keeping their flexibility. This algorithm handles phonetic similarities; common typing errors such as omission or transposition of letters, and inconsistent usage of abbreviations and hyphenation. After showing how WPM can be implemented efficiently, we present a novel method of how the weights of the internal penalty matrix can be automatically adjusted for even better results. Though the described technique can be applied without prior knowledge of actual user patterns, re-examination with a large number of online-user's patterns proves the portability of this fine-tuning approach. We further show how shifting the penalty matrix from one language to another can be accomplished. The described WPM technique is integrated into a large commercial pharmaceutical encyclopedia CDROM, an online dermatological encyclopedia, and an online-reference encyclopedia of parasitology research, thus also proving its 'road capability'. [ABSTRACT FROM AUTHOR]
- Published
- 2005
30. AgEcon search: a case study on the differences between operating a subject repository and an institutional repository.
- Author
-
Kelly J and Letnes L
- Abstract
AgEcon Search is a subject repository containing the full text of working papers, conference papers and small press journals in agricultural and other areas of applied economics. In existence since 1995, it contains material from 170 organizations. Comparisons are made between the operations of a subject repository and those of an institutional repository, with each having easier and more challenging aspects. The field of economics has characteristics that contribute to the success of a subject repository, such as a pre-print culture and an interest in intellectual property and the economics of publishing. [ABSTRACT FROM AUTHOR]
- Published
- 2010
31. Lost in social space: information retrieval issues in Web 1.5.
- Author
-
Dron J and Anderson T
- Abstract
This paper is concerned with the application of Web 2.0 technologies within a conventional institutional learning setting. After considering the affordances of Web 2.0 technologies vs Web 1.0 technologies and a framework for viewing social software in terms of groups, networks and collectives, we describe an instance of trying to use Elgg, a rich social application, to support a distance-taught course within a conventional face-to-face university. A number of issues are identified, some of which are related to Elgg's interface but the biggest of which relate to the tensions between top-down and bottom-up control and the shifting contexts of personal, group, network and collective modes of engagement. These problems suggest that, in their current form, social technologies pose intractable difficulties in information organisation and retrieval when used for formal learning. We propose a range of solutions that make use of the wisdom of the crowd combined with human intervention. This paper addresses and extends themes explored in SIRTEL 07. [ABSTRACT FROM AUTHOR]
- Published
- 2009
32. Studying Social Tagging and Folksonomy: A Review and Framework.
- Author
-
Trant, J.
- Subjects
- *
TAGS (Metadata) , *FOLKSONOMIES , *RESEARCH , *INDEXING , *INFORMATION retrieval , *SOCIAL bookmarks - Abstract
This paper reviews research into social tagging and folksonomy (as reflected in about 180 sources published through December 2007). Methods of researching the contribution of social tagging and folksonomy are described, and outstanding research questions are presented. This is a new area of research, where theoretical perspectives and relevant research methods are only now being defined. This paper provides a framework for the study of folksonomy, tagging and social tagging systems. Three broad approaches are identified, focusing first, on the folksonomy itself (and the role of user tags in indexing and retrieval); secondly, on tagging (and the behaviour of users); and thirdly, on the nature of social tagging systems (as socio-technical frameworks). [ABSTRACT FROM AUTHOR]
- Published
- 2009
33. Studying social tagging and folksonomy: a review and framework.
- Author
-
Trant J
- Abstract
This paper reviews research into social tagging and folksonomy (as reflected in about 180 sources published through December 2007). Methods of researching the contribution of social tagging and folksonomy are described, and outstanding research questions are presented. This is a new area of research, where theoretical perspectives and relevant research methods are only now being defined. This paper provides a framework for the study of folksonomy, tagging and social tagging systems. Three broad approaches are identified, focusing first, on the folksonomy itself (and the role of user tags in indexing and retrieval); secondly, on tagging (and the behaviour of users); and thirdly, on the nature of social tagging systems (as socio-technical frameworks). [ABSTRACT FROM AUTHOR]
- Published
- 2009
34. The ALOCOM framework: towards scalable content reuse.
- Author
-
Verbert K, Ochoa X, and Duval E
- Abstract
This paper presents a framework that enables felxible content reuse. Unlike the usual practice where document components, such as images, definitions, text fragments, tables or diagrams, are assembled manually through copy-and-paste, the framework enables on-the-fly access and reuse. Retrieval of relevant components is enabled by automatic decomposition of legacy documents and storage of individual components, enriched with metadata. Furthermore, the automatic assembly of these components in mainstream authoring tools is supported. The paper describes the framework and its current support for reassembling PowerPoint, Wikipedia and SCORM components in authoring tools. In addition, an evaluation is presented that aims to assess the effectiveness and efficiency of such content reuse for presentations. [ABSTRACT FROM AUTHOR]
- Published
- 2008
35. The ALOCOM Framework: Towards Scalable Content Reuse.
- Author
-
Verbert, Katrien, Ochoa, Xavier, and Duval, Erik
- Subjects
- *
INFORMATION retrieval , *ARCHIVES , *METADATA , *INFORMATION science , *CHARTS, diagrams, etc. - Abstract
This paper presents a framework that enables flexible content reuse. Unlike the usual practice where document components, such as images, definitions, text fragments, tables or diagrams, are assembled manually through copy-and-paste, the framework enables on-the-fly access and reuse. Retrieval of relevant components is enabled by automatic decomposition of legacy documents and storage of individual components, enriched with metadata. Furthermore, the automatic assembly of these components in mainstream authoring tools is supported. The paper describes the framework and its current support for re-assembling PowerPoint, Wikipedia and SCORM components in authoring tools. In addition, an evaluation is presented that aims to assess the effectiveness and efficiency of such content reuse for presentations. [ABSTRACT FROM AUTHOR]
- Published
- 2008
36. Adaptive Hypermedia System Interoperability: a 'real world' evaluation.
- Author
-
Cristea, Alexandra, Stewart, Craig, Brailsford, Tim, and Cristea, Paul
- Subjects
- *
INTERACTIVE multimedia authoring programs , *COMPUTER software , *INTERACTIVE computer systems , *INTERNETWORKING , *COMPUTER networks , *WEBSITE authoring programs - Abstract
Adaptive Hypermedia (AH) authoring is widely acknowledged to be complex and time consuming, yet this vital process is rarely evaluated. Recent research has approached the authoring problem by ensuring that previously created materials can be converted from one system to another. This paper evaluates the results of this research, specifically the creation of adaptive materials in MOT and their conversion and subsequent delivery in WHURLE. A group of technically experienced IT users who are novice AH authors were exposed to MOT and WHURLE during an introductory week long course. This paper interprets the results of these authors using a "write once, deliver many" paradigm of adaptive hypermedia creation. [ABSTRACT FROM AUTHOR]
- Published
- 2007
37. Adaptive hypermedia system interoperability: a 'real world' evaluation.
- Author
-
Cristea A, Stewart C, Brailsford T, and Cristea P
- Abstract
Adaptive Hypermedia (AH) authoring is widely acknowledged to be complex and time consuming, yet this vital process is rarely evaluated. Recent research has approached the authoring problem by ensuring that previously created materials can be converted from one system to another. This paper evaluates the results of this research, specifically the creation of adaptive materials in MOT and their conversion and subsequent delivery in WHURLE. A group of technically experienced IT users who are novice AH authors were exposed to MOT and WHURLE during an introductory week long course. This paper interprets the results of these authors using a 'write once, deliver many' paradigm of adaptive hypermedia creation. [ABSTRACT FROM AUTHOR]
- Published
- 2007
38. Metadata and data quality problems in the digital library.
- Author
-
Beall J
- Abstract
This paper describes the main types of data quality errors that occur in digital libraries, both in full-text objects and in metadata. Studying these errors is important because they can block access to online documents and because digital libraries should eliminate errors where possible. Some types of common errors include typographical errors, scanning and data conversion errors, and find and replace errors. Errors in metadata can also hinder access in digital libraries. The paper also discusses the responsibility for errors in digital documents and offers suggestions for managing digital library data quality. [ABSTRACT FROM AUTHOR]
- Published
- 2005
39. The INVENT framework: examining the role of information visualization in the reconceptualization of digital libraries.
- Author
-
Fast KV and Sedig K
- Abstract
The objective of this paper is to show how information visualization can play an important and catalytic role in the reconceptualization of digital libraries as interactive knowledge environments. Information visualization has long been described as a beneficial and promising technology for digital libraries. Today, however, few digital libraries rely on information visualization concepts and techniques. This is because the research agenda has been dominated by first-generation challenges, such as digitization, organization, preservation, and facilitating access through conventional search and browse interfaces. As a result, digital libraries are still conceptualized as curated, networked, and searchable document repositories. But new research directions are reconceptualizing them as interactive knowledge environments. This paper re-examines the role of information visualization in this reconceptualization. It introduces a new conceptual framework for digital libraries called INVENT: INteractive Visual ENironmenTs. The INVENT framework emphasizes the importance of rich interaction with representations of information, especially visual representations, for supporting cognitive and knowledge work activities. There are six elements in the framework: digital objects, representations, activities, interactions, actors, and ecologies. This paper suggests that these elements should be conceptual cornerstones in the knowledge environment conceptualization of digital libraries. [ABSTRACT FROM AUTHOR]
- Published
- 2005
40. Metadata and Data Quality Problems in the Digital Library.
- Author
-
Beall, Jeffrey
- Subjects
- *
ERROR , *DIGITAL libraries , *METADATA , *PUBLISHED errata , *LIBRARIES - Abstract
This paper describes the main types of data quality errors that occur in digital libraries, both in full-text objects and in metadata. Studying these errors is important because they can block access to online documents and because digital libraries should eliminate errors where possible. Some types of common errors include typographical errors, scanning and data conversion errors, and find and replace errors. Errors in metadata can also hinder access in digital libraries. The paper also discusses the responsibility for errors in digital documents and offers suggestions for managing digital library data quality. [ABSTRACT FROM AUTHOR]
- Published
- 2005
41. The INVENT framework: Examining the role of information visualization in the reconceptualization of digital libraries.
- Author
-
Fast, Karl V. and Sedig, Kamran
- Subjects
- *
VISUALIZATION , *DIGITAL libraries , *TECHNOLOGY , *VISUAL programming languages (Computer science) , *ENVIRONMENTAL sciences - Abstract
The objective of this paper is to show how information visualization can play an important and catalytic role in the reconceptualization of digital libraries as interactive knowledge environments. Information visualization has long been described as a beneficial and promising technology for digital libraries. Today, however, few digital libraries rely on information visualization concepts and techniques. This is because the research agenda has been dominated by first-generation challenges, such as digitization, organization, preservation, and facilitating access through conventional search and browse interfaces. As a result, digital libraries are still conceptualized as curated, networked, and searchable document repositories. But new research directions are reconceptualizing them as interactive knowledge environments. This paper re-examines the role of information visualization in this reconceptualization. It introduces a new conceptual framework for digital libraries called INVENT: INteractive Visual ENironmenTs. The INVENT framework emphasizes the importance of rich interaction with representations of information, especially visual representations, for supporting cognitive and knowledge work activities. There are six elements in the framework: digital objects, representations, activities, interactions, actors, and ecologies. This paper suggests that these elements should be conceptual cornerstones in the knowledge environment conceptualization of digital libraries. [ABSTRACT FROM AUTHOR]
- Published
- 2005
42. Digital Text Cycles: From Medieval Manuscripts to Modern Markup.
- Author
-
Hillesund, Terje
- Subjects
- *
PUBLISHING , *TECHNOLOGICAL innovations , *CULTURE , *CULTURAL industries , *BOOK industry - Abstract
The paper argues that the current implementation of digital publishing is a minor step in a long development of digital text cycles. Rather than being a revolution, the digital transformation of text is an evolutionary process heavily influenced by social and cultural factors. The paper introduces the concept of a "text cycle". An examination of basic features of paper-based text cycles and features of digital text cycles demonstrates that digital technology has a potential for change that far exceeds that of the "Gutenberg revolution". However, by applying a historical perspective, I will try to show how the deep and enduring cultural heritage of print is impeding the radical potential of digital texts. [ABSTRACT FROM AUTHOR]
- Published
- 2005
43. Archival description in OAI-ORE.
- Author
-
Kaplan, Deborah, Sauer, Anne, and Wilczek, Eliot
- Subjects
- *
ARCHIVES , *CATALOGING , *INFORMATION resources management , *INTERNET , *ELECTRONIC publications , *ACCESS to information - Abstract
This paper proposes using OAI-ORE as the basis for a new method to represent and manage the description of archival collections. This strategy adapts traditional archival description methods for the contemporary reality of digital collections and takes advantage of the power of OAI-ORE to allow for a multitude of non-linear relationships, providing richer and more powerful access and description. A schema for representing finding aids in OAI-ORE would facilitate more sophisticated methods for modeling archival collection descriptions. [ABSTRACT FROM AUTHOR]
- Published
- 2011
44. Curation Micro-Services: A Pipeline Metaphor for Repositories.
- Author
-
Abrams, Stephen, Cruse, Patricia, Kunze, John, and Minor, David
- Subjects
- *
INFORMATION resources management , *INFORMATION retrieval , *INFORMATION services , *ACCESS to information , *DATA security - Abstract
The effective long-term curation of digital content requires expert analysis, policy setting, and decision making, and a robust technical infrastructure that can effect and enforce curation policies and implement appropriate curation activities. Since the number, size, and diversity of content under curation management will undoubtedly continue to grow over time, and the state of curation understanding and best practices relative to that content will undergo a similar constant evolution, one of the overarching design goals of a sustainable curation infrastructure is flexibility. In order to provide the necessary flexibility of deployment and configuration in the face of potentially disruptive changes in technology, institutional mission, and user expectation, a useful design metaphor is provided by the Unix pipeline, in which complex behavior is an emergent property of the coordinated action of a number of simple independent components. The decomposition of repository function into a highly granular and orthogonal set of independent but interoperable micro-services is consistent with the principles of prudent engineering practice. Since each micro-service is small and self-contained, they are individually more robust and collectively easier to implement and maintain. By being freely interoperable in various strategic combinations, any number of micro-services-based repositories can be easily constructed to meet specific administrative or technical needs. Importantly, since these repositories are purposefully built from policy neutral and protocol and platform independent components to provide the function minimally necessary for a specific context, they are not constrained to conform to an infrastructural monoculture of prepackaged repository solutions. The University of California Curation Center has developed an open source micro-services infrastructure that is being used to manage the diverse digital collections of the ten campus University system and a number of non-university content partners. This paper provides a review of the conceptual design and technical implementation of this micro-services environment, a case study of initial deployment, and a look at ongoing micro-services developments. [ABSTRACT FROM AUTHOR]
- Published
- 2011
45. Authoring, editing and visualizing compound objects for literary scholarship.
- Author
-
Gerber A and Hunter J
- Abstract
This paper presents LORE (Literature Object Re-use and Exchange), a light-weight tool designed to enable scholars and teachers of literature to author, edit and publish OAI-ORE-compliant compound information objects that encapsulate related digital resources and bibliographic re-cords. LORE provides a graphical user interface for creating, labelling and visualizing typed rela-tionships between individual objects using terms from a bibliographic ontology based on the IFLA FRBR. After creating a compound object, users can attach metadata and publish it to a re-pository (as an RDF graph) where it can be searched, retrieved, edited and re-used by others. LORE has been developed in the context of the Australian Literature Resource project (AustLit) and hence focuses on compound objects for teaching and research within the Australian literary studies community. However it can easily be tailored to support the creation of compound ob-jects for literary and bibliographic research more generally. [ABSTRACT FROM AUTHOR]
- Published
- 2010
46. ICE-TheOREM -- end to end semantically aware eResearch infrastructure for theses.
- Author
-
Sefton P, Downing J, and Day N
- Abstract
ICE-TheOREM was a project which made several important contributions to the repository domain, promoting deposit by integrating the repository with authoring workflows and enhancing open access by prototyping new infrastructure to allow fine-grained embargo management within an institution without impacting on existing open access repository infrastructure. In the area of scholarly communications workflows, the project produced a complete end-to-end demonstration of eScholarship for word processor users, with tools for authoring, managing and disseminating semantically-rich thesis documents fully integrated with supporting data. This work is focused on theses, as it is well understood that early career researchers are the most likely to lead the charge in new innovations in scholarly publishing and dissemination models. The authoring tools are built on the ICE content management system, which allows authors to work within a word processing system (as most authors do) with easy-to-use toolbars to structure and format their documents. The ICE system manages both small data files and links to larger data sets. The result is research publications which are available not just as paper-ready PDF files but as fully interactive semantically aware web documents which can be disseminated via repository software such as ePrints, DSpace and Fedora as complete supported web-native and PDF publications. ICE-TheOREM combined the Object Reuse and Exchange (OAI-ORE, IONSREPORT 2008) and SWORD-APP protocols to transfer content between a content management system, a thesis management system and multiple repository software packages and looked at ways to describe aggregate objects which include both data and documents, and to represent structure withing thesis documents. This can be generalized to domains other than chemistry. ICE-TheOREM has demonstrated how focusing on the use of the web architecture (including ORE) enables repository functions to be distributed between systems for complex, data-rich compound objects. [ABSTRACT FROM AUTHOR]
- Published
- 2010
47. Adding OAI-ORE support to repository platforms.
- Author
-
Maslov A, Creel J, Mikeal A, Phillips S, Leggett J, and McFarland M
- Abstract
The Texas Digital Library (TDL) is a cooperative initiative of Texas universities. One of TDL's core services is a federated collection of electronic theses and dissertations (ETDs) from its member schools. As this collection grew, the need for tools to manage the content exchange from the local to the federated repository became evident. This paper presents our experiences in adding harvesting support to the DSpace repository platform using the ORE and PMH protocols from the Open Archives Initiative. We describe our use case for a statewide ETD repository and the mapping of the OAI-ORE data model to the DSpace architecture. We discuss our implementation that adds both dissemination and harvesting functionality to the repository. We conclude by discussing the architectural flexibility added to the TDL repository through this project. [ABSTRACT FROM AUTHOR]
- Published
- 2010
48. Towards an open repository environment.
- Author
-
Aschenbrenner A, Küster MW, Blanke T, and Pempe W
- Abstract
Repositories used to be fairly monolithic systems, with a single object store, a tailored content model, and a dedicated application on top. The federation protocol OAI-PMH for content aggregation is an exception to this. However, we are still far away from an open repository environment, in which repositories interact on all levels with other agents (e.g. other repositories, added-value services, registries). This paper creates a more fine-grained view on repository federation and analyses existing approaches with regard to four interoperability attributes: syntax, structure, semantics, and patterns. Among these attributes, the most evident gap pertains to interaction patterns between agents. Besides the prevalent client-pull and harvesting patterns, notification offers a more immediate and directed mechanism, enabling new federation scenarios and laying the grounds for open repository environments. Prototypes of the concepts presented here are being implemented in the scope of the project DARIAH, which establishes an e-Infrastructure for the humanities. [ABSTRACT FROM AUTHOR]
- Published
- 2010
49. Text and Web mining approaches in order to build specialized ontologies.
- Author
-
Roche M and Kodratoff Y
- Abstract
This paper presents a text-mining approach in order to extract candidate terms from a corpus. The relevant candidates are selected using a web-mining approach. The terms (i.e. relevant candidate terms) found by this process are the instances of specialized ontologies built. The experiments based on real data - Human Resources corpus - show the quality of our text and web mining approaches. [ABSTRACT FROM AUTHOR]
- Published
- 2009
50. Supporting visual problem solving in spatial hypertext.
- Author
-
Hsieh H and Shipman FM III
- Abstract
This paper describes the VITE system, a spatial hypertext system that supports two-way mapping for projecting structured information to a two-dimensional workspace and updating the structured information based on user interactions in the workspace. VITE uses information visualization techniques to render structured information in the workspace and provides users an environment to interact with digital information in a spatial hypertext setting. The two-way mapping connects the objects in the workspace to the structured information and provides users direct access to the information. The spatial hypertext environment encourages users to engage more directly with the information related to their tasks. An evaluation of VITE was conducted to study how people adapt to two-way mappings and how two-way mappings can help in problem solving tasks. The results show that users could quickly design visual mappings to help their problem-solving tasks and developed more sophisticated strategies for visual problem-solving over time. [ABSTRACT FROM AUTHOR]
- Published
- 2009
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.