445 results on '"ELECTRONIC file management"'
Search Results
2. Working with iText for PDFs.
- Author
-
Petersen, John V.
- Subjects
- *
PDF (Computer file format) , *APPLICATION software , *SOURCE code , *ELECTRONIC file management , *COMPUTER programming - Abstract
In the article, the author discusses the basic steps and processes in reading and writing data to and from a portable document format. Other topics include how to use a PDF Library in the process, PDFLibrary source codes and samples, iText 7 PDF libraries, iText7 NuGet package, as well as how to incorporate PDF functionality into an application.
- Published
- 2020
3. Optimal linear–quadratic control of coupled parabolic–hyperbolic PDEs.
- Author
-
Aksikas, I., Moghadam, A. Alizadeh, and Forbes, J. F.
- Subjects
- *
H2 control , *PDF (Computer file format) , *ELECTRONIC file management , *PDF software , *HYPERBOLIC processes , *PARTIAL differential equations - Abstract
This paper focuses on the optimal control design for a system of coupled parabolic–hyperbolic partial differential equations by using the infinite-dimensional state-space description and the corresponding operator Riccati equation. Some dynamical properties of the coupled system of interest are analysed to guarantee the existence and uniqueness of the solution of the linear–quadratic (LQ)-optimal control problem. A state LQ-feedback operator is computed by solving the operator Riccati equation, which is converted into a set of algebraic and differential Riccati equations, thanks to the eigenvalues and the eigenvectors of the parabolic operator. The results are applied to a non-isothermal packed-bed catalytic reactor. The LQ-optimal controller designed in the early portion of the paper is implemented for the original nonlinear model. Numerical simulations are performed to show the controller performances. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
4. IndexConvert: what does it do and why and how would you use it?
- Author
-
Haskins, Lucie
- Subjects
PDF (Computer file format) ,ELECTRONIC file management ,INDEXING ,DATABASES ,COMPUTER software - Abstract
The article offers information on IndexConvert, which converts an existing PDF or RTF/doc index into a format that can be read into the indexing software. It explains why one needs to convert an existing index. The need for conversion utilities to contain the intelligence to retain the implied links within the index entry groups in order to import indexes with subentries into dedicated indexing software is cited.
- Published
- 2017
- Full Text
- View/download PDF
5. Closing the PDF gap: ReadCube's experiments in reader-focused design.
- Author
-
Hodgson, Alex and Schlager, Lucas
- Subjects
- *
PUBLISHING , *PDF (Computer file format) , *ELECTRONIC file management , *CONTENT analysis - Abstract
Key points: Twenty years ago, a fraction of articles were paired with supplements; now, one in five research stories go beyond PDF. Why has the evolution of research content's delivery vehicles lagged behind the evolution of the research story? Hyperlinked in-line references are the most used feature of the Enhanced PDF – exposing faster, easier ways to follow the path of research. Discovery need not start and end with the search engine – it can start within the PDF itself. Discovery is not a solitary process; therefore, content platforms must support collaboration to drive readership. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
6. Thinking the Unprintable in Contemporary Post-Digital Publishing.
- Author
-
SEITA, SOPHIE
- Subjects
- *
PDF (Computer file format) , *ON-demand publications , *PUBLISHING , *AVANT-garde (Arts) , *ELECTRONIC file management - Abstract
The article presents information on publication of portable document format files and print on demand versions of those files by publishers such as Troll Thread and Gauss PDF. According to the author, print in post-digital avant-garde publishing exposes similarities as well as the differences between printed and digital materials.
- Published
- 2017
7. Research and Realization about Conversion Algorithm of PDF Format into PS Format.
- Author
-
Xingfu Wang, Lei Qian, Fuyou Mao, and Zhaosheng Zhu
- Subjects
ALGORITHMS ,PDF (Computer file format) ,ELECTRONIC file management ,POSTSCRIPT (Computer program language) ,PROGRAMMING languages - Abstract
This paper firstly introduces the characteristics of PostScript document and PDF document as the basis, and proposes the necessity and the feasibility of the conversion from the PDF document format to the PostScript language program. Secondly, it studies the main algorithm and technology of the conversion process and realizes the information extraction for PDF document lastly, with achieving the software algorithm for the conversion from PDF document format into PS format on the basis of the text. [ABSTRACT FROM AUTHOR]
- Published
- 2010
8. Reports and other PDF documents.
- Author
-
Cámara, Rafael J. A.
- Subjects
- *
STATISTICAL software , *DATA analysis , *DESCRIPTIVE statistics , *PDF (Computer file format) , *ELECTRONIC file management - Abstract
Stata users often need to combine text, tables, and figures. The author's command, lpdf, generates reports and other PDF documents. lpdf compiles text stored in global macros, tables stored as dataset tables or LaTeX table input files, and figures stored as Stata graphs or PDF figure files. LaTeX must be installed, but familiarity with LaTeX is not necessary. lpdf performs every step through Stata and with Stata syntax. It generates documents in report or article style and portrait or landscape orientation. The default author name, document title, and date can be modified. Further format options include the font and margin sizes. For each table and figure, the width and layout can be adapted. Stata users with LaTeX skills may benefit from additional possibilities. The lpdf ado-file includes two other useful commands called latexize and latext. latexize processes the content of string variables to properly type special characters and symbols in LaTeX input files. latext modifies text stored in global macros in the same way. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
9. What Lives Where & Why? Understanding Biodiversity through Geospatial Exploration.
- Author
-
TRAUTMANN, NANCY M., MAKINSTER, JAMES G., and BATEK, MICHAEL
- Subjects
- *
BIODIVERSITY , *PDF (Computer file format) , *ELECTRONIC file management , *BIOLOGY , *WEB-based user interfaces - Abstract
Using an interactive map-based PDF, students learn key concepts related to biodiversity while developing data-analysis and critical-thinking skills. The Bird Island lesson provides students with experience in translating geospatial data into bar graphs, then interpreting these graphs to compare biodiversity across ecoregions on a fictional island. When the lesson is extended to include real data for Puerto Rico, students can explore distributions of selected bird species based on environmental attributes, making connections between each species' adaptations, habitat requirements, and distribution across the island. This introductory lesson provides a jumping-off point for field and Web-based biodiversity investigations. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
10. All in One - Complete Issue: ChemInform 33/2013.
- Subjects
- *
ELECTRONIC file management , *PDF (Computer file format) - Abstract
ChemInform is a weekly Abstracting Service, delivering concise information at a glance that was extracted from about 100 leading chemistry journals. The following PDF file contains a complete ChemInform issue, thus enabling easy electronic browsing further facilitated by electronic bookmarks. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
11. PDF to PDF/A: Evaluation of Converter Software for Implementation in Digital Repository Workflow.
- Author
-
Koo, Jamin and Chou, Carol C. H.
- Subjects
- *
PDF (Computer file format) , *WORKFLOW software , *WORKFLOW , *ELECTRONIC records , *ELECTRONIC file management - Abstract
PDF/A is a version of Portable Document Format designed for archiving and preservation. Due to its popularity, many electronic documents exist in PDF format, and the ability to convert an existing PDF into a conforming PDF/A file is as important, if not more, as being able to produce documents in PDF/A format. The Florida Digital Archive conducted a study to select a PDF to PDF/A conversion application as part of its format normalization strategy in the summer of 2012. This article documents the evaluation process and presents the results in such a way that they provide insight into challenges and potential drawbacks during similar evaluation or implementation. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
12. Layout-aware text extraction from full-text PDF of scientific articles.
- Author
-
Ramakrishnan, Cartic, Patnia, Abhishek, Hovy, Eduard, and Burns, Gully APC
- Subjects
- *
PDF (Computer file format) , *OPEN source software , *ACCESS to information , *ELECTRONIC file management , *COMPUTER files - Abstract
Background: The Portable Document Format (PDF) is the most commonly used file format for online scientific publications. The absence of effective means to extract text from these PDF files in a layout-aware manner presents a significant challenge for developers of biomedical text mining or biocuration informatics systems that use published literature as an information source. In this paper we introduce the 'Layout-Aware PDF Text Extraction' (LA-PDFText) system to facilitate accurate extraction of text from PDF files of research articles for use in text mining applications. Results: Our paper describes the construction and performance of an open source system that extracts text blocks from PDF-formatted full-text research articles and classifies them into logical units based on rules that characterize specific sections. The LA-PDFText system focuses only on the textual content of the research articles and is meant as a baseline for further experiments into more advanced extraction methods that handle multi-modal content, such as images and graphs. The system works in a three-stage process: (1) Detecting contiguous text blocks using spatial layout processing to locate and identify blocks of contiguous text, (2) Classifying text blocks into rhetorical categories using a rule-based method and (3) Stitching classified text blocks together in the correct order resulting in the extraction of text from section-wise grouped blocks. We show that our system can identify text blocks and classify them into rhetorical categories with Precision = 0.96% Recall = 0.89% and F1 = 0.91%. We also present an evaluation of the accuracy of the block detection algorithm used in step 2. Additionally, we have compared the accuracy of the text extracted by LA-PDFText to the text from the Open Access subset of PubMed Central. We then compared this accuracy with that of the text extracted by the PDF2Text system, commonly used to extract text from PDF.
Finally, we discuss preliminary error analysis for our system and identify further areas of improvement. Conclusions: LA-PDFText is an open-source tool for accurately extracting text from full-text scientific articles. The release of the system is available at http://code.google.com/p/lapdftext/. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
13. Automatic extraction of metadata from scientific publications for CRIS systems.
- Author
-
Kovačević, Aleksandar, Ivanović, Dragan, Milosavljević, Branko, Konjović, Zora, and Surla, Dušan
- Subjects
- *
METADATA , *PDF (Computer file format) , *ELECTRONIC file management , *MACHINE learning , *INFORMATION retrieval - Abstract
Purpose – The aim of this paper is to develop a system for automatic extraction of metadata from scientific papers in PDF format for the information system for monitoring the scientific research activity of the University of Novi Sad (CRIS UNS). Design/methodology/approach – The system is based on machine learning and performs automatic extraction and classification of metadata in eight pre-defined categories. The extraction task is realised as a classification process. For the purpose of classification each row of text is represented with a vector that comprises different features: formatting, position, characteristics related to the words, etc. Experiments were performed with standard classification models. Both a single classifier with all eight categories and eight individual classifiers were tested. Classifiers were evaluated using the five-fold cross validation, on a manually annotated corpus comprising 100 scientific papers in PDF format, collected from various conferences, journals and authors' personal web pages. Findings – Based on the performances obtained on classification experiments, eight separate support vector machines (SVM) models (each of which recognises its corresponding category) were chosen. All eight models were established to have a good performance. The F-measure was over 85 per cent for almost all of the classifiers and over 90 per cent for most of them. Research limitations/implications – Automatically extracted metadata cannot be directly entered into CRIS UNS but requires control of the curators. Practical implications – The proposed system for automatic metadata extraction using support vector machines model was integrated into the software system, CRIS UNS. Metadata extraction has been tested on the publications of researchers from the Department of Mathematics and Informatics of the Faculty of Sciences in Novi Sad. 
Analysis of extracted metadata from these publications showed that the performance of the system for the previously unseen data is in accordance with that obtained by the cross-validation from eight separate SVM classifiers. This system will help in the process of synchronising metadata from CRIS UNS with other institutional repositories. Originality/value – The paper documents a fully automated system for metadata extraction from scientific papers that was developed. The system is based on the SVM classifier and open source tools, and is capable of extracting eight types of metadata from scientific articles of any format that can be converted to PDF. Although developed as part of CRIS UNS, the proposed system can be integrated into other CRIS systems, as well as institutional repositories and library management systems. [ABSTRACT FROM AUTHOR]
- Published
- 2011
- Full Text
- View/download PDF
14. Mendeley: Creating Communities of Scholarly Inquiry Through Research Collaboration.
- Author
-
Zaugg, Holt, West, Richard E., Tateishi, Isaku, and Randall, Daniel L.
- Subjects
- *
ONLINE social networks , *INTERNET research , *RESEARCH management , *PDF (Computer file format) , *ELECTRONIC file management , *WEB 2.0 , *SCHOLARLY method - Abstract
Mendeley is a free, web-based tool for organizing research citations and annotating their accompanying PDF articles. Adapting Web 2.0 principles for academic scholarship, Mendeley integrates the management of the research articles with features for collaborating with researchers locally and worldwide. In this article the features of Mendeley are discussed and critiqued in comparison to other, similar tools. These features include citation management, online synchronization and collaboration, PDF management and annotation, and integration with word processing software. The article concludes with a discussion of how a social networking tool such as Mendeley might impact the academic scholarship process. [ABSTRACT FROM AUTHOR]
- Published
- 2011
- Full Text
- View/download PDF
15. The Technology and Law of the Form of Production of Electronically Stored Information.
- Author
-
Waxse, David J.
- Subjects
- *
ELECTRONIC information resources , *TECHNOLOGY , *CIVIL procedure , *PDF (Computer file format) , *ELECTRONIC file management , *LAWYERS , *FEDERAL courts - Abstract
The article talks about the form of production of electronically stored information (ESI), a key aspect in the use of technology which is an important component of civil litigation in U.S. federal court. The Federal Rule of Civil Procedure 34(a) was changed to establish the general procedures for producing ESI. It defines Tag Image File Format (TIFF) and Portable Document Format (PDF) in which ESI may be produced. It stresses the need for lawyers to have a basic understanding of the nature of this information and its form of production.
- Published
- 2010
16. Media Rich PDF Publishing: Bridging the Gap Between Traditional Print and Multimedia Publishing.
- Author
-
Lisi, Jason
- Subjects
PUBLISHING ,PRINTING industry ,MULTIMEDIA communications ,PDF (Computer file format) ,WEBSITES ,ELECTRONIC file management - Abstract
Traditional print publishing has seen a decline in recent years, partly due to new forms of multimedia publication. Typically there is a clear distinction between print and multimedia publication, and often these two forms of publishing are designed and created separately from one another. Media Rich PDF publishing is a unique form of media convergence that allows designers to blend the superior image clarity and typographical power of printed documents with the interactivity and media flexibility of multimedia publishing. Through media rich PDFs, authors can create high quality printable documents that remain economical in file size, allowing them to be disseminated electronically through email or websites. Media rich PDFs are platform independent, easily managed by inexperienced users, and easy to create. One significant advantage of media rich PDFs is that one file can be created and used to create traditional commercial print products as well as an electronic multimedia experience. This can save publishers significant time and money when trying to publish across multiple mediums. This paper discusses the pros and cons of media rich PDF publishing as it relates to both traditional print publishing and multimedia applications. These arguments will form the basis for the argument that media rich PDFs can be used as a means of bridging two forms of information publishing that are traditionally considered to be incompatible. [ABSTRACT FROM AUTHOR]
- Published
- 2010
- Full Text
- View/download PDF
17. Burrokeet, an Application for Creating and Publishing Content Packages with support for Multiple Input and Output Formats.
- Author
-
Bernard, Margaret and Ramnanan, Anil
- Subjects
MOBILE learning ,HTML (Document markup language) ,MIMO systems ,PDF (Computer file format) ,WIRELESS communications ,ELECTRONIC file management ,OPEN source software - Abstract
This paper describes the design and use of Burrokeet, an e-learning software platform which facilitates the creation of SCORM Content Packages. Burrokeet will accept input content documents in most of the common file formats; it provides the tools to sequence that content into a pedagogically sound 'unit of learning' and to generate the IMS Manifest file for the Content Package. Burrokeet also has a publishing engine which can publish Content Packages created within Burrokeet or SCORM-compliant Content Packages generated from external sources. The Publishing engine will generate the output in a unified format, regardless of the input file formats used. Several output formats are supported, including html for website publication, pdf for print and presentation formats for delivery during class time. [ABSTRACT FROM AUTHOR]
- Published
- 2009
18. ONTOLOGY-BASED INFORMATION EXTRACTION FROM PDF DOCUMENTS WITH XONTO.
- Author
-
ORO, ERMELINDA, RUFFOLO, MASSIMO, and SACCÀ, DOMENICO
- Subjects
- *
PDF (Computer file format) , *ELECTRONIC file management , *VISUAL programming languages (Computer science) , *VISUAL perception , *ONTOLOGY - Abstract
Information extraction is of paramount importance in several real world applications in the areas of business, competitive and military intelligence because it enables to acquire information contained in unstructured documents and store them in structured forms. Unstructured documents have different internal encodings, one of the most diffused encoding is the visualization-oriented Adobe portable document format (PDF). Although several sophisticated and indeed complex approaches were proposed, they are still limited in many aspects. In particular, existing information extraction systems cannot be applied to PDF documents because of their completely unstructured nature that pose many issues in defining IE approaches. In this paper the novel ontology-based system named XONTO, that allows the semantic extraction of information from PDF documents, is presented. The XONTO system is founded on the idea of self-describing ontologies in which objects and classes can be equipped by a set of rules named descriptors. These rules represent patterns that allow to automatically recognize and extract ontology objects contained in PDF documents also when information is arranged in tabular form. This way a self-describing ontology expresses the semantic of the information to extract and the rules that, in turn, populate itself. In the paper XONTO system behaviors and structure are sketched by means of a running example. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
19. PDF/A standard for long term archiving.
- Author
-
Vasilescu, Ramona
- Subjects
ELECTRONIC file management ,PDF (Computer file format) ,ELECTRONIC records ,BACK up systems ,COMPUTER systems ,ELECTRONIC systems - Abstract
PDF/A is defined by ISO 19005-1 as a file format based on PDF format. The standard provides a mechanism for representing electronic documents in a way that preserves their visual appearance over time, independent of the tools and systems used for creating or storing the files. [ABSTRACT FROM AUTHOR]
- Published
- 2009
20. A Comparison of Tabular PDF Inversion Methods.
- Author
-
Cline, D., Razdan, A., and Wonka, P.
- Subjects
- *
COMPUTER graphics , *PDF (Computer file format) , *DIGITAL image processing , *ELECTRONIC file management , *ALGORITHMS , *INVERSIONS (Geometry) - Abstract
The most common form of tabular inversion used in computer graphics is to compute the cumulative distribution table of a probability distribution (PDF) and then search within it to transform points, using a binary search. Besides the standard inversion method, however, several other discrete inversion algorithms exist that can perform the same transformation in O(1) time per point. In this paper, we examine the performance of three of these alternate methods, two of which are new. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
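The standard inversion method mentioned in the abstract of item 20 (build a cumulative table, then binary-search it) can be sketched roughly as follows. This is only an illustrative sketch, not the authors' code: the function names `build_cdf` and `invert` are hypothetical, and the input is assumed to be a list of non-negative bin weights.

```python
import bisect
import itertools


def build_cdf(weights):
    """Return the cumulative distribution table for non-negative bin weights."""
    total = float(sum(weights))
    if total <= 0:
        raise ValueError("weights must have a positive sum")
    cdf = list(itertools.accumulate(w / total for w in weights))
    cdf[-1] = 1.0  # guard against floating-point drift in the last entry
    return cdf


def invert(cdf, u):
    """Map a uniform sample u in [0, 1) to a bin index by binary search."""
    # bisect_right returns the first index whose cumulative value exceeds u,
    # so bins with zero weight are never selected.
    index = bisect.bisect_right(cdf, u)
    return min(index, len(cdf) - 1)  # clamp in case u == 1.0 is passed


cdf = build_cdf([1, 1, 2])   # bins cover [0, 0.25), [0.25, 0.5), [0.5, 1)
samples = [invert(cdf, u) for u in (0.0, 0.3, 0.75)]
```

Each lookup costs O(log n) in the number of bins; the O(1) alternatives that the abstract refers to are not shown here.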
21. A look at Portable Document Format vulnerabilities
- Author
-
Rautiainen, Sami
- Subjects
- *
PDF (Computer file format) , *ELECTRONIC file management , *SYSTEMS software , *COMPUTER files - Abstract
Abstract: Portable Document Format (PDF) developed by Adobe Systems Inc. is a flexible and popular document distribution and delivery file format, and it is supported within various operating systems and devices. This article provides insight for some of the security issues within the format itself as well as an outlook of the vulnerabilities found from various versions of Adobe's own PDF viewer implementation. [Copyright Elsevier]
- Published
- 2009
- Full Text
- View/download PDF
22. Guided Sampling via Weak Motion Models and Outlier Sample Generation for Epipolar Geometry Estimation.
- Author
-
Goshen, Liran and Shimshoni, Ilan
- Subjects
- *
PDF (Computer file format) , *STATISTICAL correlation , *STANDARD deviations , *ELECTRONIC file management , *COMPUTER vision , *PATTERN recognition systems , *ARTIFICIAL intelligence - Abstract
The problem of automatic robust estimation of the epipolar geometry in cases where the correspondences are contaminated with a high percentage of outliers is addressed. This situation often occurs when the images have undergone a significant deformation, either due to large rotation or wide baseline of the cameras. An accelerated algorithm for the identification of the false matches between the views is presented. The algorithm generates a set of weak motion models (WMMs). Each WMM roughly approximates the motion of correspondences from one image to the other. The algorithm represents the distribution of the median of the geometric distances of a correspondence to the WMMs as a mixture model of outlier correspondences and inlier correspondences. The algorithm generates a sample of outlier correspondences from the data. This sample is used to estimate the outlier rate and to estimate the outlier pdf. Using these two pdfs the probability that each correspondence is an inlier is estimated. These probabilities enable guided sampling. In the RANSAC process this guided sampling accelerates the search process. The resulting algorithm when tested on real images achieves a speedup of between one or two orders of magnitude. [ABSTRACT FROM AUTHOR]
- Published
- 2008
- Full Text
- View/download PDF
23. Simulation of swirling turbulent combustion in the TECFLAM combustor
- Author
-
Yang, Weiping and Zhang, Jian
- Subjects
- *
PDF (Computer file format) , *ELECTRONIC file management , *MATHEMATICAL models of turbulence , *FLUID dynamics , *COMBUSTION research , *COMBUSTION ,COMBUSTION measurement - Abstract
Abstract: An algebraic concentration moment (ACM)-PDF turbulent combustion model is proposed and formulated in this paper. The presumed PDF approach is adopted for the closure of the time-averaged temperature relevant quantity. It is integrated with the algebraic expression for the second-order-moment of concentration fluctuations. The obtained ACM-PDF model is employed in the simulation of swirling turbulent diffusion combustion in the TECFLAM combustor. The calculated gas axial, radial and tangential velocities, turbulent kinetic energy, species mass fractions, temperature, and fluctuating temperature are compared with the measured test data. Agreement between the calculation and the measurement is achieved. [Copyright Elsevier]
- Published
- 2008
- Full Text
- View/download PDF
24. Bittracker--A Bitmap Tracker for Visual Tracking under Very General Conditions.
- Author
-
Leichter, Ido, Lindenbaum, Michael, and Rivlin, Ehud
- Subjects
- *
PROBABILITY theory , *DISTRIBUTION (Probability theory) , *PDF (Computer file format) , *ELECTRONIC file management , *IMAGE processing , *INFORMATION processing , *DIGITAL image processing - Abstract
This paper addresses the problem of visual tracking under very general conditions: a possibly nonrigid target whose appearance may drastically change over time, general camera motion, a 3D scene, and no a priori information except initialization. This is in contrast to the vast majority of trackers, which rely on some limited model in which, for example, the target's appearance is known a priori or restricted, the scene is planar, or a pan tilt zoom camera is used. Their goal is to achieve speed and robustness, but their limited context may cause them to fail in the more general case. The proposed tracker works by approximating, in each frame, a probability distribution function (PDF) of the target's bitmap and then estimating the maximum a posteriori bitmap. The PDF is marginalized over all possible motions per pixel, thus avoiding the stage in which optical flow is determined. This is an advantage over other general-context trackers that do not use the motion cue at all or rely on the error-prone calculation of optical flow. Using a Gibbs distribution with respect to the first-order neighborhood system yields a bitmap PDF whose maximization may be transformed into that of a quadratic pseudo-Boolean function, the maximum of which is approximated via a reduction to a maximum-flow problem. Many experiments were conducted to demonstrate that the tracker is able to track under the aforementioned general context. [ABSTRACT FROM AUTHOR]
- Published
- 2008
- Full Text
- View/download PDF
25. Automated Windows Memory File Extraction for Cyber Forensics Investigation.
- Author
-
Hejazi, Seyed Mahmood, Debbabi, Mourad, and Talhi, Chamseddine
- Subjects
COMPUTER crimes ,CRIMINAL investigation ,PDF (Computer file format) ,STATISTICAL correlation ,COMPUTER files ,ELECTRONIC file management ,MEMORY ,MODIFICATIONS - Abstract
In digital forensics, the first step to conducting an investigation is to acquire evidence that is most related to the case. Containing most recently accessed data and information about the status of a system, physical memory is a valuable source of digital evidence. When a process runs or accesses a file, all or some parts of the process's executable or accessed data file are mapped into the physical memory. In this article, we propose various methods to find files and extract them from memory in order to rebuild executable and data files that existed in physical memory at the time of incident. We developed a memory analysis plug-in that uses this automated memory file extraction. Using this tool, we have been able to extract a wide range of data file types, including text, PDF, Java Archives (JAR), various logs, EVT (system event-log files, used by the system event viewer), HTML and many more. Investigators can use the result of this research in order to (1) compare the files found on disk with those extracted from memory to find possible tampering or (2) reconstruct those files that no longer exist on the disk. In addition, they can find the last file modifications that have not been mapped out to the corresponding files on the disk. Memory extracted files can be used for the purpose of correlation analysis along with other sources of evidence such as application or network log files, E-mail files, and data files found on disks. [ABSTRACT FROM AUTHOR]
- Published
- 2008
- Full Text
- View/download PDF
26. Analysis of microdiversity and dual channel macrodiversity in shadowed fading channels using a compound fading model
- Author
-
Shankar, P.M.
- Subjects
- *
PDF (Computer file format) , *ELECTRONIC file management , *DATA transmission systems , *DIGITAL communications - Abstract
Abstract: Wireless channels are affected by short-term fading and long-term fading (shadowing). A compound fading model was proposed for the modeling of shadowed fading channels which resulted in a closed form solution for the probability density function (pdf) of the signal-to-noise ratio (SNR). This model is applied to a case where both micro- and macro-diversity schemes are implemented to mitigate short-term fading and shadowing, respectively. Using the compound fading model, it is shown that the pdf of the signal-to-noise ratio after the implementation of maximal ratio combining (MRC) at the micro level and selection combining (SC) at the macro level can be expressed in analytical form. Even when branch correlation exists, the pdf still can be expressed in analytical form. Thus, the compound pdf model offers significant improvement over approaches which use lognormal pdf for shadowing. The performance of a coherent binary phase shift keying (BPSK) modem is evaluated using this approach. The results demonstrate the simplicity and usefulness of the compound pdf in the performance analyses of shadowed fading channels even when branch correlation exists at the base station or correlation exists between base stations. [Copyright Elsevier]
- Published
- 2008
- Full Text
- View/download PDF
27. UNSUPERVISED ANOMALY DETECTION IN LARGE DATABASES USING BAYESIAN NETWORKS.
- Author
-
Cansado, Antonio and Soto, Alvaro
- Subjects
- *
BAYESIAN analysis , *BAYESIAN field theory , *DATABASES , *INFORMATION storage & retrieval systems , *DATA structures , *GAUSSIAN processes , *GAUSSIAN quadrature formulas , *ELECTRONIC file management , *PDF (Computer file format) - Abstract
Today, there has been a massive proliferation of huge databases storing valuable information. The opportunities of an effective use of these new data sources are enormous; however, the huge size and dimensionality of current large databases call for new ideas to scale up current statistical and computational approaches. This article presents an application of artificial intelligence technology to the problem of automatic detection of candidate anomalous records in a large database. We build our approach with three main goals in mind: 1) an effective detection of the records that are potentially anomalous; 2) a suitable selection of the subset of attributes that explains what makes a record anomalous; and 3) an efficient implementation that allows us to scale the approach to large databases. Our algorithm, called Bayesian network anomaly detector (BNAD), uses the joint probability density function (pdf) provided by a Bayesian network (BN) to achieve these goals. By using appropriate data structures, advanced caching techniques, the flexibility of Gaussian mixture models, and the efficiency of BNs to model joint pdfs, BNAD manages to efficiently learn a suitable BN from a large dataset. We test BNAD using synthetic and real databases, the latter from the fields of manufacturing and astronomy, obtaining encouraging results. [ABSTRACT FROM AUTHOR]
- Published
- 2008
- Full Text
- View/download PDF
28. PDF/A-1: A Ray of Light in the Digital Dark Age?
- Author
-
Dryden, Jean
- Subjects
- *
PDF (Computer file format) , *ELECTRONIC file management , *DOCUMENT imaging systems , *ASSOCIATIONS, institutions, etc. - Abstract
The article offers an overview of ISO 19005-1:2005 Document Management--Electronic Document File Format for Long-Term Preservation--Part 1: Use of PDF 1.4 (PDF/A-1). It refers to an international standard format for the long-term accessibility of page-oriented electronic documents which was approved and published by the International Organization for Standardization (ISO) in September 2005. The idea of developing an ISO standard that is based on portable document format (PDF) originated in the U.S. The initiative was later led by the Association for Information and Image Management (AIIM) and the National Printing Equipment Association (NPES).
- Published
- 2008
- Full Text
- View/download PDF
29. Author index with titles.
- Subjects
- *
PDF (Computer file format) , *ELECTRONIC file management , *ELECTRONIC data processing , *INFORMATION storage & retrieval systems - Abstract
The PDF file provided contains web links to all articles in this volume. [ABSTRACT FROM AUTHOR]
- Published
- 2007
- Full Text
- View/download PDF
30. Advanced carving techniques.
- Author
-
Cohen, M.I.
- Subjects
DATA recovery ,CRIMINAL investigation ,COMPUTER crimes ,PDF (Computer file format) ,ELECTRONIC file management - Abstract
Abstract: Carving is the term most often used to indicate the act of recovering a file from unstructured digital forensic images. The term unstructured indicates that the original digital image does not contain useful filesystem information which may be used to assist in this recovery. Typically, forensic analysts resort to carving techniques as an avenue of last resort due to the difficulty of current techniques. Most current techniques rely on manual inspection of the file to be recovered and manually reconstructing this file using trial and error. Manual processing is typically impractical for modern disk images which might contain hundreds of thousands of files. At the same time the traditional process of recovering deleted files using filesystem information is becoming less practical because most modern filesystems purge critical information for deleted files. As such the need for automated carving techniques is quickly arising even when a filesystem does exist on the forensic image. This paper explores the theory of carving in a formal way. We then proceed to apply this formal analysis to the carving of PDF and ZIP files based on the internal structure inherent within the file formats themselves. Specifically this paper deals with carving from the Digital Forensic Research Workshop's (DFRWS) 2007 carving challenge. [Copyright Elsevier]
- Published
- 2007
- Full Text
- View/download PDF
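Note on entry 30: the abstract describes carving PDF files from their internal structure. As a point of comparison only, the sketch below shows the much simpler signature-based baseline: it looks for the `%PDF-` header and the last `%%EOF` marker before the next header. It is not the method of the paper. The function name `carve_pdfs` is hypothetical, and the sketch assumes each file is stored contiguously and unfragmented, which is exactly the assumption that structure-aware carving is meant to relax.

```python
def carve_pdfs(image: bytes):
    """Return (start, end) byte ranges of candidate PDF files in a raw image.

    Naive signature carving: assumes every PDF is contiguous and that no
    other PDF header appears inside it.
    """
    header, trailer = b"%PDF-", b"%%EOF"
    candidates = []
    start = image.find(header)
    while start != -1:
        # The candidate may extend at most up to the next PDF header.
        next_start = image.find(header, start + len(header))
        limit = next_start if next_start != -1 else len(image)
        # Incrementally updated PDFs contain several %%EOF markers,
        # so take the last one inside the candidate region.
        end = image.rfind(trailer, start, limit)
        if end != -1:
            candidates.append((start, end + len(trailer)))
        start = next_start
    return candidates
```

Each returned range can be sliced out of the image (`image[start:end]`) and handed to a PDF parser for validation; ranges that fail to parse are the cases where a structure-aware approach would be needed.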
31. Author index with titles.
- Subjects
- *
INDEXES , *PDF (Computer file format) , *ELECTRONIC file management , *WEBSITES - Abstract
The PDF file provided contains web links to all articles in this volume. [ABSTRACT FROM AUTHOR]
- Published
- 2006
- Full Text
- View/download PDF
32. Degrees of Separation: Linking and Link Distribution in CNSLP Publisher E-Journal Packages.
- Author
-
Kichuk, Diana
- Subjects
- *
PUBLISHING , *BOOK industry , *ELECTRONIC publishing , *ELECTRONIC journals , *ELECTRONIC publications , *PORTABLE document software , *ELECTRONIC file management , *PDF (Computer file format) , *UTILITIES (Computer programs) - Abstract
This article reports on an analysis of links and link distribution in five publisher e-journal packages licensed through the Canadian National Site Licensing Project (CNSLP): ACS Web Editions, IOP Electronic Journals, ScienceDirect®, RSC Online Journals and Springer-Link. Hyperlinks were quantified in five page environments: journal home page, table of contents, full citation, and full text in HTML and PDF formats. Hyperlinks in tables of contents were mapped to four link types first proposed in a 1998 paper by Stephanie W. Haas and Erika S. Grams: navigation, expansion, resource, and miscellaneous. The study concludes with preliminary thoughts on improving current link practice with the goal of urging publishers and vendors to advance a linking standard for e-journals and packages. [ABSTRACT FROM AUTHOR]
- Published
- 2006
- Full Text
- View/download PDF
33. Apago Releases New PDF Enhancer.
- Author
-
Beals, Stephen
- Subjects
- *
PDF (Computer file format) , *COMPUTER software development , *COMPUTER files , *ELECTRONIC file management , *RECORDS management , *FILE conversion (Computer science) , *DIGITAL preservation , *PRINTING industry - Abstract
The article focuses on the new Portable Document Format Enhancer (PDF E) software from Apago Inc. Dwight Kelly, president of the company, provides an overview of the product. The company has been in the software development field for the last 15 years and has developed software for market giants like Fuji International, Screen Corp. Ltd., Hewlett-Packard Co. and Dupont. But the company is still struggling for a market image and name recognition. It is expected that the new PDF E will give the company its long-sought recognition. PDF E is a stand-alone product and does not need updates every time Adobe Systems Inc. updates its Acrobat reader. The product would be of great advantage to the printing industry. PDF E helps in repurposing PDF files. It checks the files, eliminates redundancies, optimizes compression, analyzes for problems and so on. It also fixes many fragmented PDF files. Finally, it generates a list of all the operations performed on the files. The article also describes in detail how the system works. INSET: Setting Up a New Target's Specifications.
- Published
- 2006
34. The Distribution of Order Statistics for Discrete Random Variables with Applications to Bootstrapping.
- Author
-
Evans, Diane L., Leemis, Lawrence M., and Drew, John H.
- Subjects
- *
PDF (Computer file format) , *ELECTRONIC file management , *ALGORITHMS , *ORDER statistics , *STATISTICAL bootstrapping , *MATHEMATICAL statistics - Abstract
An algorithm for computing the PDF of order statistics drawn from discrete parent populations is presented, along with an implementation of the algorithm in a computer algebra system. Several examples and applications, including exact bootstrapping analysis, illustrate the utility of this algorithm. Bootstrapping procedures require that B bootstrap samples be generated in order to perform statistical inference concerning a data set. Although the requirements for the magnitude of B are typically modest, a practitioner would prefer to avoid the resampling error introduced by choosing a finite B, if possible. The part of the order-statistic algorithm for sampling with replacement from a finite sample can be used to perform exact bootstrapping analysis in certain applications, eliminating the need for replication in the analysis of a data set. [ABSTRACT FROM AUTHOR]
- Published
- 2006
- Full Text
- View/download PDF
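Note on entry 34: the abstract mentions computing the PDF of order statistics from a discrete parent and using it for exact bootstrapping. A minimal sketch of the sampling-with-replacement case follows, using the standard identity that the k-th smallest of n independent draws is at most x exactly when at least k of the draws are at most x. The function names are hypothetical and this is not the authors' computer-algebra implementation; it assumes distinct support values and, for the median helper, an odd sample size.

```python
from collections import Counter
from fractions import Fraction
from math import comb


def order_statistic_pmf(values, probs, n, k):
    """PMF of the k-th smallest of n iid draws from a discrete distribution.

    `values` must be distinct; `probs` are the matching probabilities.
    """
    pmf = {}
    cdf = Fraction(0)
    prev = Fraction(0)  # P(X_(k) <= previous support point)
    for x, p in sorted(zip(values, probs)):
        cdf += Fraction(p)
        # P(X_(k) <= x) = P(at least k of the n draws are <= x)
        at_most_x = sum(
            comb(n, j) * cdf**j * (1 - cdf) ** (n - j) for j in range(k, n + 1)
        )
        pmf[x] = at_most_x - prev
        prev = at_most_x
    return pmf


def exact_bootstrap_median_pmf(data):
    """Exact distribution of the bootstrap sample median (odd-sized data)."""
    n = len(data)
    counts = Counter(data)
    values = sorted(counts)
    probs = [Fraction(counts[v], n) for v in values]
    return order_statistic_pmf(values, probs, n, (n + 1) // 2)
```

For the data set `[1, 2, 3]`, `exact_bootstrap_median_pmf` gives probabilities 7/27, 13/27 and 7/27 for the values 1, 2 and 3, with no resampling involved.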
35. DISTRIBUTIONAL ANALYSIS OF RELATED SYNSETS IN WordNet FOR A WORD SENSE DISAMBIGUATION TASK.
- Author
-
FRAGOS, KOSTAS and MAISTROS, YANIS
- Subjects
- *
PDF (Computer file format) , *ELECTRONIC file management , *ARTIFICIAL intelligence , *PROGRAMMING languages , *POLYSEMY , *SEMANTICS - Abstract
This work presents a new method for an unsupervised word sense disambiguation task using WordNet semantic relations. In this method we expand the context of a word being disambiguated with related synsets from the available WordNet relations and study within this set the distribution of the related synsets that correspond to each sense of the target word. A single sample Pearson-Chi-Square goodness-of-fit hypothesis test is used to determine whether the null hypothesis of a composite normality PDF is a reasonable assumption for a set of related synsets corresponding to a sense. The calculated p-value from this test is a critical value for deciding the correct sense. The target word is assigned the sense, the related synsets of which are distributed more "abnormally" relative to the other sets of the other senses. Our algorithm is evaluated on English lexical sample data from the Senseval-2 word sense disambiguation competition. Three WordNet relations (antonymy, hyponymy and hypernymy) give a distributional set of related synsets for the context that proved to be quite a good word sense discriminator, achieving results comparable with those of the best-performing system among the other competing participants. [ABSTRACT FROM AUTHOR]
- Published
- 2005
- Full Text
- View/download PDF
36. Author index with titles.
- Subjects
- *
ELECTRONIC file management , *PDF (Computer file format) , *DATABASE management - Abstract
The PDF file provided contains web links to all articles in this volume. [ABSTRACT FROM AUTHOR]
- Published
- 2005
- Full Text
- View/download PDF
37. Author index with titles.
- Subjects
- *
AUTHORS , *ELECTRONIC file management , *PDF (Computer file format) - Abstract
The PDF file provided contains web links to all articles in this volume. [ABSTRACT FROM AUTHOR]
- Published
- 2005
- Full Text
- View/download PDF
38. Integration of hospital data using agent technologies – A case study.
- Author
-
Cruz-Correia, R., Vieira-Marques, P., Costa, P., Ferreira, A., Oliveira-Palhares, E., Araújo, F., and Costa-Pereira, A.
- Subjects
- *
INFORMATION retrieval , *MEDICAL records , *HTML (Document markup language) , *PDF (Computer file format) , *ELECTRONIC file management , *INTRANETS (Computer networks) - Abstract
Data retrieval and its integration is one of the major problems that face large and complex health organizations. This is especially relevant when patient information is produced in heterogeneous environments. Implementing a Virtual Electronic Patient Record (VEPR) system may provide an adequate and cost-effective solution for most clinical information needs. In this paper, we describe and discuss the use of agent technologies for the retrieval and integration of clinical records in a VEPR, thus making patient information available at any point of care. Between May 2003 and May 2004, a VEPR was designed and implemented at Hospital S. João, a university hospital with over 1350 beds. An agent-based platform, Multi-Agent System for Integration of Data (MAID®), ensures the communication among various hospital information systems. Clinical reports are retrieved from clinical department information systems (DIS) and stored in a central repository in a browser friendly format. Documents are retrieved in HTML and PDF format and are digitally signed at storage. MAID has now been running for the last 12 months, regularly scanning 7 DIS and collecting a mean of 2800 new reports each day. A visualization module for the VEPR was made available in October 2004 and the number of users and user sessions has been growing since. Currently, over 340 doctors are using the system on a daily basis. The total budget of the project was less than 400 000 euros. Around 30% of the costs were spent in software development and MAID accounted for only 13% of the total project budget. The use of agent technologies in the implementation of a VEPR enabled the successful integration of a large amount of heterogeneous data that could then be accessed from any workstation in the hospital Intranet. As few changes were required to be made in the existing DIS, the implementation has been done over a relatively short period of time and the stress in the organization was low.
Optimization of the scheduling algorithm, automatic notification of health professionals, introspection of clinical reports, retrieval of XML report representations and extension of VEPR to health centres are priorities for future research and development. We strongly believe that agent technologies can and should be used to solve complex data integration and communication problems, which are crucial to the quality of patient care. [ABSTRACT FROM AUTHOR]
- Published
- 2005
39. Adobe Takes a Step Toward JDF with Acrobat 7.
- Author
-
Mittelhaus, Michael
- Subjects
- *
COMPUTER software , *ADOBE software , *PDF (Computer file format) , *ELECTRONIC file management , *PRINTING industry - Abstract
With the Acrobat 7 software, Adobe shows the potential for Job Definition Format (JDF) in the PDF computer file format, but it still has a long way to go to become an effective workflow product. Nonetheless, the ubiquitous Acrobat can be a significant catalyst for the adoption of JDF. JDF will evolve into an industry standard for the future and will eventually lead to an industrialized, integrated and mostly automated production process. Today, however, we do not see much of JDF in practice. Questions arise, however, as to whether the Acrobat 7 JDF software will be the starting point for a broad acceptance of JDF in the printing industry. INSETS: Acrobat 7 Nuisances; Adobe Responds; JDF Properties Helpful in Workflows; Our Criticisms and Adobe's Comments.
- Published
- 2005
40. Multi-user pdf estimation based criteria for adaptive blind separation of discrete sources
- Author
-
Cavalcante, Charles Casimiro and Romano, João Marcos T.
- Subjects
- *
PDF (Computer file format) , *ELECTRONIC file management , *SIGNAL processing , *DATABASE management - Abstract
Abstract: This paper deals with criteria for adaptive blind separation of discrete sources. The criteria are based on the estimation of the probability density function (pdf) of the recovered signal using a parametric model and the Kullback–Leibler divergence to measure the similarities between the involved signals. Two strategies that guarantee the recovery of all sources are employed: the first one introduces a penalty when the sources are correlated and the second one constrains the filtering to an orthogonal global system response. Simulations are carried out to evaluate the performance of the criteria compared with existing blind methods in typical multi-user environments such as spatial and space-time processing. [Copyright Elsevier]
- Published
- 2005
- Full Text
- View/download PDF
41. A Multirate DSP Model for Estimation of Discrete Probability Density Functions.
- Author
-
Byung-Jun Yoon and Vaidyanathan, P. P.
- Subjects
- *
PDF (Computer file format) , *DIGITAL signal processing , *DENSITY functionals , *PROBABILITY theory , *ELECTRONIC file management , *DIGITAL electronics - Abstract
The problem of estimating a probability density function (PDF) from measurements has been widely studied by many researchers. Even though much work has been done in the area of PDF estimation, most of it was focused on the continuous case. In this paper, we propose a new model-based approach for modeling and estimating discrete probability density functions or probability mass functions. This approach is based on multirate signal processing theory, and it has several advantages over the conventional histogram method. We illustrate the PDF estimation procedure and analyze the statistical properties of the PDF estimates. Based on this model, a novel scheme is introduced that can be used for estimating the PDF in the presence of noise. Furthermore, the proposed ideas are extended to the more general case of estimating multivariate PDFs. Finally, we also consider practical issues such as optimizing the coefficients of a digital filter, which is an integral part of the model. This allows us to apply the proposed model to solve real-world problems. Simulation results are given where appropriate in order to demonstrate the ideas. [ABSTRACT FROM AUTHOR]
- Published
- 2005
- Full Text
- View/download PDF
42. Survey Explores the Role of PDF in the Enterprise.
- Author
-
Pfister, Sean
- Subjects
- *
PDF (Computer file format) , *ELECTRONIC file management , *ELECTRONIC data processing , *DATABASE management , *INFORMATION storage & retrieval systems , *WORK environment , *SURVEYS - Abstract
This article presents the Seybold Report survey, which seeks to examine the ongoing role of PDF in the corporate enterprise. Respondents are asked to assess PDF files on several dimensions, including utility in the workplace, the frequency of problems and issues and their advantages and disadvantages. The survey also collects information about the frequency of PDF usage, as well as PDF disposition. PDF receiver and generator work locations employ many versions of PDF. It is noted that there is a huge gap in how the two groups perceive the importance of issues that involve matching the PDF to the original document's color specifications or versions. INSETS: Methodology; Job Definition Format Outside the Radar.
- Published
- 2004
43. Model Enforcement: A Unified Feature Transformation Framework for Classification and Recognition.
- Author
-
Omar, Mohamed Kamal and Hasegawa-Johnson, Mark
- Subjects
- *
PDF (Computer file format) , *NONPARAMETRIC statistics , *ELECTRONIC file management , *BAYESIAN analysis , *PROBABILISM , *RESEARCH - Abstract
Bayesian classifiers rely on models of the a priori and class-conditional feature distributions; the classifier is trained by optimizing these models to best represent features observed in a training corpus according to a certain criterion. In many problems of interest, the true class-conditional feature probability density function (PDF) is not a member of the set of PDFs the classifier can represent. Previous research has shown that the effect of this problem may be reduced either by improving the models or by transforming the features used in the classifier. This paper addresses this model mismatch problem in statistical identification, classification, and recognition systems. We formulate the problem as the problem of minimizing the relative entropy, which is also known as the Kullback-Leibler distance, between the true conditional PDF and the hypothesized probabilistic model. Based on this formulation, we provide a computationally efficient solution to the problem based on volume-preserving maps; existing linear transform designs are shown to be special cases of the proposed solution. Using this result, we propose the symplectic maximum likelihood transform (SMLT), which is a nonlinear volume-preserving extension of the maximum likelihood linear transform (MLLT). This approach has many applications in statistical modeling, classification, and recognition. We apply it to the maximum likelihood estimation (MLE) of the joint PDF of order statistics and show a significant increase in the likelihood for the same number of parameters. We provide also phoneme recognition experiments that show recognition accuracy improvement compared with using the baseline Mel-Frequency Cepstrum Coefficient (MFCC) features or using MLLT. We present an iterative algorithm to jointly estimate the parameters of the symplectic map and the probabilistic model for both applications. [ABSTRACT FROM AUTHOR]
- Published
- 2004
- Full Text
- View/download PDF
44. Fresh Ideas Flourish at the First Drupa Innovation Parc.
- Author
-
Zipper, Bernd
- Subjects
- *
EXHIBITIONS , *PRINTING industry , *PUBLISHING , *PDF (Computer file format) , *ELECTRONIC file management , *ELECTRONIC data processing , *CONFERENCES & conventions - Abstract
This article presents an overview of the Drupa Innovation Parc (DIP), a special showcase presented by company Drupa for innovative new companies from all over the world. DIP offers the latest developments for the printing and media industries. It quickly became a popular meeting spot for the international portable document format scene. Exhibitors featured various product solutions for pre-press workflow, print communication, digital photography, and printing manuscripts.
- Published
- 2004
45. The Next Publishing Revolution: PDF on the Fly.
- Author
-
Zipper, Bernd and Rödding, Thomas L.
- Subjects
- *
PDF (Computer file format) , *ELECTRONIC file management , *DATABASE management , *ELECTRONIC data processing , *INFORMATION storage & retrieval systems , *PUBLISHING - Abstract
This article focuses on the emerging role of PDF-on-the-fly technologies in the publishing industry in the U.S. Not only does the technology offer an attractive business opportunity, it is the only way to properly fuse Web and print technologies. PDF-on-the-fly is based on four technologies that when linked form the foundation for dynamic PDF-production. Due to the internationally accepted ISO-Norm PDF/X, PDF has become a standard for the color-binding description of a digital manuscript. It has become nearly indispensable in producing manuscripts automatically as a container for collected information, such as for individualized text and images and adapted layout for a flyer. INSETS: XML Defined; Cooperation of PDF & XML; XSL-FO Defined.
- Published
- 2004
46. EDITOR'S NOTES.
- Author
-
Lohmann, Roger A.
- Subjects
NONPROFIT organizations ,PUBLISHING ,ASSOCIATIONS, institutions, etc. ,PERIODICAL publishing ,TECHNOLOGICAL innovations ,HIGH technology ,ECONOMIC structure ,PDF (Computer file format) ,ELECTRONIC file management ,FUND accounting ,PEER-to-peer architecture (Computer networks) ,COMPUTER network architectures - Abstract
This issue marks the thirteenth year of publication of the journal "Nonprofit Management and Leadership," and the author's third year as editor. Being a journal editor in a rapidly developing field like this one is somewhat akin to having a comfortable seat on the fifty-yard line in the home stadium of your favorite team. There's a lot of action taking place in a lot of different places, and one has a wonderful view of much of what's happening. Much of the action in scholarly journals, like some of the articles in each issue, deals with matters that are primarily of interest to specialists. And that is as it should be. Specialization is part of the very warp and woof of the nonprofit organizations. Once in a while, however, scholarly work leaps over into the headlines (or perhaps the headlines impinge on a research specialty). The author then specifically mentions Sheldon Gelman and Margaret Gibelman's article on accountability in faith-based organizations in this issue of the journal.
- Published
- 2002
- Full Text
- View/download PDF
47. Portable Document Format (PDF) -- Finally, a Universal Document Exchange Technology.
- Author
-
Wan-Lee Cheng, Michael A.
- Subjects
- *
PDF (Computer file format) , *ELECTRONIC file management , *ELECTRONIC data processing - Abstract
Focuses on the portable document format (PDF) as a universal document exchange technology. Features of PDF; PDF software; Description of how to create a PDF file.
- Published
- 2002
- Full Text
- View/download PDF
48. Reanimating Dead PDFs for CAT Tool Use.
- Author
-
ZETZSCHE, JOST and BROWN, ANNETT
- Subjects
OPTICAL character recognition ,ELECTRONIC file management ,PDF (Computer file format) - Published
- 2017
49. Rendering PDF Content in Windows Store Apps.
- Author
-
Poduri, Sridhar
- Subjects
PDF (Computer file format) ,MICROSOFT software ,ELECTRONIC file management ,JAVASCRIPT programming language ,DATA extraction - Abstract
The article discusses the different ways to render portable document format (PDF) content in Windows Store applications developed by technology firm Microsoft. Topics covered include the parts of Windows Runtime, accessibility to JavaScript and focus on extracting PDF content. Also mentioned are steps in opening PDF documents in the applications.
- Published
- 2013
50. Tips for Working With Adobe® Acrobat® X Pro.
- Author
-
Masters, David L.
- Subjects
COMPUTER software ,PDF (Computer file format) ,ELECTRONIC file management ,DATA protection - Abstract
In this article the author offers tips for working with Adobe® Acrobat® X Pro, software for working with Portable Document Format (PDF) files. He explains that the tool helps in assembling documents, creating files and protecting sensitive information. He states that customizing the toolbar makes Acrobat more efficient.
- Published
- 2012