Author: "Baran, Mateusz" / Database: OAIster - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Baran, Mateusz"' showing total 4 results

1. Electoral Agitation Data Set: The Use Case of the Polish Election

Author: Baran, Mateusz; Wójcik, Mateusz; Kolebski, Piotr; Bernaczyk, Michał; Rajda, Krzysztof; Augustyniak, Łukasz; and Kajdanowicz, Tomasz
Abstract: The popularity of social media makes politicians use it for political advertisement. Therefore, social media is full of electoral agitation (electioneering), especially during the election campaigns. The election administration cannot track the spread and quantity of messages that count as agitation under the election code. It addresses a crucial problem, while also uncovering a niche that has not been effectively targeted so far. Hence, we present the first publicly open data set for detecting electoral agitation in the Polish language. It contains 6,112 human-annotated tweets tagged with four legally conditioned categories. We achieved a 0.66 inter-annotator agreement (Cohen's kappa score). An additional annotator resolved the mismatches between the first two improving the consistency and complexity of the annotation process. The newly created data set was used to fine-tune a Polish Language Model called HerBERT (achieving a 68% F1 score). We also present a number of potential use cases for such data sets and models, enriching the paper with an analysis of the Polish 2020 Presidential Election on Twitter.
Comment: 5 pages, 3 figures, Language Resources and Evaluation Conference
Published: 2023

2. Domain-Agnostic Neural Architecture for Class Incremental Continual Learning in Document Processing Platform

Author: Wójcik, Mateusz; Kościukiewicz, Witold; Baran, Mateusz; Kajdanowicz, Tomasz; and Gonczarek, Adam
Abstract: Production deployments in complex systems require ML architectures to be highly efficient and usable against multiple tasks. Particularly demanding are classification problems in which data arrives in a streaming fashion and each class is presented separately. Recent methods with stochastic gradient learning have been shown to struggle in such setups or have limitations like memory buffers, and being restricted to specific domains that disable its usage in real-world scenarios. For this reason, we present a fully differentiable architecture based on the Mixture of Experts model, that enables the training of high-performance classifiers when examples from each class are presented separately. We conducted exhaustive experiments that proved its applicability in various domains and ability to learn online in production environments. The proposed technique achieves SOTA results without a memory buffer and clearly outperforms the reference methods.
Comment: arXiv admin note: text overlap with arXiv:2211.14963
Published: 2023

3. Classical Out-of-Distribution Detection Methods Benchmark in Text Classification Tasks

Author: Baran, Mateusz; Baran, Joanna; Wójcik, Mateusz; Zięba, Maciej; and Gonczarek, Adam
Abstract: State-of-the-art models can perform well in controlled environments, but they often struggle when presented with out-of-distribution (OOD) examples, making OOD detection a critical component of NLP systems. In this paper, we focus on highlighting the limitations of existing approaches to OOD detection in NLP. Specifically, we evaluated eight OOD detection methods that are easily integrable into existing NLP systems and require no additional OOD data or model modifications. One of our contributions is providing a well-structured research environment that allows for full reproducibility of the results. Additionally, our analysis shows that existing OOD detection methods for NLP tasks are not yet sufficiently sensitive to capture all samples characterized by various types of distributional shifts. Particularly challenging testing scenarios arise in cases of background shift and randomly shuffled word order within in-domain texts. This highlights the need for future work to develop more effective OOD detection approaches for NLP problems, and our work provides a well-defined foundation for further research in this area.
Comment: 11 pages, 3 figures, Association for Computational Linguistics
Published: 2023

4. Manifolds.jl: An Extensible Julia Framework for Data Analysis on Manifolds

Author: Axen, Seth D.; Baran, Mateusz; Bergmann, Ronny; and Rzecki, Krzysztof
Abstract: We present the Julia package Manifolds.jl, providing a fast and easy-to-use library of Riemannian manifolds and Lie groups. This package enables working with data defined on a Riemannian manifold, such as the circle, the sphere, symmetric positive definite matrices, or one of the models for hyperbolic spaces. We introduce a common interface, available in ManifoldsBase.jl, with which new manifolds, applications, and algorithms can be implemented. We demonstrate the utility of Manifolds.jl using Bézier splines, an optimization task on manifolds, and principal component analysis on nonlinear data. In a benchmark, Manifolds.jl outperforms all comparable packages for low-dimensional manifolds in speed; over Python and Matlab packages, the improvement is often several orders of magnitude, while over C/C++ packages, the improvement is two-fold. For high-dimensional manifolds, it outperforms all packages except for Tensorflow-Riemopt, which is specifically tailored for high-dimensional manifolds.
Published: 2021
