Back to Search Start Over

Large-Scale Information Extraction under Privacy-Aware Constraints

Authors :
Ranganath Kondapally
Rajeev Gupta
Source :
CIKM
Publication Year :
2022
Publisher :
ACM, 2022.

Abstract

In this digital age, people spend a significant portion of their lives online and this has led to an explosion of personal data from users and their activities. Typically, this data is private and nobody else, except the user, is allowed to look at it. This poses interesting and complex challenges from scalable information extraction point of view: extracting information under privacy aware constraints where there is little data to learn from but need highly accurate models to run on large amount of data across different users. Anonymization of data is typically used to convert private data into publicly accessible data. But this may not always be feasible and may require complex differential privacy guarantees in order to be safe from any potential negative consequences. Other techniques involve building models on a small amount of seen (eyes-on) data and a large amount of unseen (eyes-off) data. In this tutorial, we use emails as representative private data to explain the concepts of scalable IE under privacy-aware constraints.

Details

Database :
OpenAIRE
Journal :
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
Accession number :
edsair.doi.dedup.....87416d13861a78005ca2530c9abf7c3b