18 results on '"Praveen Rao"'
Search Results
2. A Novel Detection of Hazardous Fires from CCTV Images using Novel YOLOv7 in Comparison with Faster R-CNN
- Author
-
V P, Praveen Rao, primary and Ramkumar, G, additional
- Published
- 2023
- Full Text
- View/download PDF
3. Enabling Large-Scale Human Genome Sequence Analysis on CloudLab
- Author
-
Praveen Rao and Arun Zachariah
- Published
- 2022
4. FederatedTree: A Secure Serverless Algorithm for Federated Learning to Reduce Data Leakage
- Author
-
Mohamed Gharibi, Srini Bhagavan, and Praveen Rao
- Published
- 2021
5. Scalable Acceleration of Characteristic Mode Analysis Using Big Data Techniques
- Author
-
Khulud Alsultan, Ahmed M. Hassan, and Praveen Rao
- Published
- 2021
6. C3HSB: A Transparent Supply Chain for Multi-cloud and Hybrid Cloud Assets Powered by Blockchain
- Author
-
Praveen Rao, Srini Bhagavan, and Thuan Ngo
- Subjects
business.industry ,Transparency (market) ,Computer science ,Purchase order ,Software as a service ,Supply chain ,020206 networking & telecommunications ,Cloud computing ,02 engineering and technology ,Procurement ,Order (business) ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,business ,Database transaction ,Industrial organization - Abstract
Blockchain networks have made inroads in the supply chain domains due to their transparency, security and immutable logs. For cloud providers, a typical supply chain consists of complex networks of suppliers, manufacturers, distributors, retailers, auditors, and consumers. Even though there is an increased focus on self-serve or cloud provider managed SaaS (Software-as-a-service), a portion of sales for an enterprise customer occurs the old-fashioned way with the sales department drawing up a purchase order to begin the procurement process. In many cases, there could be several disjoint suppliers who fulfill the order behind the scenes unbeknownst to the buyer. In this paper we propose C3HSB, a decentralized distributed application architecture using blockchain and smart contracts to build the supply chain. Motivated by challenges in the industry, this approach can be considered as a building block for future supply chain procurement and deployment of cloud assets. The design goal for our system was to facilitate efficient handling of enterprise-scale purchase orders and alternatives in real-time while maintaining data provenance. C3HSB streamlines complex non-repudiated transaction workflows, promotes trust, and achieves cost savings for the customer and suppliers.
- Published
- 2021
7. Deepfakes for Histopathology Images: Myth or Reality?
- Author
-
Praveen Rao, Arun Zachariah, Nouf Alrasheed, Deepthi S. Rao, and Shivika Prasanna
- Subjects
medicine.medical_specialty ,business.industry ,Computer science ,Deep learning ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Digital pathology ,Public concern ,Pattern recognition ,02 engineering and technology ,Food and drug administration ,Cancer genome ,Pattern recognition (psychology) ,0202 electrical engineering, electronic engineering, information engineering ,medicine ,ComputingMilieux_COMPUTERSANDSOCIETY ,020201 artificial intelligence & image processing ,Histopathology ,Artificial intelligence ,business - Abstract
Deepfakes have become a major public concern on the Internet as fake images and videos could be used to spread misleading information about a person or an organization. In this paper, we explore if deepfakes can be generated for histopathology images using advances in deep learning. This is because the field of digital pathology is gaining a lot of momentum since the Food and Drug Administration (FDA) approved a few digital pathology systems for primary diagnosis and consultation in the United States. Specifically, we investigate if state-of-the- art generative adversarial networks (GANs) can produce fake histopathology images that can trick an expert pathologist. For our investigation, we used whole slide images (WSIs) hosted by The Cancer Genome Atlas (TCGA). We selected 3 WSIs of colon cancer patients and produced 100,000 patches of 256×256 pixels in size. We trained three popular GANs to generate fake patches of the same size. We then constructed a set of images containing 30 real and 30 fake patches. An expert pathologist reviewed these images and marked them as either real or fake. We observed that the pathologist marked 10 fake patches as real and correctly identified 34 patches (as fake or real). Thirteen patches were incorrectly identified as fake. The pathologist was unsure of 3 fake patches. Interestingly, the fake patches that were correctly identified by the pathologist, had missing morphological features, abrupt background change, pleomorphism, and other incorrect artifacts. Our investigation shows that while certain parts of a histopathology image can be mimicked by existing GANs, the intricacies of the stained tissue and cells cannot be fully captured by them. Unlike radiology, where it is relatively easier to manipulate an image using a GAN, we argue that it is a harder challenge in digital pathology to generate an entire WSI that is fake.
- Published
- 2020
8. RefinedFed: A Refining Algorithm for Federated Learning
- Author
-
Praveen Rao and Mohamed Gharibi
- Subjects
Distributed database ,Group method of data handling ,Computer science ,Server ,Telecommunications link ,Host (network) ,Algorithm ,Server-side ,MNIST database ,Data modeling - Abstract
Federated learning (FL) is a machine learning approach where the goal is to train a centralized model using a large number of clients that host private datasets. FL trains a smaller version of the model at each dataset site and then aggregates all the models at the server. In practice, clients (i.e., dataset holders) that participate in the learning process may possess corrupted or noisy datasets resulting in low accuracy models. Additionally, malicious clients may poison the data or carry out model discovery attacks.In this paper, we propose a refining algorithm called RefinedFed, to eliminate corrupted, low accuracy, and noisy models that can negatively impact the centralized model by reducing its accuracy or cause other malicious activities. Furthermore, RefinedFed reduces the uplink communication cost with the centralized server, which in return results in faster aggregation on the server side. Based on our preliminary experiments on the MNIST dataset, we observed that RefinedFed improved the global model accuracy from 84% to 91% while consuming less time for aggregation.
- Published
- 2020
9. A Gossip-Based System for Fast Approximate Score Computation in Multinomial Bayesian Networks
- Author
-
Kobus Barnard, Anas Katib, Praveen Rao, Monica Senapati, and Arun Zachariah
- Subjects
Theoretical computer science ,Distributed database ,Computer science ,Computation ,Hash function ,Probabilistic logic ,Bayesian network ,02 engineering and technology ,Gossip ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Multinomial distribution ,Random variable - Abstract
In this paper, we present a system for fast approximate score computation, a fundamental task for score-based structure learning of multinomial Bayesian networks. Our work is motivated by the fact that exact score computation on large datasets is very time consuming. Our system enables approximate score computation on large datasets in an efficient and scalable manner with probabilistic error bounds on the statistics required for score computation. Our system has several novel features including gossip-based decentralized computation of statistics, lower resource consumption via a probabilistic approach of maintaining statistics, and effective distribution of tasks for score computation using hashing techniques. The demo will provide a real-time and interactive experience to a user on how our system employs the principle of gossiping and hashing techniques in a novel way for fast approximate score computation. The user will be able to control different aspects of our system's execution on a cluster with up to 32 nodes. The approximate scores output by our system can be then used by existing score-based structure learning algorithms.
- Published
- 2019
10. A Method for Scalable First-Order Rule Learning on Twitter Data
- Author
-
Laurent Njilla, Praveen Rao, and Monica Senapati
- Subjects
Theoretical computer science ,Computer science ,Scalability ,Graph partition ,Graph (abstract data type) ,Probabilistic inference ,First order ,Partition (database) ,Graph - Abstract
We propose a method for scalable first-order rule learning on large-scale Twitter data. By learning rules, probabilistic inference queries can be executed to reason over the data to ascertain its veracity. Our method employs a divide-and-conquer approach, graph-based modeling, and data parallel processing during rule learning using a commodity cluster to overcome the problem of slow structure learning on large-scale Twitter data. The first-order predicates (constructed on the posts) are first partitioned in a balanced way by pivoting around users to reduce the chance of missing relevant rules. By constructing a weighted graph and applying graph partitioning, balanced partitions of the ground predicates can be created. Each partition is then processed using an existing structure learning approach to get the set of rules for that partition. We report a preliminary evaluation of our method to show that it offers a promising solution for scalable first-order rule learning on Twitter data.
- Published
- 2019
11. Teaching Parallel Programming with Active Learning
- Author
-
Mohammad Amin Kuhail, Joshua W. Neustrom, Spencer Cook, and Praveen Rao
- Subjects
Computer science ,business.industry ,020206 networking & telecommunications ,02 engineering and technology ,Parallel computing ,021001 nanoscience & nanotechnology ,Visualization ,Software ,Parallel processing (DSP implementation) ,Active learning ,ComputingMilieux_COMPUTERSANDEDUCATION ,0202 electrical engineering, electronic engineering, information engineering ,Task analysis ,Computer science curriculum ,Software system ,0210 nano-technology ,business ,Curriculum - Abstract
Today parallel computing is essential for the success of many real-world applications and software systems. Nonetheless, most computer science undergraduate courses teach students how to think and program sequentially. Further, software professionals have complained about the computer science curriculum's lag behind industry in their failing to cover modern programming technologies such as parallel programming. The emphasis on parallel programming has become even more important due to the increasing adoption of horizontal scaling approaches to cope with massive datasets. In order to help students coming from a serial curriculum comprehend parallel concepts, we used an innovative approach that utilized active learning, visualizations, examples, discussions, and practical exercises. Further, we conducted an experiment to examine the effect of active learning on students' understanding of parallel programming. Results indicate that the students that were actively engaged with the material performed better in terms of understanding parallel programming concepts than other students.
- Published
- 2018
12. Probabilistic Inference on Twitter Data to Discover Suspicious Users and Malicious Content
- Author
-
Anas Katib, Laurent Njilla, Kevin Kwiat, Charles A. Kamhoua, and Praveen Rao
- Subjects
business.industry ,Computer science ,Big data ,Statistical relational learning ,Probabilistic logic ,Inference ,Computer security ,computer.software_genre ,World Wide Web ,Knowledge base ,Malware ,The Internet ,Social media ,business ,computer - Abstract
While the power of social media on the Internet is undeniable, it has become a major weapon for launching cyberattacks against an organization and its people. Today, there is a growing number of cyberattacks being launched through social media such as posting of false content from hacked accounts, posting malicious URLs to spread malware, and others. In this paper, we present a simple and flexible unified framework called SocialKB for modeling social media posts and reasoning about them to ascertain their veracity, a first step towards discovering emerging cyber threats. SocialKB is based on Markov Logic Networks (MLNs), a popular representation in statistical relational learning. It learns a knowledge base (KB) on the social media posts and users' behavior in a unified manner. By conducting probabilistic inference on the KB, SocialKB can identify suspicious users and malicious content. In this work, we specifically focus on tweets posted by users on Twitter. Finally, we report an evaluation of SocialKB on 20,000 tweets and discuss our early inference results.
- Published
- 2016
13. Mobile Computing, Internet of Things, and Big Data for Urban Informatics
- Author
-
Anirban Mondal, Praveen Rao, and Sanjay Kumar Madria
- Subjects
business.industry ,Computer science ,020208 electrical & electronic engineering ,Big data ,Mobile computing ,Information technology ,02 engineering and technology ,World Wide Web ,020204 information systems ,Informatics ,ComputingMilieux_COMPUTERSANDEDUCATION ,0202 electrical engineering, electronic engineering, information engineering ,Key (cryptography) ,Use case ,Internet of Things ,business ,Urban informatics - Abstract
Urban informatics is emerging as a new discipline for cities and governments to improve the lives of citizens using information technology. In this advanced seminar, we introduce the key challenges and opportunities in urban informatics, discuss topics in mobile computing, Internet of Things (IoT) and big data analytics, to advance the state-of-the-art in urban informatics and provide interesting use cases. This seminar is designed for academicians, researchers, city administrators/planners, application developers, and research students with background in mobile computing and database systems.
- Published
- 2016
14. A tool for Internet-scale cardinality estimation of XPath queries over distributed semistructured data
- Author
-
Praveen Rao, Anas Katib, and Vasil Slavov
- Subjects
computer.internet_protocol ,Computer science ,business.industry ,Distributed computing ,Cloud computing ,Load balancing (computing) ,computer.software_genre ,Cardinality ,Gossip ,Scalability ,The Internet ,Data mining ,business ,computer ,XML ,XPath - Abstract
We present a novel tool called XGossip for Internet-scale cardinality estimation of XPath queries over distributed XML data. XGossip relies on the principle of gossip, is scalable, decentralized, and can cope with network churn and failures. It employs a novel divide-and-conquer strategy for load balancing and reducing the overall network bandwidth consumption. It has a strong theoretical underpinning and provides provable guarantees on the accuracy of cardinality estimates, the number of messages exchanged, and the total bandwidth usage. In this demonstration, users will experience three engaging scenarios: In the first scenario, they can set up, configure, and deploy XGossip on Amazon Elastic Compute Cloud (EC2). In the second scenario, they can execute XGossip, pose XPath queries, observe in real-time the convergence speed of XGossip, the accuracy of cardinality estimates, the bandwidth usage, and the number of messages exchanged. In the third scenario, they can introduce network churn and failures during the execution of XGossip and observe how these impact the behavior of XGossip.
- Published
- 2014
15. Jeev: A Low-Cost Cell Phone Application for Tracking the Vaccination Coverage of Children in Rural Communities
- Author
-
Anas Katib, Deepthi S. Rao, Karen B. Williams, and Praveen Rao
- Subjects
National identification ,business.industry ,Internet privacy ,Mobile computing ,Developing country ,Computer security ,computer.software_genre ,Immunization ,Phone ,Software deployment ,Vaccination coverage ,Tracking (education) ,business ,computer - Abstract
Immunization saves millions of lives against vaccine-preventable diseases. Yet, 24 million children born every year do not receive proper immunization during their first year. UNICEF and WHO have emphasized the need to strengthen the immunization surveillance and monitoring in developing countries to reduce childhood deaths. In this regard, we present a software application called Jeev to track the vaccination coverage of children in rural communities. Jeev synergistically combines the power of smart phones and the ubiquity of cellular infrastructure, QR codes, and national identification cards. We present the design of Jeev and highlight its unique features along with a preliminary evaluation of its performance. We plan to pilot test Jeev in a rural population to study its effectiveness and identify socio-cultural issues that may arise in a large-scale deployment.
- Published
- 2013
16. An Internet-Scale Service for Publishing and Locating XML Documents
- Author
-
Praveen Rao and Bongki Moon
- Subjects
Information retrieval ,computer.internet_protocol ,Computer science ,Efficient XML Interchange ,XML Signature ,XML validation ,computer.file_format ,Query language ,Database index ,Distributed hash table ,World Wide Web ,XML Schema Editor ,Streaming XML ,Xml indexing ,computer ,XML ,XPath - Abstract
In recent years, there has been a growing interest for peer-to-peer (P2P) based computing and applications. One of the most important challenges in P2P environments is to quickly locate relevant data across many participating peers. In this demonstration, we present psiX, which is an Internet-scale service for publishing and locating XML documents. This service runs on several PlanetLab nodes geographically spread across the globe. The psiX system adopts a suite of new techniques for XML indexing and pattern matching in a P2P network, namely, (a) representing XML documents and XPath queries compactly via algebraic signatures, (b) searching signatures of documents and value summaries indexed using distributed hierarchical indexesbuilt over a Distributed Hash Table (DHT), and (c) gracefully adapting to failures while running on the Internet, where failures are a norm rather than an exception.
- Published
- 2009
17. SketchTree: Approximate Tree Pattern Counts over Streaming Labeled Trees
- Author
-
Praveen Rao and Bongki Moon
- Subjects
Tree rotation ,Fractal tree index ,Incremental decision tree ,Tree (data structure) ,Theoretical computer science ,Computer science ,Data stream mining ,Segment tree ,Approximation algorithm ,Algorithm design ,Interval tree ,Search tree ,Range tree - Abstract
In recent years, there has been a rising interest in developing online approximation algorithms for data streams. Some of the key challenges are posed by the fact that streaming data can be read only once in a fixed order of arrival and only a limited amount of memory is available for storage. In this paper, we address the problem of approximately counting tree patterns over a stream of labeled trees (e.g., XML documents). We propose a new approximation algorithm called SketchTree that computes a synopsis of the stream in a single pass by processing each tree only once. Using a limited amount of memory, SketchTree provides approximate answers for both ordered and unordered tree pattern counts. Furthermore, we discuss a class of count queries that can be handled by SketchTree and their utility. We provide theoretical analyses to show that our algorithm has provably strong guarantees on the error bounds. Experiments on real datasets demonstrate that SketchTree can indeed estimate tree pattern counts within 10-15% relative error with high confidence under various situations.
- Published
- 2006
18. PRIX: indexing and querying XML using prufer sequences.
- Author
-
Praveen Rao and Moon, B.
- Published
- 2004
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.