Back to Search
Start Over
Deep Reinforcement Learning (DRL): Another Perspective for Unsupervised Wireless Localization
- Source :
- IEEE Internet of Things Journal. 7:6279-6287
- Publication Year :
- 2020
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2020.
-
Abstract
- Location is key to spatialize internet-of-things (IoT) data. However, it is challenging to use low-cost IoT devices for robust unsupervised localization (i.e., localization without training data that have known location labels). Thus, this paper proposes a deep reinforcement learning (DRL) based unsupervised wireless-localization method. The main contributions are as follows. (1) This paper proposes an approach to model a continuous wireless-localization process as a Markov decision process (MDP) and process it within a DRL framework. (2) To alleviate the challenge of obtaining rewards when using unlabeled data (e.g., daily-life crowdsourced data), this paper presents a reward-setting mechanism, which extracts robust landmark data from unlabeled wireless received signal strengths (RSS). (3) To ease requirements for model re-training when using DRL for localization, this paper uses RSS measurements together with agent location to construct DRL inputs. The proposed method was tested by using field testing data from multiple Bluetooth 5 smart ear tags in a pasture. Meanwhile, the experimental verification process reflected the advantages and challenges for using DRL in wireless localization.
- Subjects :
- Signal Processing (eess.SP)
FOS: Computer and information sciences
Computer Science - Machine Learning
Computer Networks and Communications
Computer science
Machine Learning (stat.ML)
02 engineering and technology
Machine learning
computer.software_genre
01 natural sciences
Machine Learning (cs.LG)
Statistics - Machine Learning
Robustness (computer science)
FOS: Electrical engineering, electronic engineering, information engineering
0202 electrical engineering, electronic engineering, information engineering
Reinforcement learning
Electrical Engineering and Systems Science - Signal Processing
Hidden Markov model
Training set
business.industry
010401 analytical chemistry
020206 networking & telecommunications
0104 chemical sciences
Computer Science Applications
Hardware and Architecture
Signal Processing
Artificial intelligence
Markov decision process
business
computer
Wireless sensor network
Information Systems
Test data
Subjects
Details
- ISSN :
- 23722541
- Volume :
- 7
- Database :
- OpenAIRE
- Journal :
- IEEE Internet of Things Journal
- Accession number :
- edsair.doi.dedup.....8fcb59a8b7a1ef1b947971487b1cab71