The Soil Moisture Active Passive (SMAP) mission has delivered valuable sensing of surface soil moisture since 2015. However, it has a short time span and irregular revisit schedules. Utilizing a state‐of‐the‐art time series deep learning neural network, Long Short‐Term Memory (LSTM), we created a system that predicts SMAP level‐3 moisture product with atmospheric forcings, model‐simulated moisture, and static physiographic attributes as inputs. The system removes most of the bias with model simulations and improves predicted moisture climatology, achieving small test root‐mean‐square errors (<0.035) and high‐correlation coefficients >0.87 for over 75% of Continental United States, including the forested southeast. As the first application of LSTM in hydrology, we show the proposed network avoids overfitting and is robust for both temporal and spatial extrapolation tests. LSTM generalizes well across regions with distinct climates and environmental settings. With high fidelity to SMAP, LSTM shows great potential for hindcasting, data assimilation, and weather forecasting. Soil moisture is the water content in soil, and it is a critical component of the water cycle. It controls whether crops or wild vegetation can function properly, the risk of wildfire, and the likelihood of floods. The NASA satellite SMAP, launched in 2015, measures soil moisture near the ground surface over Earth at high accuracy. SMAP data are of great value to global communities to which soil moisture is relevant. However, since it was launched only recently, there is only little overlap with other data sets, which limits its use. To extend SMAP's observations in time, we employ a “deep learning” technology called Long Short‐Term Memory (LSTM), which is one of the pillars of artificial intelligence and is bringing revolutionary changes in many scientific fields and our daily lives. We use LSTM to learn patterns of soil moisture dynamics and where physics‐based models make mistakes in describing moisture changes. This approach shows great promise for projecting SMAP observations into the long past: because soil moisture has a short memory, 2 years of data seem sufficient to train LSTM to successfully capture its dynamics. Because LSTM can handle large and diverse data, it offers valuable alternatives to older statistical methods. With 2 years of data, SMAP L3 data can be extended at high fidelity using a deep learning network (LSTM), showing potential for hindcastingDespite significant, spatially varying bias in Land Surface Models, LSTM can remove bias, correct moisture climatology, and capture extremesLSTM is more generalizable than simpler methods, and its strength seems to derive from its memory and ability to accommodate large data