- AI BEST SEARCH
- AI Glossary & Keyword Index [AI BEST SEARCH]
- LSTM (Long Short-Term Memory)
LSTM (Long Short-Term Memory)
LSTM (Long Short-Term Memory) is a type of recurrent neural network (RNN) designed to handle sequential data such as time series and natural language. Proposed by Sepp Hochreiter and Jürgen Schmidhuber in 1997, it was a groundbreaking solution to the "vanishing gradient" problem that made it difficult for standard RNNs to learn long-range dependencies. The key feature of LSTMs is the "memory cell" structure, which includes built-in mechanisms to retain or forget information over time. This memory is managed by three gate structures: 1. Input Gate: Decides how much new information to store 2. Forget Gate: Decides how much past information to discard 3. Output Gate: Controls what information to output This architecture gives LSTMs excellent ability to capture long-term context and patterns, handling the long-range dependencies that standard RNNs struggle with. Common applications: • Natural language processing tasks such as machine translation and text generation • Time series forecasting for stock prices, demand, and other metrics • Speech recognition and music generation • Anomaly detection using sensor data • Video analysis for motion and activity recognition While LSTMs have largely been superseded by Transformer-based models (e.g., BERT, GPT), they remain in active use as a lightweight and reliable architecture for resource-constrained environments and simpler sequential tasks.