A Comprehensive Guide on Machine Learning System Design Post date July 7, 2025 Post author By Kuriko Iwai Post categories In deep-learning, deployment, machine-learning, system-design-concepts
Time Series Is Everywhere—Here’s How to Actually Forecast It Post date July 7, 2025 Post author By superorange0707 Post categories In ai, data-science, deep-learning, lstm, machine-learning, programming, python, time-series
Deep Reinforcement Learning for Self-Evolving AI Post date June 27, 2025 Post author By Kuriko Iwai Post categories In ai, deep-learning, machine-learning, pytorch, reinforcement-learning
Human Learning No Longer Exists – Enter Human Meaning Post date June 25, 2025 Post author By Jin Park Post categories In ai-ethics, artificial-intelligence, Automation, deep-learning, future-of-work, human-centric-ai, human-learning, philosophy-of-ai
Building Deep Feedforward Networks Post date June 6, 2025 Post author By Kuriko Iwai Post categories In deep-learning, machine-learning, mathematics, neural-networks, python
Accelerating Neural Networks: The Power of Quantization Post date May 28, 2025 Post author By Vladislav Ag Post categories In accelerating-neural-networks, ai, deep-learning, neural-networks, pytorch, quantization, symmetric-quantization, what-is-quantization
How Griffin’s Local Attention Window Beats Global Transformers at Their Own Game Post date May 27, 2025 Post author By Deplatform Post categories In ai-research, attention-mechanism, deep-learning, griffin-model, language-models, machine-learning, nlp, transformers
A Friendly Introduction To Model Pruning With PyTorch Post date May 20, 2025 Post author By Sahib Dhanjal Post categories In ai, compression, deep-learning, pruning, pytorch
torch.compile — The Missing Manual Post date May 13, 2025 Post author By Sahib Dhanjal Post categories In ai, deep-learning, python, pytorch, tips
Quantization-Aware Training With PyTorch Post date May 5, 2025 Post author By Sahib Dhanjal Post categories In deep-learning, Optimization, python, pytorch, quantization
Zero‑Reboot GPU Power: CUDA 12 on WSL 2 in 30 Minutes Post date April 30, 2025 Post author By Christian Loschiavo Post categories In cuda, deep-learning, pytorch, Windows 11, wsl-2
PyTorch — A Comprehensive Performance Tuning Guide Post date April 17, 2025 Post author By Sahib Dhanjal Post categories In ai, deep-learning, python, pytorch, tutorial
Converting Unstructured Data into a Knowledge Graph Using an End-to-End Pipeline Post date April 17, 2025 Post author By Fareed Khan Post categories In ai, artificial-intelligence, deep-learning, machine-learning, python
Fraud Detection Using Artificial Intelligence and Machine Learning Post date April 16, 2025 Post author By Nikhil Kapoor Post categories In ai, ai-fraud-detection, anomaly-detection, deep-learning, feature-engineering, machine-learning, supervised-learning, unsupervised-ml
Only the Beginning Matters: How the LLM Decides Where to Focus Attention Post date April 11, 2025 Post author By Salvatore Raieli Post categories In artificial-intelligence, data-science, deep-learning, machine-learning, programming
Training Deep-Learning Models At Ultra-Scale Using PyTorch Post date April 9, 2025 Post author By Sahib Dhanjal Post categories In ai, deep-learning, model-optimization, model-parallelism, scalability
Stepping into the Future of 3D Vision with VGGT Post date April 3, 2025 Post author By Cristian Leo Post categories In artificial-intelligence, computer-vision, deep-learning, mathematics, python
Scaling Up: A Beginner’s Guide to Multi-Node Distributed Training in PyTorch Post date March 30, 2025 Post author By Christian Loschiavo Post categories In artificial-intelligence, cuda, deep-learning, nvidia, Training
Mastering GPU Memory Management With PyTorch and CUDA Post date March 25, 2025 Post author By Sahib Dhanjal Post categories In ai, cuda, deep-learning, python, pytorch
Linear Attention and Long Context Models Post date March 15, 2025 Post author By Rendering Technology Breakthroughs Post categories In deep-learning, high-throughput-ai, induction-heads-in-ai, long-context-processing, selective-state-space-models, sequence-modeling-with-ssms, state-space-models, transformer-model-alternatives
State Space Models vs RNNs: The Evolution of Sequence Modeling Post date March 15, 2025 Post author By Rendering Technology Breakthroughs Post categories In deep-learning, high-throughput-ai, induction-heads-in-ai, long-context-processing, selective-state-space-models, sequence-modeling-with-ssms, state-space-models, transformer-model-alternatives
How AI Chooses What Information Matters Most Post date March 15, 2025 Post author By Rendering Technology Breakthroughs Post categories In deep-learning, high-throughput-ai, induction-heads-in-ai, long-context-processing, selective-state-space-models, sequence-modeling-with-ssms, state-space-models, transformer-model-alternatives
The HackerNoon Newsletter: Why The Hell is Observability So Darn Expensive!? (3/14/2025) Post date March 14, 2025 Post author By Noonification Post categories In ai, artificial-intelligence, deep-learning, editing-protocol, hackernoon-newsletter, latest-tect-stories, noonification, observability
Unleashing the Beast: Building a Production-Grade, Real-Time Anomaly Detection Pipeline for… Post date March 12, 2025 Post author By AK Post categories In anomaly-detection, data-pipeline, deep-learning, machine-learning, production-ml
Unlock the Hidden Secrets of Machine Learning: A 10-Year Expert’s Journey from Theory to Code That… Post date March 7, 2025 Post author By AK Post categories In artificial-intelligence, coding, data-science, deep-learning, machine-learning
Mastering time series analysis with Python Post date March 7, 2025 Post author By Katy Post categories In deep-learning, machine-learning, python, statistical-analysis, time-series-analysis
Artificial Intelligence Is Not What You Think It Is Post date March 3, 2025 Post author By Luca Derumier Post categories In ai, artificial-intelligence, deep-learning, machine-learning, technology
The Math Behind nn.BCELoss() Post date February 28, 2025 Post author By Cristian Leo Post categories In artificial-intelligence, data-science, deep-learning, machine-learning, python
FAISS & RAG: The Dynamic Duo of Knowledge-Powered AI Post date February 25, 2025 Post author By Cristian Leo Post categories In artificial-intelligence, data-science, deep-learning, machine-learning, python
How To Optimize Memory Usage For Training LLMs In PyTorch Post date February 24, 2025 Post author By Sahib Dhanjal Post categories In ai, deep-learning, how-to, python, pytorch
Revolutionizing Deep Learning: Advanced Knowledge Distillation for Optimized Teacher-Student Model… Post date February 14, 2025 Post author By Hasitha Pathum Post categories In ai, deep-learning, distillation, Health, technology
The Chinese Software Industry is Shifting From the Dinosaur Model to the Monkey-Troop Model Post date February 11, 2025 Post author By William Guo Post categories In deep-learning, deepseek, dinosaur-software, it-infrastructure, llms, monkey-troop-software, software, top
Transformer-Squared: Stop Finetuning LLMs Post date February 10, 2025 Post author By Cristian Leo Post categories In artificial-intelligence, data-science, deep-learning, large-language-models, machine-learning
How To Train Your PyTorch Models (Much) Faster Post date February 10, 2025 Post author By Sahib Dhanjal Post categories In ai, deep-learning, python, pytorch, tutorial
Transformers for Long-Term Time Series Forecasting Post date February 4, 2025 Post author By @panData Post categories In data-science, deep-learning, forecasting, machine-learning, programming
A New Approach to Attention — Differential Transformers | Paper Walkthrough and PyTorch… Post date January 28, 2025 Post author By Shubh Mishra Post categories In ai, deep-learning, gpt, mls, transformers
Fine-Tuning of DeepSeek LLM for Text Classification and Sentiment Analysis: Techniques, Code… Post date January 28, 2025 Post author By Ranjeet Tiwari | Senior Architect - AI | IITJ Post categories In artificial-intelligence, deep-learning, deepseek-r1, fine-tuning, large-language-models
Text Classification in the era of Transformers Post date January 27, 2025 Post author By Bhujith Madav Velmurugan Post categories In classification, deep-learning, genai, nlp, transformers
AI Is Now Creating Antidotes for Snake Venom Post date January 23, 2025 Post author By Zac Amos Post categories In ai, Biology, deep-learning, healthcare, medical-ai, protein-engineering, Science, tech-for-good
Creating Human Faces from Scratch: A Hands-On Guide to GANs Post date January 17, 2025 Post author By Harish Siva Subramanian Post categories In deep-learning, generative-adversarial, image-generation, python, pytorch
Icon Detection for Test Automation: A Deep Learning Playbook Post date January 17, 2025 Post author By Abdelkader HASSINE Post categories In artificial-intelligence, computer-vision, deep-learning, software-testing
RNNs vs. Transformers: Innovations in Scalability and Efficiency Post date January 14, 2025 Post author By Gating Post categories In ai-research, deep-learning, efficient-ai, linear-attention, rnn-models, scalable-ai, ssm-models, transformers
Hawk and Griffin: Mastering Long-Context Extrapolation in AI Post date January 14, 2025 Post author By Gating Post categories In ai-extrapolation, deep-learning, efficient-ai, griffin-model, hawk-model, language-models, long-context-ai, token-prediction
Griffin Model: Advancing Copying and Retrieval in AI Tasks Post date January 14, 2025 Post author By Gating Post categories In ai-extrapolation, copying-tasks, deep-learning, efficient-ai, griffin-model, language-models, retrieval-tasks, transformers
Hawk and Griffin Models: Superior Latency and Throughput in AI Inference Post date January 14, 2025 Post author By Gating Post categories In ai-inference, deep-learning, efficient-ai, griffin-model, hawk-model, high-throughput, low-latency, transformers
Recurrent Models: Enhancing Latency and Throughput Efficiency Post date January 14, 2025 Post author By Gating Post categories In ai-research, cache-efficiency, deep-learning, high-throughput, language-models, low-latency, recurrent-models, transformers
Recurrent Models: Decoding Faster with Lower Latency and Higher Throughput Post date January 14, 2025 Post author By Gating Post categories In ai-inference, decoding-efficiency, deep-learning, high-throughput, language-models, low-latency, recurrent-models, transformers
Training speed on longer sequences Post date January 14, 2025 Post author By Gating Post categories In ai-models, deep-learning, hawk-and-griffin-models, language-models, nlp-research, rnn-models, scalable-ai, transformers
Efficient linear recurrences on device Post date January 14, 2025 Post author By Gating Post categories In ai-research, custom-kernel, deep-learning, efficient-training, hawk-model, rg-lru-layer, scalable-ai, tpu-optimization
Efficient Training: Scaling Griffin Models for Large-Scale AI on TPUs Post date January 14, 2025 Post author By Gating Post categories In ai-model-scaling, ai-research, deep-learning, efficient-training, griffin-model, model-parallelism, scalable-ai, tpu-optimization
Hawk and Griffin Models: Superior NLP Performance with Minimal Training Data Post date January 13, 2025 Post author By Gating Post categories In ai-research, deep-learning, efficient-ai, griffin-model, hawk-model, llama-v2, nlp-performace, rnn-models
Griffin Models: Outperforming Transformers with Scalable AI Innovation Post date January 13, 2025 Post author By Gating Post categories In ai-research, chinchilla-scaling, deep-learning, efficient-ai, griffin-model, rnn-models, scalable-ai, transformers
Recurrent Models Scale as Efficiently as Transformers Post date January 13, 2025 Post author By Gating Post categories In deep-learning, efficient-ai, griffin-model, hybrid-ai, nlp-scaling, rnn-models, sequence-processing, transformers
RG-LRU: A Breakthrough Recurrent Layer Redefining NLP Model Efficiency Post date January 13, 2025 Post author By Gating Post categories In ai-models, deep-learning, efficient-ai, gating, nlp-innovations, rg-lru, rnn-models, temporal-mixing
RNN Models Hawk and Griffin: Transforming NLP Efficiency and Scaling Post date January 13, 2025 Post author By Gating Post categories In ai-models, deep-learning, efficient-ai, language-models, multi-query-attention, nlp-research, rnn-model, scalable-ai
AI Voice Conversion: Recreate Any Speaker’s Voice with VITS Post date January 10, 2025 Post author By Ben Post categories In artificial-intelligence, computer science, data-science, deep-learning, machie-learning
Implement Transformers (Bidirectional) from Scratch in PyTorch for Sequence Classification Post date January 10, 2025 Post author By Sarvesh Khetan Post categories In artificial-intelligence, deep-learning, machine-learning, transformers
Embeddings for RAG – A Complete Overview Post date November 30, 2024 Post author By Shrinivasan Sankar Post categories In artificial-intelligence, deep-learning, hackernoon-top-story, large-language-models, llms, rag, rag-embeddings, retrieval-augmented-generation
Let’s Build our own GPT Model from Scratch with PyTorch Post date November 8, 2024 Post author By Shubh Mishra Post categories In ai, chatgpt, deep-learning, pytorch, transformers
Why Advanced LLMs, Such as GPT-4 or Claude, Fail in Critical Use Cases Despite Large Training Data Post date November 8, 2024 Post author By Sanjay Nandakumar Post categories In deep-learning, llm, machine-learning, naturallanguageprocessing, nlp