DeepSeek-R1 explored

Why DeepSeek-R1 was a significant step for Open Source AI.

February 9, 2025 · 11 min

A view on BLT

Byte Latent Transformer (BLT), a tokenizer-free architecture for NLP.

January 2, 2025 · 5 min

LLM Quantization in a nutshell

An exploration of LLM quantization methods.

January 28, 2024 · 5 min

What is PEFT?

Fine-tuning of Large Language Models with Parameter Efficient.

November 1, 2023 · 11 min

Hands on with Retrieval Augmented Generation

Build a Chatbot that uses Retrieval Augmented Generation to retrieve domain specific knowledge.

October 7, 2023 · 5 min