Generative AI Course — Mitra AI Projects

overview

What is Generative AI?

Generative AI creates new content — text, images, code, audio — by learning statistical patterns from large amounts of training data. LLMs like GPT-4, Claude, and Gemini are the most widely used form today.

Key Concepts

Concept	Plain-English Meaning
Token	The basic unit of text an LLM processes — roughly 3–4 characters per token
Context Window	Maximum tokens the model can "see" at once (e.g., 128k for Claude 3.5)
Temperature	Controls randomness — 0 = deterministic, 1+ = creative/random
Hallucination	When the model generates plausible-sounding but incorrect facts
Foundation Model	A large pre-trained model that can be adapted to many tasks
In-Context Learning	Guiding the model with examples in the prompt — no retraining needed

Current Model Landscape (2026)

Claude Sonnet 4.6GPT-4oGemini 1.5 ProLlama 3.1MistralQwen

Interactive Notebook

⚡

Notebook: What is GenAI

Tokenisation, LLM landscape, temperature, cost estimation

First load ~30-60s · Saves automatically

Open Notebook

Quiz

Test your understanding -- 10 questions, 70% to pass.

Take Quiz

Pattern	When to use	Example
Zero-shot	Simple tasks with clear instructions	"Summarize this in 3 bullet points."
Few-shot	When you need format consistency	Show 2–3 examples before the actual request
Chain of Thought	Math, reasoning, multi-step problems	"Think step by step before answering."
Role Prompting	Tone and expertise control	"You are a senior ML engineer reviewing code."
Output Format	Structured output (JSON, table)	"Return as JSON with keys: name, score, reason."
Delimiters	Prevent prompt injection	Wrap user input in <doc>...</doc>

Pattern

When to use

Example

Zero-shot

Simple tasks with clear instructions

"Summarize this in 3 bullet points."

Few-shot

When you need format consistency

Show 2–3 examples before the actual request

Chain of Thought

Math, reasoning, multi-step problems

"Think step by step before answering."

Role Prompting

Tone and expertise control

"You are a senior ML engineer reviewing code."

Output Format

Structured output (JSON, table)

"Return as JSON with keys: name, score, reason."

Delimiters

Prevent prompt injection

Wrap user input in <doc>...</doc>

from openai import OpenAI client = OpenAI() response = client.chat.completions.create( model="gpt-4o-mini", messages=[ {"role": "system", "content": "You are a Python code reviewer."}, {"role": "user", "content": f"Review this code:\n\n<code>\n{code}\n</code>"} ], temperature=0.2, ) print(response.choices[0].message.content)

import chromadb from openai import OpenAI client = OpenAI() chroma = chromadb.Client() collection = chroma.get_or_create_collection("docs") def embed(text): res = client.embeddings.create(input=text, model="text-embedding-3-small") return res.data[0].embedding def rag_query(question): q_vec = embed(question) results = collection.query(query_embeddings=[q_vec], n_results=3) context = "\n".join(results["documents"][0]) resp = client.chat.completions.create( model="gpt-4o-mini", messages=[{"role": "user", "content": f"Context:\n{context}\n\nQ: {question}"}] ) return resp.choices[0].message.content

Database	Best for	Notes
ChromaDB	Local dev, prototypes	Easy to run, Python-native
Pinecone	Production, managed	Scalable, fully managed cloud
Supabase pgvector	Postgres-based apps	SQL + vector search in one DB
Qdrant	Open-source production	Payload filtering, self-hosted
Weaviate	Multi-modal search	GraphQL API, schemas

Database

Best for

Notes

ChromaDB

Local dev, prototypes

Easy to run, Python-native

Pinecone

Production, managed

Scalable, fully managed cloud

Supabase pgvector

Postgres-based apps

SQL + vector search in one DB

Qdrant

Open-source production

Payload filtering, self-hosted

Weaviate

Multi-modal search

GraphQL API, schemas

Strategy	What it measures	When to use
BLEU / ROUGE	N-gram overlap with reference text	Translation, summarization with gold references
Human Evaluation	Relevance, coherence, helpfulness	Gold standard, but expensive
LLM-as-Judge	Use a strong LLM to score outputs	Scalable, automated, can use Likert or pairwise
RAG-Specific	Faithfulness, answer relevance, context recall	RAGAs framework
Task-specific	F1, accuracy, exact match on benchmarks	Classification, extraction tasks

Strategy

What it measures

When to use

BLEU / ROUGE

N-gram overlap with reference text

Translation, summarization with gold references

Human Evaluation

Relevance, coherence, helpfulness

Gold standard, but expensive

LLM-as-Judge

Use a strong LLM to score outputs

Scalable, automated, can use Likert or pairwise

RAG-Specific

Faithfulness, answer relevance, context recall

RAGAs framework

Task-specific

F1, accuracy, exact match on benchmarks

Classification, extraction tasks

Library	Purpose
transformers	Load and run pre-trained models (BERT, GPT-2, Llama, etc.)
datasets	Access 10,000+ public datasets with one line of code
tokenizers	Fast tokenization library used by all HF models
PEFT	LoRA, prefix tuning, and other efficient fine-tuning methods
Spaces	Deploy ML demos (Gradio/Streamlit) for free on HF
Inference API	Call HF models via REST API without deploying yourself

Library

Purpose

transformers

Load and run pre-trained models (BERT, GPT-2, Llama, etc.)

datasets

Access 10,000+ public datasets with one line of code

tokenizers

Fast tokenization library used by all HF models

PEFT

LoRA, prefix tuning, and other efficient fine-tuning methods

Spaces

Deploy ML demos (Gradio/Streamlit) for free on HF

Inference API

Call HF models via REST API without deploying yourself

from transformers import pipeline # Sentiment analysis clf = pipeline("text-classification") result = clf("This project guide is excellent!") # Text generation gen = pipeline("text-generation", model="gpt2") text = gen("Machine learning is", max_new_tokens=50)

What is Generative AI?

Key Concepts

Current Model Landscape (2026)

Interactive Notebook

Quiz

Prompt Engineering

Core Patterns

Code Example — OpenAI API

Interactive Notebook

Quiz

RAG — Retrieval-Augmented Generation

RAG Architecture

Code Example

💡 Document Q&A Assistant

Interactive Notebook

Quiz

Fine-Tuning LLMs

When Fine-Tuning is Worth It

LoRA Key Idea

Interactive Notebook

Quiz

Embeddings & Vector Databases

Vector DB Comparison

Interactive Notebook

Quiz

LLM Evaluation

Evaluation Strategies

Interactive Notebook

Quiz

HuggingFace Ecosystem

Key Libraries

Quick Start

Interactive Notebook

Quiz

Kaggle with LLMs

Kaggle LLM Competition Workflow

Interactive Notebook

Quiz