AI PPT Generator

Recreate NotebookLM's AI PPT feature and extend it into a controllable, editable, model-configurable PPT workbench that converts papers, documents, and other materials into beautiful PPT images.

Watch the HD demo video

The demo covers uploading doc/L9.md, entering custom requirements, generating and editing the design outline, confirming page designs, generating a 6-slide deck, editing one slide, confirming the replacement, and exporting PDF/PPTX. Model waiting time is fast-forwarded.

More Than A Clone

NotebookLM's PPT feature is closer to a one-click result generator, with limited visibility into the design process and limited per-slide control. This project turns the workflow into an understandable, editable workbench:

Visible process: Review the deck outline and page-by-page design notes before image generation
Per-slide control: Edit any slide independently, generate new versions, revert history, and confirm replacements
Model control: Configure separate OpenAI-compatible models for text planning, image generation, and image editing
Local-first config: Manage model connections through local config.yaml or WebUI local API configuration; saved projects and exported files do not include API keys
Export-ready output: Export generated decks to PDF/PPTX for presentation or further editing

✨ Features

🎨 Per-slide image generation: Create an editable outline and page designs before converting them into PPT page images
🌐 PPT Workbench: Upload sources, configure model roles, preview slides, edit pages, track history, and export
📝 Multi-format parsing: Supports .md/.txt/.pdf/.docx/.pptx input and converts content to Markdown
✏️ Full-page image editing: Edit each generated slide independently, revert history, and confirm replacements
🔀 Three model roles: Configure prompt_model, image_model, and edit_model separately
🖼️ Image result compatibility: Accepts URLs, Markdown image links, data URLs, b64_json, and raw base64
💾 Local multi-project persistence: Save multiple PPT projects in the browser, including source content, outline, page designs, generated images, and per-slide edit history

🚀 Quick Start

1. Installation & Configuration

# Clone the project
git clone <repository-url>
cd OpenNotebookLM-AIPPT

# Configure API keys
cp config.example.yaml config.yaml
# Edit config.yaml and fill in your API keys

2. Start Services

Option 1: WebUI Interface (Recommended)

# One-click start for both frontend and backend
./start.sh

After startup, visit:

🎨 Frontend: http://localhost:5173
📚 API Docs: http://localhost:8000/docs

Option 2: Start Frontend and Backend Separately

# Terminal 1: Start backend
./start-api.sh

# Terminal 2: Start frontend
cd web && npm install && npm run dev

Option 3: Command Line Usage

# Install dependencies
pip install -r requirements.txt

# Basic usage
python main.py -i doc/L9.md -n 5

# Generate prompts only
python main.py -i doc/L9.md -n 5 --prompt-only -o prompts.json

# Generate from prompt file
python main.py --from-prompt prompts.json

Local Project Persistence

AIPPT stores project content and image assets in the current browser profile's IndexedDB, and uses localStorage for the active project id, UI preferences, and local API configuration. Saved project data includes uploaded sources, content settings, design outlines, page designs, generated images, edited versions, and image data needed for export.

Notes:

Clearing browser site data removes local projects.
Projects do not automatically sync across browsers or devices.
API keys belong to local API configuration; they are not written into saved project records and are not included in exported PDF/PPTX files.

3. WebUI Usage Flow

Upload Document: Drag and drop or click to upload a source file in the left panel
Configure Models: Configure text, image generation, and image editing model roles
Set Parameters & Requirements: Choose page count, resolution, aspect ratio, language, style, audience, and custom requirements
Confirm Design: Generate an editable outline, confirm it, then review the generated page designs
Generate PPT: Generate slide images after page-design confirmation and watch real-time progress
Preview & Edit: Preview generated slides in the right panel and edit a single page when needed
Export: Export to PDF or PPTX

The built-in demo source is doc/L9.md. This is a repository-relative path, so a fresh clone can use it directly in the WebUI or CLI examples.

📁 Project Structure

OpenNotebookLM-AIPPT/
├── src/                    # Core logic
├── api/                    # FastAPI backend
├── web/                    # React frontend
├── tests/                  # Tests
├── doc/                    # Input documents directory
│   └── L9.md               # Default demo source
├── config.yaml             # Configuration file
├── start.sh                # One-click startup script
└── main.py                 # CLI entry point

⚙️ Configuration

All configurations are managed in config.yaml, including:

API configuration (prompt_model, image_model, edit_model)
PPT default settings (language, style, page count)
Timeout and retry settings

See config.example.yaml for detailed configuration examples.

Using OpenAI Compatible API

api:
  models:
    prompt_model:
      adapter: "openai_chat"
      model: "gpt-4o"
      base_url: "https://api.openai.com/v1"
      api_key: "sk-xxx"
    image_model:
      adapter: "raw_chat_multimodal"
      model: "gpt-image-2"
      base_url: "https://api.example.com/v1"
      api_key: "sk-xxx"
    edit_model:
      adapter: "raw_chat_multimodal"
      model: "gpt-image-2"
      base_url: "https://api.example.com/v1"
      api_key: "sk-xxx"

📤 Output Structure

output/ppt_20241201_123456/
├── source_material.txt      # Original input material
├── prompts.json             # Generated prompts
├── result.json              # Generation result
├── presentation.pdf         # Exported PDF
└── images/                  # Slide images

📋 TODO

Upgrade generated PPT images into structured, editable PPT content
Support region selection for partial slide editing
Add more provider profile templates

📄 License

Apache License 2.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI PPT Generator

More Than A Clone

✨ Features

🚀 Quick Start

1. Installation & Configuration

2. Start Services

Local Project Persistence

3. WebUI Usage Flow

📁 Project Structure

⚙️ Configuration

Using OpenAI Compatible API

📤 Output Structure

📋 TODO

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 208 Commits
api		api
doc		doc
docs/assets		docs/assets
src		src
tests		tests
web		web
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_zh.md		README_zh.md
config.example.yaml		config.example.yaml
main.py		main.py
requirements.txt		requirements.txt
start-api.sh		start-api.sh
start.sh		start.sh

Folders and files

Latest commit

History

Repository files navigation

AI PPT Generator

More Than A Clone

✨ Features

🚀 Quick Start

1. Installation & Configuration

2. Start Services

Local Project Persistence

3. WebUI Usage Flow

📁 Project Structure

⚙️ Configuration

Using OpenAI Compatible API

📤 Output Structure

📋 TODO

📄 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages