Skip to content


After installing Marvin, the fastest way to get started is by using one of Marvin's high-level AI components. These components are designed to integrate AI into abstractions you already know well, creating the best possible opt-in developer experience.

Configure LLM Provider

Marvin is a high-level interface for working with LLMs. In order to use it, you must configure an LLM provider. At this time, Marvin supports OpenAI's GPT-3.5 and GPT-4 models, Anthropic's Claude 1 and Claude 2 models, and the Azure OpenAI Service. The default model is OpenAI's gpt-4.

To use the default model, provide an API key:

import marvin

# to use an OpenAI model (if not specified, defaults to gpt-4)
marvin.settings.openai.api_key = YOUR_API_KEY

To use another provider or model, please see the configuration docs.

AI Models

Marvin's most basic component is the AI Model, a drop-in replacement for Pydantic's BaseModel. AI Models can be instantiated from any string, making them ideal for structuring data, entity extraction, and synthetic data generation:

from marvin import ai_model
from pydantic import BaseModel, Field

class Location(BaseModel):
    city: str
    state: str = Field(..., description="The two-letter state abbreviation")

Location("The Big Apple")
Location(city='New York', state='NY')

AI Classifiers

AI Classifiers let you build multi-label classifiers with no code and no training data. Given user input, each classifier uses a clever logit bias trick to force an LLM to deductively choose the best option. It's bulletproof, cost-effective, and lets you build classifiers as quickly as you can write your classes.

from marvin import ai_classifier
from enum import Enum

class AppRoute(Enum):
    """Represents distinct routes command bar for a different application"""

    USER_PROFILE = "/user-profile"
    SEARCH = "/search"
    NOTIFICATIONS = "/notifications"
    SETTINGS = "/settings"
    HELP = "/help"
    CHAT = "/chat"
    DOCS = "/docs"
    PROJECTS = "/projects"
    WORKSPACES = "/workspaces"

AppRoute("update my name")
<AppRoute.USER_PROFILE: '/user-profile'>

AI Functions

AI Functions look like regular functions, but have no source code. Instead, an AI uses their description and inputs to generate their outputs, making them ideal for NLP applications like sentiment analysis.

from marvin import ai_fn

def sentiment(text: str) -> float:
    Given `text`, returns a number between 1 (positive) and -1 (negative)
    indicating its sentiment score.

print("Text 1:", sentiment("I love working with Marvin!"))
print("Text 2:", sentiment("These examples could use some work..."))
Text 1: 0.8
Text 2: -0.2

Because AI functions are just like regular functions, you can quickly modify them for your needs. Here, we modify the above example to work with multiple strings at once:

from marvin import ai_fn

def sentiment_list(texts: list[str]) -> list[float]:
    Given a list of `texts`, returns a list of numbers between 1 (positive) and
    -1 (negative) indicating their respective sentiment scores.

        "That was surprisingly easy!",
        "Oh no, not again.",
[0.7, -0.5]

AI Applications

AI Applications are the base class for interactive use cases. They are designed to be invoked one or more times, and automatically maintain three forms of state:

  • state: a structured application state
  • plan: high-level planning for the AI assistant to keep the application "on-track" across multiple invocations
  • history: a history of all LLM interactions

AI Applications can be used to implement many "classic" LLM use cases, such as chatbots, tool-using agents, developer assistants, and more. In addition, thanks to their persistent state and planning, they can implement applications that don't have a traditional chat UX, such as a ToDo app. Here's an example:

from datetime import datetime
from pydantic import BaseModel, Field
from marvin import AIApplication

# create models to represent the state of our ToDo app
class ToDo(BaseModel):
    title: str
    description: str = None
    due_date: datetime = None
    done: bool = False

class ToDoState(BaseModel):
    todos: list[ToDo] = []

# create the app with an initial state and description
todo_app = AIApplication(
        "A simple todo app. Users will provide instructions for creating and updating"
        " their todo lists."

Now we can invoke the app directly to add a to-do item. Note that the app understands that it is supposed to manipulate state, not just respond conversationally.

# invoke the application by adding a todo
response = todo_app("I need to go to the store tomorrow at 5pm")

    f"Response: {response.content}\n",
print(f"App state: {todo_app.state.json(indent=2)}")
Response: Sure! I've added a new task to your to-do list. You need to go to the store tomorrow at 5pm.

App state: {
  "todos": [
      "title": "Go to the store",
      "description": null,
      "due_date": "2023-07-12T17:00:00",
      "done": false

We can inform the app that we already finished the task, and it updates state appropriately

# complete the task
response = todo_app("I already went")

print(f"Response: {response.content}\n")
print(f"App state: {todo_app.state.json(indent=2)}")
Response: Great! I've marked the task "Go to the store" as completed. Is there anything else you need help with?

App state: {
  "todos": [
      "title": "Go to the store",
      "description": null,
      "due_date": "2023-07-12T17:00:00",
      "done": true