Once your fine-tuned model is deployed, you’re ready to start generating chat completions.

First, make sure you’ve set up the SDK properly. See the OpenPipe SDK section for more details. Once the SDK is installed and you’ve added the right OPENPIPE_API_KEY to your environment variables, you’re almost done.

The last step is to update the model that you’re querying to match the ID of your new fine-tuned model.

from openpipe import OpenAI

# Find the config values in "Installing the SDK"
client = OpenAI()

completion = client.chat.completions.create(
    # model="gpt-4o", - original model
    model="openpipe:your-fine-tuned-model-id",
    messages=[{"role": "system", "content": "count to 10"}],
    metadata={"prompt_id": "counting", "any_key": "any_value"},
)

Queries to your fine-tuned models will now be shown in the Request Logs panel.

Feel free to run some sample inference on the PII Redaction model in our public project.