Portkey Docs
HomeAPIIntegrationsChangelog
  • Introduction
    • What is Portkey?
    • Make Your First Request
    • Feature Overview
  • Integrations
    • LLMs
      • OpenAI
        • Structured Outputs
        • Prompt Caching
      • Anthropic
        • Prompt Caching
      • Google Gemini
      • Groq
      • Azure OpenAI
      • AWS Bedrock
      • Google Vertex AI
      • Bring Your Own LLM
      • AI21
      • Anyscale
      • Cerebras
      • Cohere
      • Fireworks
      • Deepbricks
      • Deepgram
      • Deepinfra
      • Deepseek
      • Google Palm
      • Huggingface
      • Inference.net
      • Jina AI
      • Lingyi (01.ai)
      • LocalAI
      • Mistral AI
      • Monster API
      • Moonshot
      • Nomic
      • Novita AI
      • Ollama
      • OpenRouter
      • Perplexity AI
      • Predibase
      • Reka AI
      • SambaNova
      • Segmind
      • SiliconFlow
      • Stability AI
      • Together AI
      • Voyage AI
      • Workers AI
      • ZhipuAI / ChatGLM / BigModel
      • Suggest a new integration!
    • Agents
      • Autogen
      • Control Flow
      • CrewAI
      • Langchain Agents
      • LlamaIndex
      • Phidata
      • Bring Your own Agents
    • Libraries
      • Autogen
      • DSPy
      • Instructor
      • Langchain (Python)
      • Langchain (JS/TS)
      • LlamaIndex (Python)
      • LibreChat
      • Promptfoo
      • Vercel
        • Vercel [Depricated]
  • Product
    • Observability (OpenTelemetry)
      • Logs
      • Tracing
      • Analytics
      • Feedback
      • Metadata
      • Filters
      • Logs Export
      • Budget Limits
    • AI Gateway
      • Universal API
      • Configs
      • Multimodal Capabilities
        • Image Generation
        • Function Calling
        • Vision
        • Speech-to-Text
        • Text-to-Speech
      • Cache (Simple & Semantic)
      • Fallbacks
      • Automatic Retries
      • Load Balancing
      • Conditional Routing
      • Request Timeouts
      • Canary Testing
      • Virtual Keys
        • Budget Limits
    • Prompt Library
      • Prompt Templates
      • Prompt Partials
      • Retrieve Prompts
      • Advanced Prompting with JSON Mode
    • Guardrails
      • List of Guardrail Checks
        • Patronus AI
        • Aporia
        • Pillar
        • Bring Your Own Guardrails
      • Creating Raw Guardrails (in JSON)
    • Autonomous Fine-tuning
    • Enterprise Offering
      • Org Management
        • Organizations
        • Workspaces
        • User Roles & Permissions
        • API Keys (AuthN and AuthZ)
      • Access Control Management
      • Budget Limits
      • Security @ Portkey
      • Logs Export
      • Private Cloud Deployments
        • Architecture
        • AWS
        • GCP
        • Azure
        • Cloudflare Workers
        • F5 App Stack
      • Components
        • Log Store
          • MongoDB
    • Open Source
    • Portkey Pro & Enterprise Plans
  • API Reference
    • Introduction
    • Authentication
    • OpenAPI Specification
    • Headers
    • Response Schema
    • Gateway Config Object
    • SDK
  • Provider Endpoints
    • Supported Providers
    • Chat
    • Embeddings
    • Images
      • Create Image
      • Create Image Edit
      • Create Image Variation
    • Audio
      • Create Speech
      • Create Transcription
      • Create Translation
    • Fine-tuning
      • Create Fine-tuning Job
      • List Fine-tuning Jobs
      • Retrieve Fine-tuning Job
      • List Fine-tuning Events
      • List Fine-tuning Checkpoints
      • Cancel Fine-tuning
    • Batch
      • Create Batch
      • List Batch
      • Retrieve Batch
      • Cancel Batch
    • Files
      • Upload File
      • List Files
      • Retrieve File
      • Retrieve File Content
      • Delete File
    • Moderations
    • Assistants API
      • Assistants
        • Create Assistant
        • List Assistants
        • Retrieve Assistant
        • Modify Assistant
        • Delete Assistant
      • Threads
        • Create Thread
        • Retrieve Thread
        • Modify Thread
        • Delete Thread
      • Messages
        • Create Message
        • List Messages
        • Retrieve Message
        • Modify Message
        • Delete Message
      • Runs
        • Create Run
        • Create Thread and Run
        • List Runs
        • Retrieve Run
        • Modify Run
        • Submit Tool Outputs to Run
        • Cancel Run
      • Run Steps
        • List Run Steps
        • Retrieve Run Steps
    • Completions
    • Gateway for Other API Endpoints
  • Portkey Endpoints
    • Configs
      • Create Config
      • List Configs
      • Retrieve Config
      • Update Config
    • Feedback
      • Create Feedback
      • Update Feedback
    • Guardrails
    • Logs
      • Insert a Log
      • Log Exports [BETA]
        • Retrieve a Log Export
        • Update a Log Export
        • List Log Exports
        • Create a Log Export
        • Start a Log Export
        • Cancel a Log Export
        • Download a Log Export
    • Prompts
      • Prompt Completion
      • Render
    • Virtual Keys
      • Create Virtual Key
      • List Virtual Keys
      • Retrieve Virtual Key
      • Update Virtual Key
      • Delete Virtual Key
    • Analytics
      • Graphs - Time Series Data
        • Get Requests Data
        • Get Cost Data
        • Get Latency Data
        • Get Tokens Data
        • Get Users Data
        • Get Requests per User
        • Get Errors Data
        • Get Error Rate Data
        • Get Status Code Data
        • Get Unique Status Code Data
        • Get Rescued Requests Data
        • Get Cache Hit Rate Data
        • Get Cache Hit Latency Data
        • Get Feedback Data
        • Get Feedback Score Distribution Data
        • Get Weighted Feeback Data
        • Get Feedback Per AI Models
      • Summary
        • Get All Cache Data
      • Groups - Paginated Data
        • Get User Grouped Data
        • Get Model Grouped Data
        • Get Metadata Grouped Data
    • API Keys [BETA]
      • Update API Key
      • Create API Key
      • Delete an API Key
      • Retrieve an API Key
      • List API Keys
    • Admin
      • Users
        • Retrieve a User
        • Retrieve All Users
        • Update a User
        • Remove a User
      • User Invites
        • Invite a User
        • Retrieve an Invite
        • Retrieve All User Invites
        • Delete a User Invite
      • Workspaces
        • Create Workspace
        • Retrieve All Workspaces
        • Retrieve a Workspace
        • Update Workspace
        • Delete a Workspace
      • Workspace Members
        • Add a Workspace Member
        • Retrieve All Workspace Members
        • Retrieve a Workspace Member
        • Update Workspace Member
        • Remove Workspace Member
  • Guides
    • Getting Started
      • A/B Test Prompts and Models
      • Tackling Rate Limiting
      • Function Calling
      • Image Generation
      • Getting started with AI Gateway
      • Llama 3 on Groq
      • Return Repeat Requests from Cache
      • Trigger Automatic Retries on LLM Failures
      • 101 on Portkey's Gateway Configs
    • Integrations
      • Llama 3 on Portkey + Together AI
      • Introduction to GPT-4o
      • Anyscale
      • Mistral
      • Vercel AI
      • Deepinfra
      • Groq
      • Langchain
      • Mixtral 8x22b
      • Segmind
    • Use Cases
      • Few-Shot Prompting
      • Enforcing JSON Schema with Anyscale & Together
      • Detecting Emotions with GPT-4o
      • Build an article suggestion app with Supabase pgvector, and Portkey
      • Setting up resilient Load balancers with failure-mitigating Fallbacks
      • Run Portkey on Prompts from Langchain Hub
      • Smart Fallback with Model-Optimized Prompts
      • How to use OpenAI SDK with Portkey Prompt Templates
      • Setup OpenAI -> Azure OpenAI Fallback
      • Fallback from SDXL to Dall-e-3
      • Comparing Top10 LMSYS Models with Portkey
      • Build a chatbot using Portkey's Prompt Templates
  • Support
    • Contact Us
    • Developer Forum
    • Common Errors & Resolutions
    • December '23 Migration
    • Changelog
Powered by GitBook
On this page
  • What is Autonomous LLM Fine-tuning?
  • Benefits
  • Data Preparation
  • Fine-tuning Process
  • How It Works: Step-by-Step
  • Partnerships
  • Getting Started
  • Best Practices
  • API Support

Was this helpful?

Edit on GitHub
  1. Product

Autonomous Fine-tuning

PreviousCreating Raw Guardrails (in JSON)NextEnterprise Offering

Last updated 9 months ago

Was this helpful?

This feature is in private beta. Please drop us a message on support@portkey.ai or on our if you're interested.

What is Autonomous LLM Fine-tuning?

Autonomous Fine-tuning is a powerful feature offered by Portkey AI that enables organizations to automatically create, manage, and execute fine-tuning jobs for Large Language Models (LLMs) across multiple providers.

This feature leverages your existing API usage data to continuously improve and customize LLM performance for your specific use cases.

Benefits

  • Automated Workflow: Streamline the entire fine-tuning process from data preparation to model deployment.

  • Multi-Provider Support: Fine-tune models across 10+ providers, including OpenAI, Azure, AWS Bedrock, and Anyscale.

  • Data-Driven Improvements: Utilize your actual API usage data to create relevant and effective fine-tuning datasets.

  • Continuous Learning: Set up periodic fine-tuning jobs to keep your models up-to-date with the latest data.

  • Enhanced Performance: Improve model accuracy and relevance for your specific use cases.

  • Cost-Effective: Optimize your LLM usage by fine-tuning models to better suit your needs, potentially reducing the number of API calls required.

  • Centralized Management: Manage all your fine-tuning jobs across different providers from a single interface.

Data Preparation

  1. Log Collection: Portkey's AI gateway automatically collects and stores logs from your LLM API requests.

  2. Data Enrichment:

    • Filter logs based on various criteria.

    • Annotate logs with additional information.

    • Use Portkey's Guardrails feature for automatic log annotation.

  3. Dataset Creation: Utilize filters to select the most relevant logs for your fine-tuning dataset.

  4. Data Export: Export the enriched logs as a dataset suitable for fine-tuning.

Fine-tuning Process

  1. Model Selection: Choose from a wide range of supported LLM providers and models.

  2. Job Configuration: Set up fine-tuning parameters through an intuitive UI.

  3. Execution: Portkey triggers the fine-tuning job on the selected provider's platform.

  4. Monitoring: Track the progress of your fine-tuning jobs through Portkey's dashboard.

  5. Deployment: Once complete, the fine-tuned model becomes available for use through Portkey's API gateway.

How It Works: Step-by-Step

  1. Data Collection: As you use Portkey's AI gateway for LLM requests, logs are automatically collected and stored in your Portkey account.

  2. Data Enrichment:

    • Apply filters to your log data.

    • Add annotations and additional context to logs.

    • Utilize Portkey's Guardrails feature for automatic input/output annotations.

  3. Dataset Creation:

    • Use the enriched log data to create a curated dataset for fine-tuning.

    • Apply additional filters to select the most relevant data points.

  4. Fine-tuning Job Setup:

    • Access the Fine-tuning feature in Portkey's UI.

    • Select your desired LLM provider and model.

    • Choose your prepared dataset.

    • Configure fine-tuning parameters.

  5. Job Execution:

    • Portkey initiates the fine-tuning job on the chosen provider's platform.

    • Monitor the progress through Portkey's dashboard.

  6. Model Deployment:

    • Once fine-tuning is complete, the new model becomes available through Portkey's API gateway.

  7. Continuous Improvement (Optional):

    • Set up periodic fine-tuning jobs (daily, weekly, or monthly).

    • Portkey automatically creates and executes these jobs using the latest data.

Partnerships

Portkey AI has established partnerships to extend the capabilities of its Autonomous Fine-tuning feature:

  • OpenPipe: Integration allows Portkey's enriched data to be used on OpenPipe's fine-tuning platform.

  • Pipeshift: Portkey's datasets can be seamlessly utilized in Pipeshift's inference platform.

These partnerships enable users to leverage Portkey's data preparation capabilities with specialized fine-tuning and inference services.

Getting Started

To begin using Autonomous Fine-tuning:

  1. Ensure you have an active Portkey AI account with the AI gateway set up.

  2. Navigate to the Fine-tuning section in your Portkey dashboard.

  3. Follow the step-by-step wizard to create your first fine-tuning job.

  4. For assistance, consult our detailed documentation or contact Portkey support.

Best Practices

  • Regularly review and update your data filtering criteria to ensure the quality of your fine-tuning datasets.

  • Start with smaller, focused datasets before scaling up to larger fine-tuning jobs.

  • Monitor the performance of your fine-tuned models and iterate as needed.

  • Leverage Portkey's analytics to gain insights into your model's performance improvements.

By utilizing Portkey AI's Autonomous Fine-tuning feature, you can harness the power of your own data to create customized, high-performing LLMs tailored to your specific needs, all while streamlining the management of multiple AI providers.

API Support

Discord
Fine-tuning