Jan Server API Reference
Self-hostable Jan Server powered by vLLM for high-throughput serving
Base URL: http://your-server:8000/v1
Engine: vLLM
Format: OpenAI Compatible
Authentication Required: All requests to Jan Server require authentication. Include your API key in the Authorization header as Bearer YOUR_API_KEY. Configure authentication in your server settings.
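As a rough illustration of the header format described above, the following Python sketch builds an authenticated request against the base URL without sending it. The host and API key are the placeholders from this page. The /models and /chat/completions paths and the build_request helper are assumptions for illustration, chosen because the API is described as OpenAI compatible. Check your own deployment for the routes it actually exposes.

```python
import json
import urllib.request

# Placeholders taken from this reference; replace with your own values.
BASE_URL = "http://your-server:8000/v1"
API_KEY = "YOUR_API_KEY"


def build_request(path, payload=None, base_url=BASE_URL, api_key=API_KEY):
    """Build (but do not send) a request carrying the Bearer token.

    Hypothetical helper: `path` is joined onto the base URL, and a JSON
    payload, if given, turns the request into a POST.
    """
    data = json.dumps(payload).encode("utf-8") if payload is not None else None
    headers = {"Authorization": f"Bearer {api_key}"}
    if data is not None:
        headers["Content-Type"] = "application/json"
    return urllib.request.Request(
        f"{base_url.rstrip('/')}/{path.lstrip('/')}",
        data=data,
        headers=headers,
        method="POST" if data is not None else "GET",
    )


# Assumed OpenAI-style route for listing models; nothing is sent here.
req = build_request("/models")
print(req.get_method(), req.full_url)
print(req.get_header("Authorization"))
```

To actually send the request, pass it to urllib.request.urlopen(req) once the placeholders point at a running server. A request without the Authorization header would be expected to be rejected, since all requests require authentication.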
High Performance
Powered by vLLM's PagedAttention for efficient memory usage and high throughput
Auto-Scaling
Automatically scales to handle your workload with intelligent load balancing
Multi-Model Support
Supports various model formats and sizes, with optimized serving configurations