RudraTech API
A powerful, flexible, and easy-to-use RESTful API for deploying AI models at scale, with official SDKs for Python, Node.js, Go, and TypeScript.
API Features
Fast & Scalable
Sub-100ms latency with auto-scaling to handle millions of requests
Enterprise Security
End-to-end encryption, SOC 2 compliance, and granular API access controls
Multiple SDKs
Official SDKs for Python, Node.js, Go, and TypeScript with full type support
Global Regions
Deploy in the US, EU, and APAC regions with automatic load balancing and failover
Streaming Support
Real-time streaming responses for low-latency applications
Rate Limiting
Flexible rate limiting and quota management for your use case
Quick Start Example
import { Rudra } from '@rudratech/sdk';

// Read the API key from the environment rather than hard-coding it
const rudra = new Rudra({
  apiKey: process.env.RUDRA_API_KEY,
});

// Generate text with the LLM Platform
const response = await rudra.llm.generate({
  model: 'gpt-4-turbo',
  prompt: 'Explain quantum computing',
  temperature: 0.7,
});

console.log(response.text);
API Endpoints
/v1/llm/generate
Generate text using language models
/v1/embeddings/create
Create embeddings for semantic search
/v1/vector/search
Search vectors in our database
/v1/models
List available models
/v1/inference/batch
Run batch inference jobs
/v1/jobs/{id}
Get job status and results
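For readers calling the endpoints directly instead of through an SDK, the sketch below shows one way to turn the paths listed above into request URLs, including filling the {id} placeholder in /v1/jobs/{id}. Several details are assumptions and not taken from this page: the base URL https://api.example.com is a placeholder, the helper names buildUrl and getJob are hypothetical, and the use of a GET request with a bearer token in the Authorization header is a guess at a common convention.

```typescript
// Endpoint paths as listed on this page.
const ENDPOINTS = {
  generate: '/v1/llm/generate',
  embeddings: '/v1/embeddings/create',
  vectorSearch: '/v1/vector/search',
  models: '/v1/models',
  batch: '/v1/inference/batch',
  job: '/v1/jobs/{id}',
} as const;

// ASSUMPTION: placeholder host. Replace with the real API base URL.
const BASE_URL = 'https://api.example.com';

/** Fill `{name}` placeholders in a path template and prefix the base URL. */
function buildUrl(
  template: string,
  params: Record<string, string> = {},
  baseUrl: string = BASE_URL,
): string {
  const path = template.replace(/\{(\w+)\}/g, (_match: string, name: string) => {
    const value = params[name];
    if (value === undefined) {
      throw new Error(`Missing path parameter: ${name}`);
    }
    // Encode the value so characters such as spaces or slashes stay inside one path segment.
    return encodeURIComponent(value);
  });
  // Drop trailing slashes on the base URL to avoid a double slash.
  return baseUrl.replace(/\/+$/, '') + path;
}

/**
 * Fetch the status of a job.
 * ASSUMPTION: GET with a bearer token; the response shape is not documented
 * here, so it is returned untyped.
 */
async function getJob(id: string, apiKey: string): Promise<unknown> {
  const response = await fetch(buildUrl(ENDPOINTS.job, { id }), {
    headers: { Authorization: `Bearer ${apiKey}` },
  });
  if (!response.ok) {
    throw new Error(`Request failed with status ${response.status}`);
  }
  return response.json();
}
```

Keeping the URL construction in a pure function separate from the network call makes the path handling easy to check without a live API key.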
Simple, Transparent Pricing
Pay only for what you use. First 1 million tokens are free every month.
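As a rough illustration of the free tier, the sketch below computes how many tokens in a month would fall outside the first 1 million. The function name billableTokens is hypothetical, and no per-token price is shown because this page does not state one.

```typescript
// From this page: the first 1 million tokens each month are free.
const FREE_TOKENS_PER_MONTH = 1000000;

/** Tokens used beyond the monthly free allowance (never negative). */
function billableTokens(tokensUsedThisMonth: number): number {
  if (!Number.isFinite(tokensUsedThisMonth) || tokensUsedThisMonth < 0) {
    throw new RangeError('tokensUsedThisMonth must be a non-negative number');
  }
  return Math.max(0, tokensUsedThisMonth - FREE_TOKENS_PER_MONTH);
}
```

Under this reading, a month with 250,000 tokens would have no billable usage, while a month with 3,500,000 tokens would have 2,500,000 billable tokens.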