The Gateway for all your AI &3rd-party APIs.
Get deep visibility and granular controls across your LLMs and third-party API usage — so you can scale smarter, faster, and more reliably.
Plug-and-Play Infrastructure for Production Workloads
Visibilty
Get full visibility and enhance your metrics on all third-party API calls and endpoints.
Control
An orchestration layer to optimize rate limits, quotas, routing, and caching with pre-made flows.
Scale
Made for high volumes to minimize production errors and developer maintenance time.
Reduce Dependencies, Reduce Latency
Unlike traditional general-purpose gateways, Lunar.dev is tailored for managing
egress traffic, reducing dependancies on any LLM or 3rd-party API.
Unified API Catalog
Full Observability
Centralize visibility into every API your org consumes—apps, services, and AI agents included.
Live Performance Insights
Track latency, errors, and provider health in real time—no matter who’s calling the API.
Deep Consumption Metrics
Get granular with headers and payloads - filter and export to your observability stack. Optimize every API call whether it's made by code or by AI.

Manage Provider Quotas with Precision
Quota Tracking
Track usage in real time to prevent overages or prioritize API calls based on remaining quota.
Quota Allocation
Allocate specific quota portions across teams or applications for more granular consumption control.
Budget Controls
Set budget thresholds to manage costs and optimize API usage within financial limits.

Consumption Control
Client Side Rate Limiting
Implement precise limits per environment, tenant, or API key to effectively manage multiple provider rate limits, ensuring smooth operation of both standard applications and AI-driven processes.
Priority Queue
Prioritize API calls to ensure that critical requests, such as those from premium customers or essential AI operations, are processed first, maintaining optimal performance and user satisfaction.

Measurable impact
Measure the improvement of your API and AI consumption, costs, and errors.
Improvement in rate limit utilization by proactively managing quotas and enforcing client-side rate limiting.
Case studyOf companies spend more time fixing APIs rather than building new features.
Case studyReduction in API dependency downtime by implementing fallback mechanisms
Case studyOf companies report that third-party API related issues require weekly attention
Report
Fault-tolerant and built
for demanding workloads
Focus on scaling your product, not fixing third-party integrations.
Lunar.dev provides stability and flexibility during traffic peaks and outages,
reducing errors, flaky endpoints and surprises in production.
Deploy across apps
Lunar.dev is a dedicated service to manage distributed traffic and quotas that works with Redis to provide shared state.
Fail-safe by design
Built with a gateway
pass-through or API re-routing on timeouts, ensuring no data loss or service disruption.Scalable Gateway clusters
Balance traffic with our Gateway clusters for seamless performance and scalability during spikes.
Infrastructure agnostic
Compatible with any cloud provider, production architecture, and all your APIs, from legacy to AI APIs.
Minimal Latency
Unnoticeable latency, adding only up to 4ms to the overall API call. Experience seamless performance without sacrificing speed.
Up to 90K calls per second
A single gateway can endure enormous volumes of traffic, managing the most demanding workloads.
Lunar.dev Flows
This core feature enables organizations to optimize API usage, control traffic flow, and enforce quotas with unparalleled flexibility.
The set of pre-made Flows are modular and scalable, designed to support both legacy and cloud-native environments, making them adaptable to evolving API ecosystems.
Rate Limiting
Set custom limits to your consumption based on quotas, smooth traffic peaks and avoid 429s.
AI Observability
Measure and track all your LLMs and AI consumption, costs, and errors.
Fallback
Secondary AI model for failover, timeouts, or low confidence, guaranteeing response quality.
Batching
Group multiple API requests into a single call to reduce overhead, optimize efficiency, and improve rate limit management.
Priority Queue
Ensure high-priority requests get through first, keeping lower-priority traffic running smoothly.
API integrations is not your core business, but it is ours.
Reduce ongoing patching and level-up with infrastructural solutions.
Reduce dependancies and costs by tracking and troubleshooting faster.
Get proactive and take control over your API consumption.
Use case examples
Frequently asked questions
What is self-managed and what is SaaS?
Lunar.dev's infrastructure, is installed and managed directly within your own cloud infrastructure. This approach ensures that all data related to API calls and their payloads are exclusively managed and stored locally, providing full control over your data with no external exposure.
Lunar.dev's UI Control Plane is a SaaS component (Software as a Service) offering centralized control and visibility. The data shown in the UI includes system telemetry, system usage, and aggregated metrics on API volumes. Importantly, this data sharing aspect of the SaaS offering can be disabled by the customer, offering flexibility in how much information is transmitted and reflected in the UI.
What’s the difference between the free version and the paid versions?
The free version of Lunar.dev provides essential capabilities, including control over API traffic and a single instance of the Lunar Consumption Gateway, along with access to standard Lunar Flows (policies).
In contrast, the paid versions unlock features designed for complex, high-volume environments, with the key differentiator being scalability:
- Scalable Gateway Cluster: The free version is limited to a single instance of the Lunar Consumption Gateway, whereas the paid versions offer a scalable cluster of Lunar gateways. This cluster architecture shares a Redis-backed state across multiple instances, making it ideal for production environments and growing businesses.
- Failover and Load Balancing: The cluster setup ensures robust failover mechanisms and load balancing, enabling traffic distribution across multiple gateways. This guarantees uptime and reliability, even if one gateway instance goes down.
- Premium Flows and Custom Capabilities: Paid tiers include access to advanced, custom-made Lunar Flows tailored for intricate use cases, along with increased volume handling for API calls managed by the system.
These enhancements make the paid versions well-suited for businesses seeking comprehensive API consumption management, with robust scalability and high availability for their production needs.
Will my API calls experience increased latency?
API calls routed through the Lunar.dev's Consumption Gateway experience minimal additional latency. The maximum latency impact observed is only 4ms at the P95 percentile. By leveraging HAProxy and continuous optimizations, it keeps latency overhead to a bare minimum.
For more details, you can review our comprehensive latency and benchmarking analysis and report of how we benchmark.
Is sensitive data stored by lunar.dev?
Lunar is deployed entirely within your cloud environment, ensuring that no API calls or sensitive Personally Identifiable Information (PII) and customer data are routed to our SaaS infrastructure. The data transmitted to our infrastructure solely consists of system heartbeat, configurations, and telemetry information. Get in touch to get more details about our security measures at info@lunar.dev.
Is lunar.dev another middleware in my environment?
We know the build-vs.buy dilemma. We believe leaving API integrations unmanaged equals more technical debt, and similarly, that building your API middleware will cause you exponential maintenance and issues as you scale. In the Beta Version, implementing Lunar requires a simple import of our supported interceptors via SDK and a one-line installation of the Lunar forward proxy. Removing Lunar is also as simple as that. it is just as easy as if it never existed. Read more about it in this post.
Is lunar.dev a SaaS solution?
In addition to our OSS and self-hosted version, you can request access to our beta SaaS version, and inquire more information at info@lunar.dev.
Is Lunar.dev Free?
Yes, lunar.dev released a free open-source version. Lunar.dev will continue to be free for Open Source or educational projects in it's self-hosted option. To inquire more about SaaS and pricing reach out to info@lunar.dev.
What the community thinks about Lunar.dev
Get the most out of
your API integrations
Gain full visibilty over you 3rd party API consumption, minimize cost, improve security. Built on top of your existing architecture.