Granular controls for
LLMs and 3rd-party APIs.
Gain unmatched visibility, control, and performance over your 3rd-party API consumption. Built for production resilience, Lunar.dev empowers your architecture to thrive in the AI and API-driven era.
Production-grade management layer for LLMs and 3rd-party APIs
Visibility
Get full visibility into all third-party API calls across environments and departments, and enrich your metrics. Fits into your monitoring stack, such as Datadog and Prometheus. Use the Lunar.dev dashboard or your preferred tool.
Control
Orchestration layer to optimize any and all API calls across distributed environments and services, and enforce rate limiting, quota management, concurrency control, and caching with pre-made flows.
Scale
Works atop your infrastructure. Ready for high volumes, it reduces in-production errors and minimizes maintenance time. Get more out of your current API integrations.
Reduce Dependencies, Reduce Latency
Unlike traditional general-purpose gateways, Lunar.dev is tailored for managing egress traffic to any 3rd-party API, making it ideal for businesses that rely on external integrations and want to reduce their dependencies on 3rd parties.
L7 Observability
3rd-party API Catalog
Discover, catalog, and manage all consumed APIs across your organization with a centralized view.
Monitoring API Performance
Real-time monitoring of provider performance metrics like latency and error rates to identify issues and measure impact.
Consumption Metrics Extraction
Extract granular metrics from headers and payloads with advanced filtering, easily exportable to your monitoring stack.
Manage Provider Quotas with Precision
Quota Tracking
Track usage in real time to prevent overages or prioritize API calls based on remaining quota.
Sub-Quota Allocation
Allocate specific quota portions across teams or applications for more granular consumption control.
Budget Controls
Set budget thresholds to manage costs and optimize API usage within financial limits.
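As a rough illustration of how quota tracking and sub-quota allocation can fit together, here is a minimal Python sketch. It is not Lunar.dev's API: the QuotaTracker class, its method names, and the consumer names are all hypothetical, and it keeps state in memory rather than in a shared store.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class QuotaTracker:
    """Hypothetical tracker: one provider quota split into per-consumer sub-quotas."""
    total: int                                              # provider-wide call budget for the period
    allocations: Dict[str, int] = field(default_factory=dict)  # consumer -> sub-quota
    used: Dict[str, int] = field(default_factory=dict)         # consumer -> calls made so far

    def allocate(self, consumer: str, amount: int) -> None:
        # Refuse allocations that would exceed the provider-wide quota.
        others = sum(v for k, v in self.allocations.items() if k != consumer)
        if others + amount > self.total:
            raise ValueError("sub-quotas exceed the total quota")
        self.allocations[consumer] = amount

    def remaining(self, consumer: str) -> int:
        return self.allocations.get(consumer, 0) - self.used.get(consumer, 0)

    def try_consume(self, consumer: str, calls: int = 1) -> bool:
        # Allow the call only while the consumer still has sub-quota left.
        if self.remaining(consumer) < calls:
            return False
        self.used[consumer] = self.used.get(consumer, 0) + calls
        return True


# Example: split a 1,000-call quota between two (made-up) teams.
tracker = QuotaTracker(total=1000)
tracker.allocate("billing", 700)
tracker.allocate("search", 300)
```

With this split, the "search" consumer is refused once its 300 calls are spent, while "billing" keeps its own 700 untouched. A budget threshold could be layered on the same counters by comparing used calls against a cost limit.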
Control and Optimize Outgoing API Traffic
Client Side Rate Limiting
Solve rate limit challenges by setting limits per environment, tenant, or API key, managing multiple provider rate limits effectively.
Priority Queue
Prioritize API calls to ensure critical requests, like those from premium customers, are processed first.
Caching
Reduce API costs and improve performance by decreasing latency through intelligent caching.
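To make the idea of client-side rate limiting per environment, tenant, or API key more concrete, here is a minimal token-bucket sketch in Python. It is a generic technique shown under stated assumptions, not Lunar.dev's implementation: TokenBucket, KeyedLimiter, and the injectable clock parameter are hypothetical names chosen for this example.

```python
import time
from typing import Callable, Dict


class TokenBucket:
    """Allows `rate` calls per second, with bursts of up to `capacity` calls."""

    def __init__(self, rate: float, capacity: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.rate = rate
        self.capacity = capacity
        self.clock = clock
        self.tokens = capacity        # start with a full bucket
        self.updated = clock()

    def allow(self) -> bool:
        now = self.clock()
        # Refill for the elapsed time, capped at the bucket capacity.
        self.tokens = min(self.capacity, self.tokens + (now - self.updated) * self.rate)
        self.updated = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False


class KeyedLimiter:
    """Keeps one bucket per key, e.g. an environment, tenant, or API key."""

    def __init__(self, rate: float, capacity: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.rate = rate
        self.capacity = capacity
        self.clock = clock
        self.buckets: Dict[str, TokenBucket] = {}

    def allow(self, key: str) -> bool:
        if key not in self.buckets:
            self.buckets[key] = TokenBucket(self.rate, self.capacity, self.clock)
        return self.buckets[key].allow()
```

A caller would check limiter.allow("tenant-42") before each outgoing request and delay or queue the request when it returns False. Sharing limits across several processes would need a shared store instead of the in-memory dictionary used here.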
Stable & reliable. At any scale
Retries & Circuit breakers
Automate retries for failed requests and implement circuit breakers to prevent system overloads.
Set Actions on Defined Thresholds
Control traffic flow by setting actions based on soft limit thresholds, avoiding provider rate limits and maintaining smooth operations.
Smooth Traffic Peaks
Regulate unexpected traffic spikes by managing API calls to prevent overloads and ensure service continuity.
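For readers unfamiliar with the pattern, the sketch below shows one common way retries and a circuit breaker can work together. It is a generic Python illustration, not Lunar.dev's code: CircuitBreaker, CircuitOpenError, call_with_retries, and their parameters are hypothetical, and a real setup would usually add a backoff delay between retries.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


class CircuitOpenError(RuntimeError):
    """Raised when the breaker is open and the call is skipped."""


class CircuitBreaker:
    def __init__(self, failure_threshold: int = 3, reset_after: float = 30.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.failure_threshold = failure_threshold
        self.reset_after = reset_after      # cool-down in seconds
        self.clock = clock
        self.failures = 0
        self.opened_at: Optional[float] = None

    def call(self, fn: Callable[[], T]) -> T:
        if self.opened_at is not None:
            if self.clock() - self.opened_at < self.reset_after:
                raise CircuitOpenError("provider temporarily disabled")
            # Cool-down elapsed: let one trial call through (half-open).
            self.opened_at = None
            self.failures = self.failure_threshold - 1
        try:
            result = fn()
        except Exception:
            self.failures += 1
            if self.failures >= self.failure_threshold:
                self.opened_at = self.clock()   # open the circuit
            raise
        self.failures = 0                       # success closes the circuit
        return result


def call_with_retries(breaker: CircuitBreaker, fn: Callable[[], T], attempts: int = 3) -> T:
    """Retry a failing call, but stop as soon as the breaker opens."""
    last_error: Optional[Exception] = None
    for _ in range(attempts):
        try:
            return breaker.call(fn)
        except CircuitOpenError:
            raise
        except Exception as exc:
            last_error = exc
    assert last_error is not None
    raise last_error
```

The breaker stops sending traffic to a failing provider after a set number of consecutive failures, then lets a single trial call through once the cool-down has passed. This keeps retries from piling more load onto a provider that is already struggling.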
Fault-tolerant and built
for demanding workloads
Focus on scaling your product, not fixing third-party integrations.
Lunar.dev provides stability and flexibility during traffic peaks and outages,
reducing errors, flaky endpoints and surprises in production.
Deploy across apps
Multiple installations - Lunar.dev is a dedicated service to manage distributed traffic and quotas and works with Redis to provide shared state.
Fail-safe by design
Built with a gateway pass-through or API re-routing on timeouts, ensuring no data loss or service disruption.
Scalable Gateway clusters
Balance traffic with our Gateway clusters for seamless performance and scalability during spikes.
Infrastructure agnostic
Compatible with any cloud provider, production architecture, and all your APIs, from legacy to cutting-edge AI APIs.
Minimal Latency
Our gateway adds at most 4 ms (measured at P95) to the overall API call. Experience seamless performance without sacrificing speed.
Up to 90K calls per second
A single Lunar gateway can endure enormous volumes of traffic, managing the most demanding workloads.
You are the architect
Instantly gain full control over your company’s API consumption across departments, stages, and environments. Get proactive and make the most of your API integrations.
Works with any stack
Seamlessly integrates with your existing tech stack and complements your own middleware or Ingress infrastructure. Deploy it self-hosted or with any cloud provider (Azure or GCP). Works with Lambdas and fits into your monitoring stack, such as Datadog and Prometheus.
From blind spots to unified control
Unify egress API controls across all teams and applications, providing engineering with the visibility and insights needed to manage and optimize outgoing API traffic effectively.
Measure and demonstrate impact
Visualize performance improvements and cost savings from your API policies, and provide stakeholders with clear insights into the value of controlled API usage. Test, refine, and demonstrate the impact in real time.
API maintenance is not your core business. But it is ours
Stop duct-taping problems that deserve infrastructure-level solutions.
Your API fixes let you down when you need them the most.
Get proactive and take control of your API consumption.
36%
spend more time fixing APIs than building new features.
66%
of companies may be exposed to security risks by under-prioritizing API management
88%
of companies report that third-party API related issues require weekly attention
Frequently asked questions
What is self-managed and what is SaaS?
Lunar.dev's infrastructure is self-managed: it is installed and managed directly within your own cloud infrastructure. This approach ensures that all data related to API calls and their payloads is managed and stored locally, giving you full control over your data with no external exposure.
Lunar.dev's UI Control Plane is a SaaS component (Software as a Service) offering centralized control and visibility. The data shown in the UI includes system telemetry, system usage, and aggregated metrics on API volumes. Importantly, this data sharing aspect of the SaaS offering can be disabled by the customer, offering flexibility in how much information is transmitted and reflected in the UI.
What’s the difference between the free version and the paid versions?
The free version of Lunar.dev provides essential capabilities, including control over API traffic and a single instance of the Lunar Consumption Gateway, along with access to standard Lunar Flows (policies).
In contrast, the paid versions unlock features designed for complex, high-volume environments, with the key differentiator being scalability:
- Scalable Gateway Cluster: The free version is limited to a single instance of the Lunar Consumption Gateway, whereas the paid versions offer a scalable cluster of Lunar gateways. This cluster architecture shares a Redis-backed state across multiple instances, making it ideal for production environments and growing businesses.
- Failover and Load Balancing: The cluster setup ensures robust failover mechanisms and load balancing, enabling traffic distribution across multiple gateways. This guarantees uptime and reliability, even if one gateway instance goes down.
- Premium Flows and Custom Capabilities: Paid tiers include access to advanced, custom-made Lunar Flows tailored for intricate use cases, along with increased volume handling for API calls managed by the system.
These enhancements make the paid versions well-suited for businesses seeking comprehensive API consumption management, with robust scalability and high availability for their production needs.
Will my API calls experience increased latency?
API calls routed through Lunar.dev's Consumption Gateway experience minimal additional latency. The maximum latency impact observed is only 4 ms at the 95th percentile (P95). By leveraging HAProxy and continuous optimizations, the gateway keeps latency overhead to a bare minimum.
For more details, see our latency and benchmarking analysis and our report on how we benchmark.
Is sensitive data stored by lunar.dev?
Lunar is deployed entirely within your cloud environment, ensuring that no API calls, sensitive Personally Identifiable Information (PII), or customer data are routed to our SaaS infrastructure. The data transmitted to our infrastructure consists solely of system heartbeat, configurations, and telemetry information. For more details about our security measures, get in touch at info@lunar.dev.
Is lunar.dev another middleware in my environment?
We know the build-vs-buy dilemma. We believe that leaving API integrations unmanaged means more technical debt, and similarly, that building your own API middleware will cause exponentially growing maintenance and issues as you scale. In the beta version, implementing Lunar requires a simple import of our supported interceptors via SDK and a one-line installation of the Lunar forward proxy. Removing Lunar is just as simple, leaving your environment as if it had never been there. Read more about it in this post.
Is lunar.dev a SaaS solution?
In addition to our OSS and self-hosted version, you can request access to our beta SaaS version and ask for more information at info@lunar.dev.
Is Lunar.dev Free?
Yes, Lunar.dev has released a free open-source version. Lunar.dev will continue to be free for open-source or educational projects in its self-hosted option. To inquire about SaaS and pricing, reach out to info@lunar.dev.
Is Lunar.dev any good?
Don’t let your APIs consume you.
Get the most out of your API integrations. Gain full visibility over your 3rd-party API consumption, minimize cost, and improve security. Built on top of your existing architecture.