Aider
AI pair programming in your terminal—free, open-source, any LLM
Exa is an embeddings-first web search API designed for AI agents — used by Cursor and AWS to surface results that keyword search misses. We rate it 78/100.
Exa is an embeddings-first web search API engineered specifically for LLMs and AI agents — instead of matching keywords, it retrieves the web by semantic similarity. We rate it 78/100 — an excellent neural retrieval engine for AI builders, but consumption pricing and a smaller index than Google mean it's not the right default for every team.
Exa is a web search API designed from the ground up for AI consumers, not human readers. Founded in 2021 by Will Bryk and Jeff Wang under the name Metaphor Systems, the company went through Y Combinator's Winter 2022 batch and rebranded to Exa in . Headquartered in San Francisco, it has raised over $111M to date at a reported $700M valuation.
The core insight: pretraining LLMs and indexing search engines are remarkably similar processes. So Exa built a search engine using its own embeddings model and Neural PageRank — letting agents query the web with natural-language descriptions like "a small open-source library that does X" and get back genuinely relevant URLs, even single-star GitHub repos that classic keyword search would never surface.
Developer sentiment is largely positive on the technical side. On Hacker News and the Latent Space podcast, builders consistently praise Exa for surfacing semantically relevant results that Google's API misses — one frequently cited example is finding obscure single-star GitHub repositories from a vague natural-language description. Cursor and AWS publicly use Exa as their default web-search backend, which is a strong signal of production-readiness.
The criticism is just as consistent. Reddit and Product Hunt commenters flag that consumption pricing scales steeply — agents running thousands of queries a day rack up costs that are hard to forecast. Reviewers also note Exa's index is meaningfully smaller than Google's, so for breadth-heavy use cases (news monitoring, brand listening) you can hit coverage gaps. Several developers said they pair Exa with a separate scraping tool because content extraction, while improved, still misses some pages cleanly.
Exa runs on usage-based pricing with a generous free tier and a single Pro subscription. Effective , contents for the first 10 results are bundled into search calls — making most workloads roughly 30% cheaper than the prior model.
| Plan | Price | Key Limits |
|---|---|---|
| Free | $0/month | $10 in free credits, ~1,000 searches/month |
| Pro (API) | From $40/month | $7/1K searches, $12/1K Deep, $15/1K Deep-Reasoning, $1/1K extra page contents |
| Websets | $49/month | 8,000 credits, no-code list-building UI |
| Enterprise | Contact | Custom volume, SLAs, dedicated support |
Best for: AI engineers building research agents, RAG pipelines, lead-generation crawlers, and "find me X on the web" features where semantic relevance beats raw keyword recall. Solid fit for startups and teams already using LangChain, LlamaIndex, or MCP-based agents.
Not ideal for: High-volume consumer search products where forecasting cost is critical, or workloads that need Google-scale index breadth (real-time news monitoring across long-tail publishers, e-commerce SKU search across the entire web).
Pros:
Cons:
The closest alternatives are Tavily (cheaper, optimized for agent search but with weaker semantic ranking), Linkup (newer, French startup focused on LLM-friendly search — see our Linkup review), and Perplexity's Sonar API (good for question-answering but more opinionated and less granular). For pure breadth, Google or Bing search APIs still win on coverage but lose on agent-friendliness.
Yes — if your use case is an AI agent or RAG system that needs semantically relevant results, Exa is the most technically differentiated search API on the market today. The free tier is enough to prove value before committing, and the $7/1K pricing is fair for the quality you get. We'd skip it only if your workload is dominated by simple lookups where a cheaper Tavily-style API will do, or if you absolutely need Google-scale index breadth. Final rating: 78/100.
ServiceNow and Accenture Launch Forward Deployed Engineering Program to Scale Agentic AI in the Enterprise (May 6, 2026)
At Knowledge 2026, ServiceNow and Accenture announced a joint forward deployed engineering program that drops co-located engineer pods into customer environments to ship agentic AI workflows natively on the ServiceNow AI Platform — with access to 300+ pre-built agent skills and the AI Control Tower as the governance backbone.
May 7, 2026
ReFiBuy Raises $13.6M Seed to Help Brands Get Recommended by AI Shopping Agents (May 5, 2026)
ReFiBuy, the Raleigh-based agentic commerce platform from ChannelAdvisor founder Scot Wingo, closed an oversubscribed $13.6M seed led by NewRoad Capital Partners on May 5, 2026 — betting that the next billion-dollar e-commerce moat is being chosen by ChatGPT, Claude and Perplexity.
May 7, 2026
OpenAI Replaces ChatGPT's Default Model With GPT-5.5 Instant — 52.5% Fewer Hallucinations, 30% Shorter Answers (May 5, 2026)
OpenAI on May 5 swapped GPT-5.3 Instant for the new GPT-5.5 Instant as ChatGPT's default model, claiming 52.5% fewer hallucinated claims on high-stakes prompts and 30% more concise answers. The model also rolls into the API as chat-latest and adds personalization from Gmail and past chats for Plus and Pro web users.
May 7, 2026
Is this product worth it?
Built With
Compare with other tools
Open Comparison Tool →