Search & Data Extraction MCP Servers
157 Model Context Protocol servers in the Search & Data Extraction category.
48 of 157 shown
AIweather-Anurag/ottasia-mcp-server
github.comWhere to watch any movie or TV show across 30 Asian and Middle Eastern streaming markets (Netflix, Disney+ Hotstar, Wavve, Shahid, Hoichoi, ZEE5, JioCinema, and 17 more). npx -y @ottasia/mcp-server
mrslbt/rippr
github.comYouTube transcript extraction for AI agents. Clean text, timestamps, or structured JSON from any video. No API keys required. Install via npx rippr-mcp.
0xdaef0f/job-searchoor
github.comAn MCP server for searching job listings with filters for date, keywords, remote work options, and more.
hanselhansel/aeo-cli
github.comAudit URLs for AI crawler readiness — checks robots.txt, llms.txt, JSON-LD schema, and content density with 0-100 AEO scoring.
Aas-ee/open-webSearch
github.comWeb search using free multi-engine search (NO API KEYS REQUIRED) — Supports Bing, Baidu, DuckDuckGo, Brave, Exa, and CSDN.
AceDataCloud/MCPSerp
github.comGoogle SERP search including web, images, news, maps, places, videos, and knowledge graph results via Ace Data Cloud API.
AIMLPM/markcrawl
github.comCrawl websites into clean Markdown, search pages, and extract structured data with LLMs. Built-in MCP server for web research and RAG pipelines.
ac3xx/mcp-servers-kagi
github.comKagi search API integration
adawalli/nexus
github.comAI-powered web search server using Perplexity Sonar models with source citations. Zero-install setup via NPX.
ananddtyagi/webpage-screenshot-mcp
github.comA MCP server for taking screenshots of webpages to use as feedback during UI developement.
andybrandt/mcp-simple-arxiv
github.com🐍 ☁️ MCP for LLM to search and read papers from arXiv
andybrandt/mcp-simple-pubmed
github.com🐍 ☁️ MCP to search and read medical / life sciences papers from PubMed.
andyliszewski/webcrawl-mcp
github.comLocal-first web scraping, search, and crawling. Static pages extracted locally via trafilatura; optional Firecrawl fallback only when JS rendering is needed. Four tools: scrape, search (DuckDuckGo), map, crawl.
angheljf/nyt
github.comSearch articles using the NYTimes API
apify/mcp-server-rag-web-browser
github.comAn MCP server for Apify's open-source RAG Web Browser Actor to perform web searches, scrape URLs, and return content in Markdown.
atlasprzetargow/mcp-server
github.comSearch 800 000+ Polish public tenders (BZP + TED). Profiles of procuring entities and contractors by NIP, market statistics by CPV/province, 90+ term procurement glossary.
AutomateLab-tech/citation-intelligence
github.comWhat LLMs cite, for agents. Check which URLs Perplexity, Claude, ChatGPT, Gemini, Bing, and Google AI Overviews cite for any query. Self-hosted, BYO API key. Install via npx @automatelab/citation-intelligence.
Khamel83/argus
github.comMulti-provider search broker with automatic fallback, RRF ranking, content extraction, and budget enforcement.
idapixl/idapixl-web-research-mcp
github.comPay-per-use web research for AI agents on Apify. Search (Brave + DuckDuckGo), fetch pages to clean markdown, and multi-step research with relevance scoring and key fact extraction.
Bigsy/Clojars-MCP-Server
github.comClojars MCP Server for upto date dependency information of Clojure libraries
blazickjp/arxiv-mcp-server
github.comSearch ArXiv research papers
boikot-xyz/boikot
github.comModel Context Protocol Server for looking up company ethics information. Learn about the ethical and unethical actions of major companies.
brave/brave-search-mcp-server
github.comWeb search capabilities using Brave's Search API
cameronrye/activitypub-mcp
github.comA comprehensive MCP server that enables LLMs to explore and interact with the Fediverse through ActivityPub protocol. Features WebFinger discovery, timeline fetching, instance exploration, and cross-platform support for Mastodon, Pleroma, Misskey, and other ActivityPub servers.
cameronrye/gopher-mcp
github.comModern, cross-platform MCP server enabling AI assistants to browse and interact with both Gopher protocol and Gemini protocol resources safely and efficiently. Features dual protocol support, TLS security, and structured content extraction.
einiba/canyougrab-api
github.comConfidence-scored domain availability checking with real-time DNS + WHOIS lookups. Bulk check up to 100 domains per request. Each result includes availability, confidence level, data source, and registration details.
cevatkerim/unsplash-mcp
github.comUnsplash photo search with proper attribution. Returns ready-to-use attribution text and HTML for each photo, making it easy for LLMs to build content pages with properly credited images. Includes search, random photos, and download tracking.
chanmeng/google-news-mcp-server
github.comGoogle News integration with automatic topic categorization, multi-language support, and comprehensive search capabilities including headlines, stories, and related topics through SerpAPI.
chasesaurabh/mcp-page-capture
github.comMCP server that captures webpage screenshots, with viewport or full-page options and base64 PNG output.
CKBrennan/overtone-news-mcp
github.comReal-time news with tone analysis, brand safety, and narrative shift signals for AI agents.
ConechoAI/openai-websearch-mcp
github.comThis is a Python-based MCP server that provides OpenAI web_search built-in tool.
Crawleo/Crawleo-MCP
github.comCrawleo Search & Crawl API
czottmann/kagi-ken-mcp
github.comWork with Kagi without API access (you'll need to be a customer, tho). Searches and summarizes. Uses Kagi session token for easy authentication.
DappierAI/dappier-mcp
github.comEnable fast, free real-time web search and access premium data from trusted media brands—news, financial markets, sports, entertainment, weather, and more. Build powerful AI agents with Dappier.
deadletterq/mcp-opennutrition
github.comLocal MCP server for searching 300,000+ foods, nutrition facts, and barcodes from the OpenNutrition database.
dealx/mcp-server
github.comMCP Server for DealX platform
deficlow/HyperStore-MCP
github.comSearch 6,500+ curated AI applications from the HyperStore directory. 8 tools (keyword + semantic search, full details, browsing), 3 resources, 3 prompts. Install via uvx hyperstore-mcp or use the hosted endpoint at https://mcp.store.hypergpt.ai/mcp.
★devflowinc/trieve
github.comCrawl, embed, chunk, search, and retrieve information from datasets through Trieve
dorukardahan/domain-search-mcp
github.comFast domain availability aggregator with pricing. Checks Porkbun, Namecheap, GoDaddy, RDAP & WHOIS. Includes bulk search, registrar comparison, AI-powered suggestions, and social media handle checking.
mikusnuz/gsc-mcp
github.comMCP server for Google Search Console & Indexing API — 13 tools for search analytics, sitemaps, URL inspection, and batch indexing.
oso95/domain-suite-mcp
github.comFull domain lifecycle management: availability checking (zero config), registration, DNS, SSL, email auth (SPF/DKIM/DMARC), and WHOIS across Porkbun, Namecheap, GoDaddy, and Cloudflare. 21 tools.
★pepabo/muumuu-domain-mcp
github.comOfficial remote MCP server for Muumuu Domain (GMO Pepabo). Search and register domains, manage owned domains and contracts, and configure DNS records via natural language.
★Dumpling-AI/mcp-server-dumplingai
github.comAccess data, web scraping, and document conversion APIs by Dumpling AI
ekas-io/open-sales-stack
github.comCollection of B2B sales intelligence MCP servers. Includes website analysis, tech stack detection, hiring signals, review aggregation, ad tracking, social profiles, financial reporting and more for AI-powered prospecting by Ekas
emicklei/melrose-mcp
github.comPlays Melrōse music expressions as MIDI
echology-io/decompose
github.comDecompose text into classified semantic units with authority, risk, attention scores, and entity extraction. No LLM. Deterministic. Works as MCP server or CLI.
erithwik/mcp-hn
github.comAn MCP server to search Hacker News, get top stories, and more.
echojobsio/jdl-mcp-server
github.comSearch 1M+ enriched job listings from 20,000+ companies. Filter by skills, salary, location, seniority, remote type, and more. Free — 500 calls/day, no signup required. Also available as a remote MCP server at https://mcp.jobdatalake.com.
Attribution
Data sourced from punkpeye/awesome-mcp-servers (MIT). Synced every 24 hours.