PubSys by Britto
A Search Engine for the World's Public Data.

PubSys ingests, filters, and catalogs public datasets from across the internet—then makes them searchable via UI, REST, GraphQL, and natural language.
What It Does
1
Source Discovery
AI-powered crawlers continuously scan the internet for open data sources and updates.
2
Ingestion Pipeline
Cleans, structures, and tags datasets with schema normalization and quality scoring.
3
NLP Query Engine
Users can ask plain-language questions like "Show water quality in California" and get instant results.
4
Flexible Access
UI for exploration, REST & GraphQL for developers, and full metadata/citation for transparency.
Who It's For
Developers
Build apps with data via GraphQL/REST.
Researchers
Find and cite quality public datasets quickly
Civic Tech & Journalists
Use trustworthy data to inform and empower.
Why It Matters
Public data is everywhere—but finding, cleaning, and using it is hard.
PubSys makes open data usable. At scale.
  • Unified interface across the world's public datasets
  • AI-powered discovery and quality tagging
  • Access via code or plain English
  • Fully traceable, up-to-date, and free to explore
Our Advantage
Be the first to experience our revolutionary data platform
Sign up for our waitlist and be among the first to access our cutting-edge solutions.
PubSys Value Chain
From raw data to insight—here’s how PubSys delivers value at every step.
1. 🌐Discovery
"Where’s the good data?"
  • Autonomous agents crawl public sources (e.g., gov sites, APIs, open data portals).
  • Community submissions expand reach.
  • Sources are scored for freshness, quality, and relevance.
2. 🧹 Ingestion & Normalization
"Can I trust and use this?"
  • Raw data is scraped, cleaned, and transformed into structured formats.
  • Schema normalization and metadata tagging ensure consistency.
  • Deduplication, quality scoring, and change detection flag outdated or low-value data.
3. 📦 Cataloging
"What data do you have?"
  • Each dataset is assigned rich metadata, versioning, and provenance (citations, source links).
  • Datasets are indexed by domain, location, format, and entity types.
  • API-accessible catalog enables discovery via REST, GraphQL, and NLP.
4. 🤖 Query & Access
"How do I get what I need?"
  • Users explore via:
  • UI: Searchable, browsable interface with previews and visualizations.
  • REST & GraphQL APIs: Full developer access with caching and pagination.
  • NLP Engine: Natural language questions become structured queries.
5. 🧠 Insight & Action
"What can I do with this?"
  • Enables downstream use cases:
  • Data journalism
  • Policy research
  • Civic dashboards
  • AI model training
  • Product enrichment
  • Supports real-time integrations and low-code tooling (future).
→ PubSys isn’t just a crawler or API—it's the entire value chain from web to insight.
Team
Todd Fulton — Founder & CTO
Todd is a serial startup founder and engineering leader with over 25 years of experience building high-scale data, payments, and API platforms.
CTO & Co-founder at Red Gorilla
Led $10M VC-backed scale-up, architected J2EE data systems, and built a 37-person engineering team.
VP Engineering at Software as Service
Drove product development and raised first-tier VC interest.
Director roles at Recurly & GearLaunch
Built new platform teams, scaled microservices in GCP, and led dev/ops transformations.
Senior Engineer at PayPal
Implemented mobile SDKs, analytics infra, and credit operations.
Todd holds an MBA from USC and a BA from UCLA.
🔗 LinkedIn
Join Our Beta
Want early access?
Join the waitlist and be among the first to explore our AI-powered data platform.
Email
info@britto.io