Skip to main content

Notice: Services Restored - PatentSearch API

The Server 500 errors affecting the PatentSearch API have been resolved. Full service is now restored. Thank you for your patience. 

PatentsView Algorithms and Methods

Discover how PatentsView transforms raw patent data into structured, reliable insights. Our methods ensure transparency, consistency, and usability—empowering researchers, developers, and policymakers to explore U.S. innovation with confidence.

Backed by the Office of the Chief Economist at the U.S. Patent and Trademark Office (USPTO), PatentsView applies rigorous data preparation techniques to support reproducible research and informed decision-making.

What Are PatentsView's Methods?

PatentsView methods are the behind-the-scenes processes that make our data trustworthy and actionable. From disambiguating inventor names to standardizing geographic locations, each step is designed to improve data quality and usability across our tools and datasets.

Whether you're using our API, building custom datasets, or exploring visualizations, understanding our methods helps you interpret results accurately and build stronger analyses.

Explore Our Core Methods

A chart network icon
Data Pipeline

Our data pipeline is the foundation of everything we do. It begins with raw patent data from the USPTO and ends with structured datasets that are easy to explore and analyze. 

Our data pipeline is the foundation of everything we do. It begins with raw patent data from the USPTO and ends with structured datasets that are easy to explore and analyze. This process includes:

  • Data ingestion from multiple USPTO sources
  • Cleaning and normalization to correct errors and inconsistencies
  • Schema mapping to align data with our internal structure
  • Versioning and release management to ensure transparency over time

We update our data quarterly and document all changes in our release notes. This methodology ensures that users always have access to the most current and reliable data available.

Read
Learn more about our data pipeline
A chart diagram icon
Disambiguation

Our disambiguation algorithm groups together records that refer to the same person or organization, even when names are misspelled, abbreviated, or reordered.

Inventor and assignee names often appear in multiple formats across different patents. Our disambiguation algorithm groups together records that refer to the same person or organization, even when names are misspelled, abbreviated, or reordered.

This process includes:

  • Name normalization (e.g., “J. Smith” vs. “John Smith”)
  • Affiliation matching using co-inventor and assignee data
  • Machine learning models trained on historical patent data

The result is a more accurate picture of who is inventing what, where, and when, which is critical for network analysis, innovation tracking, and policy research.

Read
Learn about patent entity disambiguation
A location dot icon
Location Standardization

Patent data often includes inconsistent or incomplete location information. Our location standardization methods support geographic analysis.

Patent data often includes inconsistent or incomplete location information. Our location standardization methods clean and enrich this data to support geographic analysis.

We:

  • Parse and validate city, state, and country fields
  • Match locations to a curated database of known places
  • Resolve ambiguities (e.g., “Springfield” in multiple states)

This enables users to explore innovation trends by region, map inventor networks, and compare patenting activity across geographies.

Read
Learn about location standardization
A government building icon
Government Interest Identification

Our government interest method extracts and flags patents funded by federal grants or contracts using natural language processing (NLP) techniques.

Many patents are funded by federal grants or contracts, but this information is often buried in unstructured text. Our government interest method extracts and flags these patents using natural language processing (NLP) techniques.

We:

  • Scan patent documents for statements of federal support
  • Extract agency names and grant numbers
  • Tag patents with structured metadata for easy filtering

This helps users identify federally funded innovation and evaluate the impact of public investment in research and development.

Read
Learn about government interest identification
A folder tree icon
Patent Classification Mapping

Our classification method maps patents to multiple classification systems, allowing users to explore innovation by technology area and more.

Patents are categorized using multiple classification systems, each with its own structure and purpose. This allows users to explore innovation by technology area, compare trends across industries, and align patent data with other economic indicators.

Our classification method maps patents to:

  • International Patent Classification (IPC)
  • Cooperative Patent Classification (CPC)
  • U.S. Patent Classification (USPC)
  • WIPO Technology Fields
  • NBER Technology Groupings

 

Read
Learn about classification mapping
A circle and check icon
Data Evaluation and Testing

We don’t just build tools, we test them. We use real-world use cases and performance metrics to validate the accuracy and usability of our data. 

We don’t just build tools, we test them. Our evaluation methods use real-world use cases and performance metrics to validate the accuracy and usability of our data.

We:

  • Benchmark disambiguation accuracy
  • Test API response times and reliability
  • Collect user feedback through our forum

This continuous improvement process ensures that PatentsView remains a trusted resource for patent data analysis.

Read
Learn about how we evaluate our data

Tools and Datasets Built on These Methods

A download icon
Bulk Data Downloads

Download large sets of patent data in a tab-delimited format. You can choose raw, cleaned, or processed data that works well with most data analysis tools.

Download large sets of patent data in a tab-delimited format. You can choose raw, cleaned, or processed data. This format works well with most data analysis tools like R or Python. It’s great for deep research and custom analysis.

Read
Explore Data Downloads
A chart icon
PatentSearch API

Use our application programming interface (API) to access patent data. Create custom searches and pull data into your apps or websites.

Use our application programming interface (API) to access patent data programmatically. You can create custom searches and pull data directly into your apps or websites. The API includes detailed documentation and a Swagger interface to help you get started quickly.

Read
Explore PatentSearch API
Compass Icon
Data Explorer - Pending

A user-friendly tool that offers an efficient visual interface for building custom patent datasets. Useful for technical and non-technical users. 

A user-friendly tool that offers an efficient visual interface for building custom patent datasets. The PatentsView Data Explorer allows users to:

  • Enter search criteria using a simple filter interface
  • Explore and refine patent data visually
  • Download tailored datasets for further analysis
  • Use the Insights tab to view trends and summaries
  • Navigate with ease, whether you're a technical or non-technical user
Read
Coming Soon!
A pie graph icon
Data Visualizations

Explore patent data through interactive charts and graphs.

Explore patent data through interactive charts and graphs. You can search by keywords, filter by location, and compare different attributes. These data visualization tools help you spot trends, patterns, and relationships in the data.

Read
Explore Visualizations
A magnifying glass icon
Search Datatool

Search more than 40 years of patent data.

Search more than 40 years of patent data. This tool helps you find connections between inventors, patents, and companies. It’s a powerful way to explore innovation trends and discover new insights.

Read
Use the Search Tool
A line graph icon
Patent Data Dictionaries

These guides explain how our data is organized. 

These guides explain how our data is organized. They include definitions for terms, field names, and attributes. Use them to better understand the structure of our datasets and ensure accurate analysis.

Read
Explore Data Dictionaries

Why Methods Matter

At PatentsView, we believe that open data should also mean open processes. By documenting and sharing our methods, we:

  • Support reproducible research
  • Build trust with our users
  • Enable better decision-making
  • Foster a community of informed data users

Whether you're analyzing patent trends, building a policy brief, or developing a new product, our methodologies give you the confidence to use PatentsView data effectively.