-
Data
-
New & Trending
PatentsView Releases Q2 2025 Data Update
-
New & Trending
- Methodology
- Topics
-
About
- About PatentsView
- About Our Data
- Connect
PatentsView Algorithms and Methods
Discover how PatentsView transforms raw patent data into structured, reliable insights. Our methods ensure transparency, consistency, and usability—empowering researchers, developers, and policymakers to explore U.S. innovation with confidence.
Backed by the Office of the Chief Economist at the U.S. Patent and Trademark Office (USPTO), PatentsView applies rigorous data preparation techniques to support reproducible research and informed decision-making.
What Are PatentsView's Methods?
PatentsView methods are the behind-the-scenes processes that make our data trustworthy and actionable. From disambiguating inventor names to standardizing geographic locations, each step is designed to improve data quality and usability across our tools and datasets.
Whether you're using our API, building custom datasets, or exploring visualizations, understanding our methods helps you interpret results accurately and build stronger analyses.
Explore Our Core Methods
Data Pipeline
Our data pipeline is the foundation of everything we do. It begins with raw patent data from the USPTO and ends with structured datasets that are easy to explore and analyze.
Our data pipeline is the foundation of everything we do. It begins with raw patent data from the USPTO and ends with structured datasets that are easy to explore and analyze. This process includes:
- Data ingestion from multiple USPTO sources
- Cleaning and normalization to correct errors and inconsistencies
- Schema mapping to align data with our internal structure
- Versioning and release management to ensure transparency over time
We update our data quarterly and document all changes in our release notes. This methodology ensures that users always have access to the most current and reliable data available.
Read
Disambiguation
Our disambiguation algorithm groups together records that refer to the same person or organization, even when names are misspelled, abbreviated, or reordered.
Inventor and assignee names often appear in multiple formats across different patents. Our disambiguation algorithm groups together records that refer to the same person or organization, even when names are misspelled, abbreviated, or reordered.
This process includes:
- Name normalization (e.g., “J. Smith” vs. “John Smith”)
- Affiliation matching using co-inventor and assignee data
- Machine learning models trained on historical patent data
The result is a more accurate picture of who is inventing what, where, and when, which is critical for network analysis, innovation tracking, and policy research.
Read
Location Standardization
Patent data often includes inconsistent or incomplete location information. Our location standardization methods support geographic analysis.
Patent data often includes inconsistent or incomplete location information. Our location standardization methods clean and enrich this data to support geographic analysis.
We:
- Parse and validate city, state, and country fields
- Match locations to a curated database of known places
- Resolve ambiguities (e.g., “Springfield” in multiple states)
This enables users to explore innovation trends by region, map inventor networks, and compare patenting activity across geographies.
Read
Government Interest Identification
Our government interest method extracts and flags patents funded by federal grants or contracts using natural language processing (NLP) techniques.
Many patents are funded by federal grants or contracts, but this information is often buried in unstructured text. Our government interest method extracts and flags these patents using natural language processing (NLP) techniques.
We:
- Scan patent documents for statements of federal support
- Extract agency names and grant numbers
- Tag patents with structured metadata for easy filtering
This helps users identify federally funded innovation and evaluate the impact of public investment in research and development.
Read
Patent Classification Mapping
Our classification method maps patents to multiple classification systems, allowing users to explore innovation by technology area and more.
Patents are categorized using multiple classification systems, each with its own structure and purpose. This allows users to explore innovation by technology area, compare trends across industries, and align patent data with other economic indicators.
Our classification method maps patents to:
- International Patent Classification (IPC)
- Cooperative Patent Classification (CPC)
- U.S. Patent Classification (USPC)
- WIPO Technology Fields
- NBER Technology Groupings
Read
Data Evaluation and Testing
We don’t just build tools, we test them. We use real-world use cases and performance metrics to validate the accuracy and usability of our data.
We don’t just build tools, we test them. Our evaluation methods use real-world use cases and performance metrics to validate the accuracy and usability of our data.
We:
- Benchmark disambiguation accuracy
- Test API response times and reliability
- Collect user feedback through our forum
This continuous improvement process ensures that PatentsView remains a trusted resource for patent data analysis.
Read
Tools and Datasets Built on These Methods
Bulk Data Downloads
Download large sets of patent data in a tab-delimited format. You can choose raw, cleaned, or processed data that works well with most data analysis tools.
Download large sets of patent data in a tab-delimited format. You can choose raw, cleaned, or processed data. This format works well with most data analysis tools like R or Python. It’s great for deep research and custom analysis.
Read
PatentSearch API
Use our application programming interface (API) to access patent data. Create custom searches and pull data into your apps or websites.
Use our application programming interface (API) to access patent data programmatically. You can create custom searches and pull data directly into your apps or websites. The API includes detailed documentation and a Swagger interface to help you get started quickly.
Read
Data Explorer - Pending
A user-friendly tool that offers an efficient visual interface for building custom patent datasets. Useful for technical and non-technical users.
A user-friendly tool that offers an efficient visual interface for building custom patent datasets. The PatentsView Data Explorer allows users to:
- Enter search criteria using a simple filter interface
- Explore and refine patent data visually
- Download tailored datasets for further analysis
- Use the Insights tab to view trends and summaries
- Navigate with ease, whether you're a technical or non-technical user
Read
Data Visualizations
Explore patent data through interactive charts and graphs.
Explore patent data through interactive charts and graphs. You can search by keywords, filter by location, and compare different attributes. These data visualization tools help you spot trends, patterns, and relationships in the data.
Read
Search Datatool
Search more than 40 years of patent data.
Search more than 40 years of patent data. This tool helps you find connections between inventors, patents, and companies. It’s a powerful way to explore innovation trends and discover new insights.
Read
Patent Data Dictionaries
These guides explain how our data is organized.
These guides explain how our data is organized. They include definitions for terms, field names, and attributes. Use them to better understand the structure of our datasets and ensure accurate analysis.
Read
Why Methods Matter
At PatentsView, we believe that open data should also mean open processes. By documenting and sharing our methods, we:
- Support reproducible research
- Build trust with our users
- Enable better decision-making
- Foster a community of informed data users
Whether you're analyzing patent trends, building a policy brief, or developing a new product, our methodologies give you the confidence to use PatentsView data effectively.