Turn your Big IP and R&D Data in to Business Insights.
KMX Patent Analytics allows organizations to strengthen their innovation process and improving the ROI on their patent portfolio. KMX provides professional Patent Information Specialists a unique classification, clustering and visualization solution for analyzing and visualizing large patent collections.
2. Industry Thought Leaders about Treparel
“Treparel KMX’s visualization capabilities around its auto-categorization and
clustering offer immediate insight into unstructured data sets and appear to
be adaptable and customizable to customer needs. Its approach to auto-
categorization utilizes statistical principles and machine learning that require
significantly less training and tuning on the part of customers than other
approaches.” David Schubmehl, IDC
“As we acquire more and more information, we need tools that will guide us
through the data maze. Analysts need tools to help them understand
patterns and define clusters. Users need to explore data to uncover
relationships from scattered sources. Treparel’s KMX serves both these
needs with its ability to cluster and categorize collections of data with a high
degree of accuracy, and its interactive visualization tools that enable
exploration of large data sets.” Sue Feldman, Synthexis.com (author: The
Answer Machine.
Treparel KMX – All Rights Reserved 2014 2www.treparel.com
3. Some of our clients & partners
KMX is an integral part of our IP analysis toolbox. It contributes to our
capability of making added value IP analyses of technologies and
competitors to support strategic decision making.
www.fusepool.eu
“We’ve speed up our patent searches from 2 days to 2 hours using
KMX technology”
Treparel KMX – All rights reserved 2014 3
4. Visualization
Clustering Classification
Text Preprocessing and Indexing
Acquire documents
Present Results
Taxonomies,
Ontologies
Semantic Analysis
KMX Text Analytics Application overview
KMX unique functions:
• Extract concepts in context
using clustering and
classification of documents
• Use classification to create
ranked lists and to tag subsets
• Support of binary and multi-
class Classification
• Enterprise edition
(server/cloud) & Professional
edition (desktop)
• Integration with other
applications through KMX API
Treparel KMX – All rights reserved 2014 www.treparel.com 4
Query &
Search Tools
5. Benefits: Get quick insights through automated visual clusters
with annotations to enhance the discovery process
1. Analyze the clusters and the relationships in the data
2. Explore outliers in the data
3. Find documents of interest
What it does: A visualization of clusters where the documents
are displayed as points and the distance between them shows
their similarity.
What KMX delivers: Use KMX to do:
1. Perform text preprocessing (stemming/tokenization etc)
2. Calculate between all documents a similarity measure
3. Calculate visualization (landscape) with automatic annotation
4. Create the visualization
– As a static image
– Or provide interaction where the user can zoom in/out with
support for adaptive annotation
Clustering: User Unsupervised Analytics
Treparel KMX – All rights reserved 2014 www.treparel.com 5
6. Benefits: Finding fast, accurate and precise small result sets and enabling trend
reporting and Alerting by reusing predefined categorization models.
1. Obtain a ranked list of the most relevant documents
2. Separate the important documents from the irrelevant documents (noise)
How it works: A list of the relevant documents defined from a users
perspective.
What KMX delivers: Use KMX to do:
1. Tag (label) a small number of relevant and irrelevant documents
– Use search to identify documents that need to be tagged
– Perform manual tagging
– Select documents interactive from the visualization (brushing)
2. Create a Classifier (categorizer) using the tagged documents
3. Automatically perform the classification on all documents
4. Obtain the important documents as ranked high and the irrelevant
documents which are ranked low
Classification: User Supervised Analytics
Treparel KMX – All rights reserved 2014 www.treparel.com 6
7. Benefits: KMX Visualisations are supporting
the process of constructing a visual image
in the mind to understand the data better.
How it works: KMX offers a visualization framework with various methods for
seeing the unseen. It enriches the process of discovery and fosters profound
and unexpected insights.
What KMX delivers: Different visualizations or visual pipelines to:
• Comprehend large datasets, datasets that are too large to grasp by mental
imagination.
• Discover previous unknown properties of the data set that may not have
been anticipated
• Reveal inherent problems of the data, for instance errors and artefacts
• Examine large-scale features of the dataset as well as the local features or
allows the user to see local features in a larger scale reference
• Let users form hypothesis based on the (newly) observed phenomena or
developed insights
Visualization: Discovering Unexpected Insights
Treparel KMX – All rights reserved 2014 www.treparel.com 7
8. How Reliable & Accurate are the results?
Review your results with advanced performance tools
The quality of the automatic classification (categorization) is shown in the
histogram, where a small number of documents with a high classification score
are separated from the large number of documents.
Non relevant documents Relevant documents
KMX calculates the Precision and Recall of the results using cross validation.
• Precision is essential for: First analysis & Alerting services
• Recall is crucial for: Freedom to Operate search, Validity search Patentability search
• Both need to be high for: Patent portfolio landscape analysis, Technology Exploration, Risk Assessments
Fig: Classification performance 1280 patents on ‘biomass’
8
9. Part 1:
KMX: Ready to Use Patent Analytics
Intuitive Content Clustering,
Classification & Visualization
Treparel KMX – All rights reserved 2014 9www.treparel.com
10. Finding all relevant patents
KMX: An integrated IP Search, Discover & Monitor
1 intuitive visual interface to many IP search cases
State of the art search: Identify all relevant
patents for the purpose of general
technology review by a landscaping analysis
Novelty search: Identify all relevant patents
and non-patents which may affect
patentability of an idea/invention (analysis
done before drafting and filling the patent)
Freedom to Operate search: identify all
relevant patents or non patent literature
which cover a product or process idea that
one wants to work-out
Infringement search: identify all relevant
patents which cover your product or process
and are still in force
Collections: Collect prior art in a particular
area (often conducted using selected or pre-
built patent classifications).
Scientific / Business: Identify financial
organizational, statistical, commercial and
other information.
Voice of Customer: Research into specific
questions asked by customers
Cluster Visualization: the lens to KMX and
all your IP Searches
10
Finding limited (or 1) patents
Patentability search: Given one patent
application determine the novelty.
Opposition search: identify public literature
(patent or non-patent) and show a lack of
novelty (aka invalidity search) of one patent.
Due diligence search: analyze key strengths
and weaknesses and the scope of a patent.
Misc searches: patent family, legal status,
citing references etc.
Monitor, Map & Enrich
Patent Watch & Benchmarking: Monitor latest technology developments by subject (using
Alerts) or compare uniquely categorized IP portfolios to others.
Patent Map: Show patent landscape to uncover trends, gaps or overlap
Co-occurrence: search for semantic similarity where a set of terms (or documents) have a
similar meaning (or semantic content) according to a list of terms.
Semantic enrichment: Find in a set of relevant documents the most important
annotations/words that characterize these documents
Thesauri creation: Group, edit and export terms from a set of documents that have similar
meaning.
11. Use Case 1: Performing small to large scale SWOT
analysis
Patent
Database
+10.000 patents
986 patents
29 patents
Ranking
Queries
Filtering
SWOT analysis example
Start with removing irrelevant
patents using Classification and
Filtering to determine:
• Who are the important players
(assignees, inventors)?
• Where are the important patents
filed (countries)?
• What is the trend over time (growth
of patents over the years)?
• NB: we used a (very) simple query to
find 986 patents filed under
Astrazeneca.
Output
Business
User
Ranking Filtering
Ranking Filtering
Treparel KMX – All rights reserved 2014
12. Landscaping and Ranking:
What are most relevant Respiratory & Inflammation patents?
Fig: From 986 to the most relevant patents using visual selection (brushing) to build a
classification model (Classifier) to be able to rank the full data set and to extract the most relevant.
12
13. Landscaping and Ranking:
What are most relevant Respiratory & Inflammation patents?
Fig: Ranked patents using a Classifier for Respiratory & Inflammation patents (In yellow the selection of 29
absolute relevant patents to be further analyzed). We used ‘respiratory’ to demonstrate highlighting
capabilities.
Yellow = most
important patents
(+80% score)
Blue = least relevant
patents (for this
analysis)
13
NB: crosshair points
to 1 specific patent
(full text in left pane)
Treparel KMX – All rights reserved 2014
14. Page 14 |
Extracting concepts in context from classification of documents
Use Case 2: Concept detection using text classification
to auto generate thesauri
1. Visualization multiple topic
clusters
2. Select cluster select documents
with similar topics
3. Select training documents within
the sub-cluster
4. Build Classifier and classify
5. Rank documents find set of
documents with related concepts
6. Extract concepts to create thesauri
KMX Example: ‘Ebola, SARS, Bird flue: How do they relate?’
Treparel KMX – All rights reserved 2014 14
15. Key KMX Features
Acquire Text
• Use the standard file importers for formats like XLS, CSV, XML
• Directly query data using native connections to data providers like OPS,
CLAIMS Direct, Customer SOLR, NLM PubMed, PLoS or The Guardian .
Model & Analyse
• Get quick insights through a best-in-class search workflow and
automated annotation
• Enhance the discovery process by using dynamic hierarchical views in
the search results delivering automated visual clusters of information
• Focus on high-interest subjects with visual filtering
• Discover serendipity: unexpected trends, patterns and relationships
• Quickly categorize the large data by developing and applying your
personal classification scheme
• Use brushing to visually train the software with small sample sets for
automatic analysis (patent-pending technology)
15Treparel KMX – All rights reserved 2014 15www.treparel.com
16. Key KMX Features
Adapt & Output
• Display the data sets through advanced visualizations incl. landscapes,
document similarity, frequency distribution, classification scores
• Export the data result sets like ranked document lists, labelled lists,
enriched data
• Extract terms from documents to build a thesaurus for semantic analysis
• Share the results in a web browser, in a Word file, or as Excel sheet
• Use auto reporting for scheduling to larger group of business users
Manage & Collaborate
• Cross validate by measuring the precision and recall of the results set
• Improve categorization performance using assisted tuning of
document labelling
• Tag, annotate and rank the results to improve relevance
• Collaborate and cooperate with other users by sharing result sets,
visualizations and analysis models (KMX Classifiers)
• Balance compute power and batch scheduling for up to millions of
documents
• Manage users and schedule automated backups
• Monitoring and alerting on relevant new documents 16
17. Add-on servers:
Auto Reporting & Batch Classification
• Auto Reporting Server
– Support automated analysis for
aggregated results for multiple users
– Pie & bar charts
– Landscape visualizations for overview
of subjects
– Enabling rich interaction
• Batch Classification Server
– high-performance stand-alone text-
classification server
– Enables large scale parallel
processing
Page 17
Treparel KMX – All rights reserved 2014 17www.treparel.com
18. KMX Licensing options
1. Professional Edition (Desktop Installation)
• Full version, including: Standard Importer (CSV, XLS, Text), Text preprocessing, Information Extraction,
Indexing, Building stop-lists for words / stemmers, Multi language word lists, Clustering, Classification
(Building classifiers), Assisted Classifier Performance Tuning, Landscape Visualization, Semantic Analysis.
• Price per user*, per year: CONTACT sales@treparel.com
2. Group Edition (Desktop & Server Installation)
• Similar functionality to Professional Edition, additional: Automated backups, User management,
Collaboration & sharing of Classifiers, data and reports (through server)
• Price per user*: CONTACT sales@treparel.com
3. KMX Starter Package
• KMX Starter Package includes 3 Group Edition licenses, 2 days preparation/Q&A (remote) and 2 days on-
site training on client specific use-case by experienced IP consultant, per year.
All prices are based on minimum contract of 2 years, including Maintenance, Updates and Support and
excluding VAT and excluding travel. Multi user discounts may apply.
Note: Treparel also provides proof-of-concepts to test the software in a limited period of time supported
by experience IP consultant.
*A User is defined as a named individual, however the Treparel Fair Use policy allows sharing.
18Treparel KMX – All rights reserved 2014 www.treparel.com
19. Part 2:
KMX software:
User Interface, key functions & value
Treparel KMX – All rights reserved 2014 19www.treparel.com
20. KMX : Model, Analyse, Discover and Visualize
in one view and deploy it to large scale
Document text
Search and
highlighting
Landscape visualization Coloring of classification score
Brushing
Filtering
20
KMX Example: ‘Ebola, SARS, Bird flue: How do they relate?’
Treparel KMX – All rights reserved 2014 www.treparel.com
21. KMX : Optimize Output
using Classification Performance Tuning
Precision
And
Recall
Distribution of classification scores
Document
classification
for three
classes
21Treparel KMX – All rights reserved 2014 www.treparel.com
22. Business Value from Patents with KMX
Text Analytics for Anyone and Everyone – Intuitive to use and learn. Designed
for every user: business (info consumers) and scientific (info creators).
Instant Business Insights – Explore all of your unstructured data (text, blogs,
email, patents) without limits.
Rapid Time to Value - Adaptable and customizable to users needs. No
implementation or extensive and expensive modelling or development.
Significant less training and tuning.
Any size deployment – Meets every business need from a single user to large
multilevel type user groups.
Language independent – Search and analyze most of the world’s languages using
machine translation.
Any kind or deployment - Use it from your desktop or in a - private - cloud. Buy
the software-as-a-service or get the output-as-a-service.
Enterprise-proven, IP & IT friendly – Successfully delivering value to IP, business
and markets in multinational companies.
Integration – Use the KMX API to increase the value of unstructured data in
other software applications
22Treparel KMX – All rights reserved 2014 www.treparel.com
23. Part 3:
NEW: IP and R&D Dashboard
Integrated SAAS based search, reporting,
visualization and analysis
Built in partnership with:
Treparel KMX – All rights reserved 2014 23www.treparel.com
24. Evalueserve IP and R&D NG Dashboard
A Joint Development by Evalueserve and Treparel
Evalueserve Treparel
Largest IP and R&D Search and
Analytics Provider..
Leading developer of text analytics
and visualization software..
…understands what information
professionals need and how end
users would like to receive IP and
R&D search and analytics projects
..has analytics and visualization
technology successfully used by
leading companies for R&D and IP
information
Have joined forces to develop and market a state of the art next generation
IP and R&D Dashboard…
Treparel KMX – All rights reserved 2014 www.treparel.com
25. NG IP and R&D Dashboard
Key Features
• …it is not a patent database (or any other database) x
• …it is (not yet) an advanced text analytics tool x
• it is a tool for the information professional to deliver IP and R&D search
and analytics projects to end users √
• ..it is source neutral – can take data of any source (patent, non-patent
literature, product info, etc. √
• ..end users and information professionals benefit from advanced
visualization and search functions for efficient work with projects such as
patent landscapes or competitive intelligence projects √
• ..it hosts workflows such as alerts √
• ..it is sold as product or product plus service √
Treparel KMX – All rights reserved 2014 www.treparel.com
26. Advanced Searching, Filtering,
Browsing,
Customized Alerting
Indexing
(Manual/
Automated)
IP and R&D Information Management Tool
An Efficient & Intuitive Knowledge Management System
> Source Independent
Data
Patent Data
Scientific Literature
Business News
Project 1
Project 2
Interactive and
Dynamic Visualization
Evalueserve
NG Dashboard
Treparel KMX – All rights reserved 2014
27. IP & RD Dashboard:
Content Driven Analytical solution
Treparel KMX – All rights reserved 2014 www.treparel.com 27
Ease of Use access to Search, Reporting & Analysis of
content like Patents, Emails, Legislation, Application Notes, websites
28. IP & RD Dashboard:
Content analytics beyond key-word search
Treparel KMX – All rights reserved 2014 www.treparel.com 28
Interactive taxonomy with multiple coupled views
and advanced search in large sets of documents
29. IP & RD Dashboard:
Built in analytics & interactive visualizations
Treparel KMX – All rights reserved 2014 www.treparel.com 29
Ad-hoc or Standard interactive visualizations
leading directly to the underlying documents or notes