44. Recall is the retrieval of all items that are relevant to the query
45. Precision is the retrieval of only those items that are relevant to the query
46. Higher precision leads to missing items that may be relevant but use a different vocabulary
47. Higher recall leads to the retrieval of too many items that may be unrelated to the query Automatic Concept Identification has the ability to increase precision with no loss of recall
48.
49. Deliver a robust content management approach maximizing SharePoint technologies
57. Users can’t navigate to information. Taxonomies provide consistent guided navigation for end users to extract relevant information even in external content. Taxonomy navigation is 36%-48% faster and more efficient than lists.
58. Vocabulary normalization across diverse geographies and cultures causes issues and inhibits sharing of knowledge and expertise due to nomenclature.
59. Case Study: Fortune 500 firm realized that search alone did not solve findability issues. Implemented conceptClassifier to secure and manage content in a policy-compliant manner, eliminated end user tagging, delivered the ability to rapidly build and deploy taxonomies, and to normalize vocabulary across global boundaries.KNOWLEDGE WORKERS CHALLENGES ~ 15% of their time is spent duplicating information. ~ 25% of their time is spent searching. ~ 40% can not easily find the information they require to do their job. The cost to a 500 employee company is $2.4 million per year in inefficiencies and lost productivity. Gartner Group
60.
61. Ensures adoption to any enterprise regulation for external agencies or where compliance is mandatory
62. Easy Integration with Microsoft Records Center. Ensures the long term usefulness of the records and enforcement of life cycle management.
64. Optional updating of Content Type based on the metadata contained within the documents
65. Case Study: US Air Force Medical Service eliminated all manual metadata tagging and uses conceptClassifier to automatically generate semantic metadata, assign record retention codes based on the metadata within the content, automatically change the Content Type and migrate documents to the RMCOMPLIANCE & RECORDS MANAGEMENT CHALLENGES ~ The average cost of manually tagging one item is estimated at $4.00 - $7.04 ~IDC estimates that only 50% of content is correctly meta-tagged ~ It costs and organization $180 per document to recreate it when it is not tagged correctly and cannot be found ~ Poor information quality costs organizations 10% to 20% of operating revenues
66.
67. Migrates content to a secure location where Windows Rights Management Services is applied to the file in the new location
69. The taxonomy standardizes the process of identifying all possible privacy data exposures at the time of content creation and modification (digital and handwritten).
70. Case Study: conceptClassifier for SharePoint extracted 2,000+ documents with sensitive information, from a redacted sample pilot data set. By human error, these 2,000 records contained real social security numbers and real employee information. These documents were identified in the proof of concept to the client in front of executive management.DATA BREACHES & EXPOSURES CHALLENGES ~ Average cost of a data breach is $6.3 million and ranges from $225K to $35 million. ~ Average cost per exposed record is $197 and ranges from $90-$305 per record. ~ 70% of breaches were due to a mistake or malicious intent by an organization’s own staff. ~ Healthcare provider - $7 million, TJX Companies - $256 million, ValueClick - $2.9 million.
71.
72. Generate weighted results from diverse repositories such as human resources records, time and billing, project documentation, content authorship, team structures, user profiles
73. Results are generated based on the most experienced and knowledgeable individual for that specific topic or skill set aggregated from diverse repositories
76. Case Study: Professional Services firm with over 39K employees across 36 countries uses conceptClassifier for expert identification to identify and utilize in-house consultants for projects as opposed to outsourcing – increasing utilization of staff by 5% to 10%Collaboration and Enterprise 2.0 ~ Nearly 80% of executives believe collaboration is important but needs to be managed ~Email storage costs $500GB per year – a Fortune 100 manufacturing company saved $2.6 million per year by implementing collaboration solutions ~ Up to 90% of content from premium paid publication database services is available for free on-line ~ Only 25% of executives describe their organization as effective at sharing knowledge across boundaries
88. Search will return results based on the concept even if the exact terms are not contained in the document (i.e. ‘coronary artery surgery’, ‘heart surgery’)
89. Metadata can be used by any search engine index or any application/process that uses metadataTriple Baseball Three Heart Organ Center Bypass Highway Avoid
120. API is based entirely on Web Services and all information is exchanged in XML T
121. Taxonomy formats are based on Web Ontology Language (OWL). Since the server is stateless it also works with all failover and load balancing hardware and software.
122.
123. There is no way to automatically generate metadata when it is created or ingested
129. Term Sets provide capability for faceted search and hierarchical navigation: Regions Country/State, Business Unit/Departments, Band Names/Album Names, TV Show Titles/Characters
130. conceptClassifier fully supports SharePoint 2010 EMM as the primary location for taxonomy definitions with no need to Import/Export
134. Improves search outcomes by placing conceptual metadata in the FAST Search index to increase relevancy of search results
135. Enables import of FAST Entities into the conceptClassifier taxonomy manager to fine-tune them with metadata generated from your own content and nomenclature
136. Runs natively as a FAST Pipeline Stage eliminating integration and customization issues
144. Identification and location of sensitive information (PII, PHI, Confidential) and migrates content to new location where Windows Rights Management Services are applied
145. Automatically tags and classifies content based on semantics contained within the actual document of record and optionally updates the Content Type
155. Freedom of Information Act (FOIA)Distribution Statement A: Approved for public release; distribution is unlimited. 311 ABG/PA No. 09-488, 16 Oct 2009