SharePoint and Office 365
Information Governance and Compliance
Challenges
Don Miller
Vice President of Sales
Concept Searching
donm@conceptsearching.com
Twitter @conceptsearch
• Company founded in 2002
• Product launched in 2003
• Focus on management of structured and unstructured information
• Technology Platform
• Delivered as a web service
• Automatic concept identification, content tagging, auto-classification,
taxonomy management
• Only statistical vendor that can extract conceptual metadata
• KMWorld ‘100 Companies that Matter in KM’ in 2009, 2010, 2011, 2012, 2013, and 2014,
and KMWorld Trend-Setting Product in 2009, 2010, 2011, 2012, and 2013
• Authority to Operate enterprise wide US Air Force and enterprise wide
NETCON US Army
• Locations: US, UK, and South Africa
• Client base: Fortune 500/1000 organizations
• Microsoft Business-Critical SharePoint program partner,
Gold Certification in Application Development
• Smart Content Framework™ for Information Governance comprising
• Five Building Blocks for success
• Product Platforms: conceptClassifier for SharePoint, conceptClassifier for Office 365,
conceptClassifier, and Concept Searching Technology
The Global Leader in
Managed Metadata Solutions
Metadata, Auto-classification,
and Taxonomies
Types of Classification Metadata
Intrinsic – information that can be extracted
directly from an object (file name, size)
Administrative/Management – information
used to manage the document (author, date
created, date to be reviewed)
Descriptive – information that describes the
object (title, subject, audience)
Semantic – ability to extract concepts from
within content and generate the metadata
(intelligent metadata)
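As a rough illustration of the four metadata types, the sketch below groups them into one record. Only the intrinsic part is read from the file itself; the function names and record layout are hypothetical, and the administrative, descriptive, and semantic values are passed in because they would come from people, policies, or a separate classifier.

```python
from pathlib import Path


def intrinsic_metadata(path: Path) -> dict:
    """Collect metadata that can be read directly from the file object."""
    info = path.stat()
    return {
        "file_name": path.name,
        "size_bytes": info.st_size,
        "extension": path.suffix.lower(),
    }


def build_record(path: Path, author: str, title: str, concepts: list[str]) -> dict:
    """Group the four metadata types into one record (hypothetical layout)."""
    return {
        "intrinsic": intrinsic_metadata(path),      # extracted from the object
        "administrative": {"author": author},      # used to manage the document
        "descriptive": {"title": title},           # describes the object
        "semantic": {"concepts": concepts},        # concepts found in the content
    }
```

In this sketch the semantic entry is simply supplied by the caller; in practice it is the part that an auto-classification tool would generate.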
Why do you care?
• Without effective governance, most technology focused metadata
projects will fail (Forrester Research)
• Less than 50% of content is correctly indexed, meta tagged, or
efficiently searchable
• Unstructured data and metadata are increasing at an average
annual growth rate of 62%
• Corporations will be responsible for the security, privacy, reliability,
and compliance of 85% of that information
(IDC 2010 Digital Universe Study)
• 67% of data loss in records management is due to end user error
(Prism International)
• 70% of data breaches are due to end user error (Ponemon Institute)
Metadata Matters
The Challenges of Content Overload
• 80% of enterprise data is unstructured (IDC)
• 60% of documents are obsolete (eLaw)
• 50% of documents are duplicates (Equivio)
The Benefits of Automatic Semantic Metadata Generation
• Elimination of costs and errors associated with end user tagging
• Identification and protection of secure content assets from
unauthorized access and portability in accordance with compliance
procedures
• Automatic in-place identification and tagging of documents of record
• Normalization of content across functional and geographical
boundaries
• Integration with the enterprise search
• Ability to apply policy consistently across diverse repositories
A manual metadata approach will fail 95%+ of the time
Issue | Organizational Impact
Inconsistent | Less than 50% of content is correctly indexed, meta-tagged, or efficiently searchable, rendering it unusable to the organization (IDC)
Subjective | Highly trained information specialists will agree on meta tags between 33%-50% of the time (C. Cleverdon)
Cumbersome – expensive | Average cost of manually tagging one item runs from $4 - $7 per document and does not factor in the accuracy of the meta tags nor the repercussions from mistagged content (Hoovers)
Malicious compliance | End users select first value in list (Perspectives on Metadata, Sarah Courier)
No perceived value for end user | What’s in it for me? End user creates document, does not see value for organization nor risks associated with litigation and non-conformance to policies
What have you seen | Metadata will continue to be a problem due to inconsistent human behavior
Why is metadata so hard to get right?
• Concept Searching’s unique statistical concept identification underpins all technologies
• Multi-word term suggestion is more valuable than single-term suggestion
Concept Searching has a unique approach to ensure success
• conceptClassifier for SharePoint generates conceptual
metadata by extracting multi-word terms, identifying
‘triple heart bypass’ as one concept rather than as three single keywords
• Metadata can be used by any search engine index or any
application/process that uses metadata.
Concept Searching provides Automatic Concept Term Extraction
Possible meanings of each single word:
• Triple: Baseball, Three
• Heart: Organ, Center
• Bypass: Highway, Avoid
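To make the multi-word idea concrete, here is a minimal sketch that counts repeated word sequences and keeps the longest ones. It is a simple n-gram count written for illustration, not Concept Searching's statistical algorithm; the stopword list, function names, and thresholds are all assumptions.

```python
import re
from collections import Counter

# Small illustrative stopword list (an assumption, not a standard list).
STOPWORDS = {"a", "an", "and", "the", "of", "to", "for", "was", "is", "in", "after"}


def candidate_phrases(text: str, max_len: int = 3) -> Counter:
    """Count every run of 1..max_len consecutive tokens that has no stopword."""
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter()
    for n in range(1, max_len + 1):
        for i in range(len(tokens) - n + 1):
            gram = tokens[i:i + n]
            if any(tok in STOPWORDS for tok in gram):
                continue
            counts[" ".join(gram)] += 1
    return counts


def multi_word_terms(text: str, min_count: int = 2) -> list[str]:
    """Return repeated multi-word phrases, longest first, dropping sub-phrases
    already covered by a kept longer phrase that occurs at least as often."""
    counts = candidate_phrases(text)
    repeated = [p for p, c in counts.items() if c >= min_count and " " in p]
    repeated.sort(key=lambda p: (-len(p.split()), p))
    kept = []
    for phrase in repeated:
        covered = any(
            f" {phrase} " in f" {longer} " and counts[longer] >= counts[phrase]
            for longer in kept
        )
        if not covered:
            kept.append(phrase)
    return kept
```

Run on two sentences that both mention a triple heart bypass, this sketch keeps the three-word phrase and discards ‘triple heart’ and ‘heart bypass’, which is the behaviour the slide describes: the phrase carries the concept, the single words do not.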
Unique Approach
OK, we have our metadata,
what’s next?
Auto-classification
• Supervised – some external mechanism, such as
human feedback, provides information on the correct
classification
• Unsupervised – also known as document clustering,
where the classification has no reference to external
information
• Semi-supervised – where only part of the document set
is labeled by an external mechanism, such as human
intervention, and the rest is left unlabeled
Automatic Document Classification
• Statistical
• Rules-based
• Linguistic
• Machine Learning
• Semantic Networks
Auto-classification Systems
Auto-classification Systems – What do they do?
Document
Preparation
• Split into language
blocks (paragraphs,
headings),
formatting, layout
Parsing
• Entity extraction
• NLP: parts of speech,
phrases
• Terms, variants
Weighting
• Frequency
• Location in text,
phrase
• Proximity
• Combination
• Format of text
Classification
• If threshold reached
• Can influence search
results
This is where rules vs. statistics come into play…
Not all classification solutions are created equal!
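The four stages above can be sketched in a few lines. This is a toy rules-based scorer under stated assumptions: the categories, weights, heading bonus, and threshold are invented for illustration, and a heading is assumed to be a line starting with ‘#’. It is not how any particular product works.

```python
import re

# Hypothetical category rules: phrase -> weight.
CATEGORIES = {
    "Cardiology": {"triple heart bypass": 5.0, "heart": 1.0, "bypass": 1.0},
    "Roads": {"highway": 2.0, "bypass": 1.0},
}
HEADING_BONUS = 2.0  # location weighting: a hit in a heading counts double
THRESHOLD = 5.0      # a category is assigned only if its score reaches this


def split_blocks(document: str) -> list[tuple[str, bool]]:
    """Document preparation: split into lines and flag headings."""
    blocks = []
    for line in document.splitlines():
        line = line.strip()
        if line:
            blocks.append((line.lstrip("# ").lower(), line.startswith("#")))
    return blocks


def score(document: str, terms: dict[str, float]) -> float:
    """Parsing and weighting: frequency times weight, boosted in headings."""
    total = 0.0
    for text, is_heading in split_blocks(document):
        for phrase, weight in terms.items():
            hits = len(re.findall(r"\b" + re.escape(phrase) + r"\b", text))
            total += hits * weight * (HEADING_BONUS if is_heading else 1.0)
    return total


def classify(document: str) -> dict[str, float]:
    """Classification: keep only categories whose score reaches the threshold."""
    scores = {name: score(document, terms) for name, terms in CATEGORIES.items()}
    return {name: s for name, s in scores.items() if s >= THRESHOLD}
```

A document that mentions ‘bypass’ once scores a little for both categories, but only the category whose multi-word phrase also matches clears the threshold. A statistical system would learn such weights from the content instead of having them written by hand.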
We still have one more missing
piece!
Taxonomies
Types of Taxonomies
List, Picklist, Controlled Vocabulary, Authority Files –
list of lead or preferred terms, selected by the end user,
may or may not have relationships among the terms, can
include a synonym ring
Synonym Lists - the use of synonyms allows one concept
to be instantiated as the same as the other, but still allows a
term to be preferred over another
Hierarchical – each content item resides in only one
category, referred to as a ‘tree’
• Piano
• Musical Instrument
Types of Taxonomies
Polyhierarchical, Faceted, Thesauri – content
items can exist in more than one category, more
structured controlled vocabulary, provides
information about each term and its relationship to
other terms, features of a hierarchical taxonomy plus
associative relationships
• Piano
• Musical Instrument
• Stringed Instrument
• Percussion Instrument
Ontology – multiple taxonomies with additional
relationships added to specify concepts within
a domain
Sources: Marlene Rockmore – The Taxonomy Blog, and Heather Hedden, author of ‘The Accidental Taxonomist’
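The difference between a strict hierarchy and a polyhierarchy can be shown with a small data structure. The sketch below assumes the Piano example is meant as Piano sitting under both Stringed Instrument and Percussion Instrument, which in turn sit under Musical Instrument; that reading of the slide, and all function names, are assumptions.

```python
# Each term maps to its broader (parent) terms.
POLYHIERARCHY = {
    "Piano": ["Stringed Instrument", "Percussion Instrument"],
    "Stringed Instrument": ["Musical Instrument"],
    "Percussion Instrument": ["Musical Instrument"],
    "Musical Instrument": [],
}

# In a strict hierarchy (a 'tree') every term has at most one parent.
HIERARCHY = {
    "Piano": ["Musical Instrument"],
    "Musical Instrument": [],
}


def ancestors(term: str, parents: dict[str, list[str]]) -> set[str]:
    """Return every broader term reachable from the given term."""
    seen = set()
    stack = list(parents.get(term, []))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents.get(current, []))
    return seen


def is_strict_hierarchy(parents: dict[str, list[str]]) -> bool:
    """True when no term has more than one parent."""
    return all(len(broader) <= 1 for broader in parents.values())
```

Content tagged ‘Piano’ in the polyhierarchy is found by browsing from either parent, which is the practical benefit of letting an item exist in more than one category.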
Let’s talk about SharePoint,
Office 365, OneDrive, Delve
and Information Governance
Why Information Governance in SharePoint and/or Office 365?
• A single semantic framework regardless of
where content resides
• Maximizes value of enterprise information
assets
• Reduces liabilities surrounding lack of
processes
• Provides a way to manage unstructured
and semi-structured data in a cloud or
hybrid environment
• Reduces organizational risk
• Avoids the cost of non-compliance
• Addresses cyber security and data
protection
• Improves decision making and
organizational performance
The goal of information governance is to
optimize the value of information, while
simultaneously minimizing the
associated risks and costs
What is the challenge with SharePoint?
• Search: SharePoint 2013 search is much improved with regard to features, but the
same old problem of user tagging compromises the quality and relevancy of
search results; 85% of relevant documents are never retrieved in search (IDC)
• Security at the content level: 83% of data harm/damage is due to user mistakes
and accidents, and only 1% to malicious internal user behavior
• The probability of a material data breach in an organization with 10,000
records is 22%
• Average cost to the organization is $3.3 million (Ponemon Institute)
• Records Management: 88% of organizations are challenged by regulatory
change (Robert Half International), less than 50% of content is correctly indexed,
meta tagged, or efficiently searchable (IDC), average cost of manually tagging
one record is $4-$7
• Migration: mass moves impact search, eDiscovery, and content management
• Duplicates, content that should be archived, and just plain garbage are never
addressed, which can become a storage issue
What is the challenge with Office 365?
See Previous Slide
• And…
• How do you manage security?
• What is required for applications? Multi-factor
authentication, encryption, ‘enterprise ready’?
• The average organization loads 85.6GB of high risk
applications to the cloud (Skyhigh Networks)
• How do you replicate your business processes in
Office 365?
• How do you integrate with SharePoint applications that
are not cloud ready?
• How do you manage all content and keep it in synch?
• How do you determine what content goes where?
• How do you audit for compliance?
• How do you identify and manage risk?
Office 365 Technology Challenges
• Office 365 limits the type of solutions
that can be installed
• No full trust solutions, sandbox only
• Reduced SharePoint and Web APIs
Auto-Classification Challenges in Office 365
• The restrictions in Office 365 pose specific challenges to an
auto-classification application
• Metadata updates cannot invoke a system update:
A problem if you want to update MMP without corrupting the
Modified By user and date
• Term Store APIs are restricted:
Rename Term and Delete Term are not supported and Term GUIDs
cannot be specified
What is the challenge with OneDrive?
What are your users doing, despite availability of enterprise tools?
• 89% of 5,187 full-time employees use consumer file sync and storage tools at work,
despite the security risks; 25% use three or more consumer/commercial products to
get work done
• 44% rely on email and memory sticks (Ovum)
• Your content is only as safe as your lowest common denominator user
• Do you know what is being saved to OneDrive?
• What is your tolerance for risk or loss of confidential information?
• In a BYOD world do you know what OneDrive is being synched to?
• How do you handle a lost device that is synched to OneDrive?
• Several organizations are turning off OneDrive for Business because there is no way to
guarantee what is being posted there is compliant with governance, enterprise policies,
and directives
• Users may be unaware their My Documents are
being auto synched to OneDrive
• Is this the right approach?
So, what’s in your OneDrive?
What is the Risk with SharePoint, Office 365, and OneDrive?
• Loss of confidential information
• Proposals
• Pricing information
• Financial forecasts
• Negative reports
• Security information and protocols
• Trade secrets
• Legal liability
• PII/PIA/HIPAA
• Customer/Confidential information
• Corporate Reputation
• Can you survive a loss of consumer faith? Can your boss?
Isn’t OOTB good enough?
• Short answer, No
• You don’t know what is being synched or loaded to OneDrive, in fact,
your users may not realize what is being synched, so when they save
that super secret quarterly earnings report to their My Documents…
• Little integration between on-premise and hybrid environments
• Cloud is still risky business, unless managed in accordance with
enterprise policy
• Lack of controls to prevent use and access to ‘non-approved’ cloud
services and applications
• MSFT has a solution that will notify you that something is amiss, but…
• That might work if you only have 20, 30, maybe 100 personnel and
OneDrives but can you realistically monitor 90,000 or more in that
fashion?
• The solution must include additional automation and options such as DRM
The solution must intercept content before it is available in OneDrive
What about Microsoft’s Recent DLP Announcement?
• Only addresses the cloud
• Not hybrid
• Not on-premise
• The average number of cloud services used by an enterprise came in at
738, 10 times more than what IT typically expects from its employees,
…and you are going to hire how many more people?
• Does not really do much more than a well crafted search query
• Do you have the time to go look at 90,000 OneDrives and ensure Office
365 is not the repository for confidential information? …and you are
going to hire how many more people?
• Does not stop unauthorized cloud services from being placed in Office 365
or content from being placed in OneDrive, which means it is available until an
admin goes in, identifies it, and removes it; that might take hours, days, or never happen
• So, if you were a large pharma company and the FDA wrote a nasty report
on your new miracle drug, that report might be in the wild for hours,
days, or even a week before it is secured. Is that a risk you can live
with?
What about Microsoft’s Delve?
• We all know what Delve is – right?
• Search and discovery tool that
automatically delivers the most
interesting, most useful and most
relevant information from across all of
Office 365 using machine learning and
artificial intelligence
• Delve will ‘guess’ what you are looking
for based on your activities
• Office Graph is the search engine
technology using artificial intelligence,
based on Yammer’s Enterprise Graph
technology, developed by the previous
FAST group located in Oslo
• Currently supports Exchange, Yammer,
OneDrive for Business, SharePoint
Online
To use or not?
• Only addresses the cloud
• Not hybrid - Not on premise
• Works best across distributed teams and
workgroups
• Personal tool, dependent on end user adoption
• Enforcement issues
• Caveat: If you don't allow access to the Office
Graph, you also disable solutions that are built on
top of it, such as Delve, and you remove Delve
from the Office 365 global navigation
• Does this replace Office 365 search?
• No, augments it
• Is SharePoint Integration planned?
• No
• Security concerns
• Whistle blower protection
• Personal security violations
• Accuracy? Artificial Intelligence is great but…
Augment the tools Microsoft gives you… Why?
• An added value third party application able to generate and use intelligent
metadata can deliver an organization significant benefits in terms of
productivity, governance and compliance, across SharePoint, Office 365, and
OneDrive
• What makes this different?
• Intelligence to identify and trap content as it is migrated from file shares
during the onboarding process
• Automation to move the content to a secure location for evaluation
• The option to invoke and kick off Information Rights Management
• The option and ability to expand this to include Documents of Record
• Aligns with Governance policies across the organization, file shares,
SharePoint on-premise and SharePoint Online
Most important, intercept and secure the content before it becomes
available in OneDrive, and identify unauthorized cloud services or
applications that have been loaded into Office 365
What can an Enterprise Metadata Enabled Approach Achieve?
• Immediate
• Supports and facilitates development of your enterprise
metadata and content management strategy
• Improves search and findability
• Intelligent content migration into Office 365/OneDrive
• Auto-classifies all content, regardless of whether IT is
aware of its existence
• Elimination of end user tagging
• Elimination of silos of information
• Ongoing
• Supports your records management requirements
without a hybrid farm, automatic in-place record
declaration
• Protects your confidential assets in real time: removes them
from search and disables download
• Standardizes business processes across all
environments
• Enterprise policy and governance enforcement, across
on-premise and the cloud
What can you solve?
Search
Records management
Data privacy
Migration
Security
eDiscovery
Content management
Collaboration
Business social networking
Text analytics
Evaluating Solutions
A Few Questions to Ask to Get You Started
• How often should a drive or repository be indexed for
new content?
• Does the system need to perform in real-time?
• Should old content be re-evaluated to determine
whether it belongs in a different
category?
• How are classification errors resolved?
• Should the user have the ability to override the
classification assignment?
• How long should deployment and ongoing
management take?
• How much end user involvement can be eliminated?
• How does the system handle vocabulary and/or
language ambiguities?
Calculating ROI
Show me the ROI
• Create enterprise automated metadata framework/model
• Average return on investment starts at a minimum of 38% and runs as high as 600%
(IDC)
• Migration: can individuals ensure the right documents are migrated and
the sensitive ones are removed?
• Apply consistent meaningful metadata to enterprise content
• Incorrect meta tags cost an organization $2,500 per user per year, in
addition to potential costs for non-compliance (IDC)
• Guide users to relevant content with taxonomy navigation
• Savings of $8,965 per year per user based on $80,000 salary
(Chen & Dumais)
• 100% ‘recall’ of content, 35% faster access to content ‘precision’
• Use automatic conceptual metadata generation to improve records
management
• Eliminate inconsistent end user tagging at $4-$7 per record (Hoovers)
• Improve compliance processes, eliminate potential privacy exposures
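The per-record and per-user figures quoted above can be turned into a back-of-the-envelope estimate. The sketch below only multiplies the quoted rates ($4-$7 per manually tagged record, $2,500 per user per year for incorrect meta tags) by volumes you supply; the function names are hypothetical and the example volumes in the note are assumptions, not figures from the presentation.

```python
def manual_tagging_cost(records: int, low: float = 4.0, high: float = 7.0) -> tuple[float, float]:
    """Range of manual tagging cost at the quoted $4-$7 per record."""
    return records * low, records * high


def mistagging_cost(users: int, per_user: float = 2500.0) -> float:
    """Yearly cost of incorrect meta tags at the quoted $2,500 per user."""
    return users * per_user


def summary(records: int, users: int) -> str:
    """Format both estimates as one line of text."""
    low, high = manual_tagging_cost(records)
    return (
        f"Manual tagging of {records:,} records: ${low:,.0f} to ${high:,.0f}; "
        f"mistagging for {users:,} users: ${mistagging_cost(users):,.0f} per year"
    )
```

For an assumed organization with 100,000 records and 1,000 users, manual tagging works out to between $400,000 and $700,000, and mistagging to $2,500,000 per year. These are illustrative multiplications of the cited rates, not measured savings.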
Real World Savings
Pique Solutions
• The Business Solutions
• Search
• Records Management
• Migration
• Data Security
• eDiscovery/Litigation Support,
FOIA
• Information Governance
• Text Analytics
• Social Tagging
• Collaboration
• Content Management
• Metadata Management
Intelligent Metadata Enabled
Solutions in Office 365,
OneDrive for Business,
or a non-Microsoft Cloud
Environment
Enterprise Search
“By itself the search function has limited value. The real value
of search and information access technologies is in the
ongoing efforts needed to establish effective taxonomies,
to index and classify content of all kinds, in order to
provide meaningful results.”
Tom Eid, Research Vice President Gartner Group
Metadata Drives Precision Search
• Keyword search captures only 33% of relevant
information. Consistent, meaningful metadata
ensures all relevant information related to key
words will be returned.
• Users can’t navigate to information. Taxonomies
provide consistent guided navigation for end users
to extract relevant information even in external
content. Taxonomy navigation is 36%-48% faster
and more efficient than lists.
• Lack of vocabulary normalization across diverse
geographies and cultures causes issues and
inhibits sharing of knowledge and expertise due to
differences in nomenclature.
Knowledge Workers’ Challenges
• 15% of their time is spent
duplicating information
• 25% of their time is spent searching
• 40% cannot easily find the
information they require to do their
job
• The cost to a 500 employee
company is $2.4 million per year in
inefficiencies and lost productivity
(Gartner Group)
Situation:
• Not-for-profit organization that contributes to the prevention and
cure of cancer
• More than 30,000 users
• Outpatient treatment programs that record more than 328,300
visits a year
Challenge:
• Portal to enable patients to access information relevant to their
specific health situations
• Accurate, medically sound, and secure information necessary
• Aggregate content from internal and external sources
Solution:
• conceptClassifier for SharePoint platform
• SharePoint 2010
• Microsoft FAST Search
• Integrated solution with partner Aeturnum
Benefits:
• Accuracy of search
• Relevance of results
• Confidence in data
• Control and trust
“With more than 30,000 current users,
the MyMoffitt Patient Portal has seen
significant growth, and of the new
patients that come to Moffitt, 87%
register for a patient portal account. All
developments and enhancements are
about improving the patient experience.”
Jennifer Camps, Director of Portal
Technologies and Data Management,
Moffitt Cancer Center
Read the Case Study
Case Study – Intelligent Search
Data Privacy and Cyber Security
• Works in conjunction with security
applications, or as a stand-alone application
• Protects and secures content assets from
search and portability such as
• Personally Identifiable Information (PII)
• Protected Health Information (PHI)
• OPSEC
• Or any metadata that is deemed confidential
by the organization
• Identification in real time as content is created
or ingested
• Same approach as records management
• The taxonomy standardizes the process of
identifying all possible privacy data
exposures – digital and handwritten
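Identifying privacy data as content is created or ingested can be pictured as a scan that tags a document before it becomes searchable. The sketch below uses two simple regular expressions (a US Social Security number format and an email address); the patterns, labels, and function names are assumptions for illustration, and real PII or PHI detection needs far broader rules than this.

```python
import re

# Illustrative patterns only; production rules would be much more extensive.
PATTERNS = {
    "us_ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "email": re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b"),
}


def find_pii(text: str) -> dict[str, list[str]]:
    """Return the matches for each pattern that occurs in the text."""
    found = {}
    for label, pattern in PATTERNS.items():
        matches = pattern.findall(text)
        if matches:
            found[label] = matches
    return found


def tag_document(text: str) -> dict:
    """Produce metadata a downstream process could use to restrict access."""
    hits = find_pii(text)
    return {"confidential": bool(hits), "pii_types": sorted(hits)}
```

The resulting tag is just metadata; removing the document from search or blocking download would be done by whatever system consumes it.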
Data Breaches and Exposures
Challenges
• Average cost of a data breach is
$6.3 million and ranges from
$225 thousand to $35 million
• Average cost per exposed record is
$197 and ranges from $90-$305
per record
• 70% of breaches were due to a
mistake or malicious intent by an
organization’s own staff
• Healthcare provider - $7 million,
TJX Companies - $256 million,
ValueClick - $2.9 million
Situation:
• Budget of $6.9 Billion
• Over 60,000 users
• Runs 75 hospitals and clinics providing care to more than 2.6 million
beneficiaries
Challenge:
• Data Privacy
• Intelligent Migration
• Before and after
• Records Management
• Pilot project: 72,000 Site Collections, 5,300 retention codes,
classify 200,000 documents per hour with minimum resources
Solution:
• conceptClassifier for SharePoint platform
Benefits:
• Automatic tagging based on organizational vocabulary and descriptors
• Automatic routing and the ability to change the SharePoint content type
• Eliminated manual tagging; removes content from unauthorized access and
portability
• No security exposures or breaches in 4 years
“Concept Searching’s Taxonomy
Manager provides our Subject
Matter Experts with a user friendly
web interface enabling the
development of controlled
vocabularies that can be used to
filter search results and auto-
classify content to folder
structures.”
J.D. Whitlock, Lt Col, USAF,
MSC, CPHIMS
Air Force Medical Service
Read the Case Study
Case Study - Search, Data Privacy, Records Management,
Migration
‘Intelligent’ Migration
• Standalone or in conjunction with migration application
• Pre-migration: As content is migrated it is analyzed for
organizationally defined descriptors and vocabularies,
which will automatically classify the content to
taxonomies or optionally the SharePoint Term Store
• Security rights maintained
• Safeguards document confidentiality and identifies
security exposures and records that were previously
unknown
• Index content
• File Shares to File Shares, File Share to SharePoint
• SharePoint to SharePoint
• Custom Action – from any other repository
(.NET code and Web services)
• Plug in architecture to custom develop content
sources and destination sources
• Post-migration: the taxonomy hierarchy can be used to
improve search, as content will be organized by concept,
making it possible to find relevant content that
previously could not be retrieved
Migration Challenges
• 84% of data migration projects fail (Bloor)
• 72% of organizations delay migration
because it is too risky (Bloor)
• 70% of projects reported schedule
overruns of about 30%, while 64%
reported average budget overruns
of 16% (Hitachi Data Systems)
• Survey respondents rely on end users to
validate whether their data migration
was successful or not
(Enterprise Strategy Group)
Situation:
• Multiple Clients
Challenge:
• Simply moving content to new location did not
provide any benefits
• Human error and time were too costly
• Quantity of content too great
Solution:
• conceptClassifier for SharePoint platform
Benefits:
• Cleanses irrelevant and unnecessary documents
• Dramatically reduces the time for migration
• Eliminates manual intervention
• Improves the outcome enabling improvements in:
• Search
• Records management
• Data privacy
• eDiscovery and litigation support
• Text analytics
Case Study – Intelligent Migration
Automotive Parts Company
The goal was to improve search for
147,000 business users, but the company first needed to
migrate millions of documents.
conceptClassifier for SharePoint was
used pre- and post-migration, and
to enable concept-based searching
with the existing search engine and
taxonomy-based search after the
migration.
conceptClassifier for SharePoint
identified 66,000 duplicates out of a
total of 270,000 documents,
representing a 24% reduction in disk
space.
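The duplicate figures can be checked with simple arithmetic, and exact duplicates can be found by hashing file contents. The sketch below is an assumption about one common approach (grouping by SHA-256 digest), not a description of how conceptClassifier identifies duplicates; it only catches byte-identical copies.

```python
import hashlib
from collections import defaultdict


def find_duplicates(documents: dict[str, bytes]) -> dict[str, list[str]]:
    """Group document names by content hash; keep groups with more than one name."""
    groups = defaultdict(list)
    for name, content in documents.items():
        groups[hashlib.sha256(content).hexdigest()].append(name)
    return {digest: names for digest, names in groups.items() if len(names) > 1}


def duplicate_share(duplicates: int, total: int) -> float:
    """Percentage of documents that are duplicates."""
    return 100.0 * duplicates / total
```

With the case study numbers, 66,000 duplicates out of 270,000 documents is about 24.4% of the document count, which matches the quoted 24% disk space reduction only if the duplicates are of roughly average size.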
Office 365 Case Study
Situation:
• Global services firm
• SharePoint environment
• Over 170,000 users located across the globe - Americas, Europe, Middle East, India, Africa
(EMEIA), Asia Pacific, Japan
Challenge:
• Ability to communicate real-time with end users and clients regardless of where they reside or
how they are connected
• Information Governance issues
Solution:
• conceptClassifier for SharePoint
• conceptClassifier for Office 365
• conceptClassifier for OneDrive for Business
• conceptTaxonomyWorkflow
Benefits:
• Hybrid environment with Information Governance enforced across the global enterprise –
migration, data security, search, content management, records management
• Improved communication and access to real-time information
• High performance and scalability
• One core set of technologies – deploy once, use for multiple applications
Questions?
Freebie
If you would like to see a
demonstration or ask any further
technical questions, please come
by Booth 18 for
a chance to win Bose headphones!
Thank You
Don Miller
Vice President of Sales
Concept Searching
donm@conceptsearching.com
Twitter @conceptsearch

Más contenido relacionado

La actualidad más candente

Intelligent Metadata Enabled Migration with SharePoint
Intelligent Metadata Enabled Migration with SharePointIntelligent Metadata Enabled Migration with SharePoint
Intelligent Metadata Enabled Migration with SharePoint
Concept Searching, Inc
 
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
Concept Searching, Inc
 
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
Concept Searching, Inc
 
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
Concept Searching, Inc
 
Webinar: Does the SharePoint 2010 Term Store Seem Like Alphabet Soup? Find ...
Webinar:  Does the SharePoint 2010 Term Store Seem Like Alphabet Soup?  Find ...Webinar:  Does the SharePoint 2010 Term Store Seem Like Alphabet Soup?  Find ...
Webinar: Does the SharePoint 2010 Term Store Seem Like Alphabet Soup? Find ...
martingarland
 

La actualidad más candente (20)

FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...
 
Intelligent Metadata Enabled Migration with SharePoint
Intelligent Metadata Enabled Migration with SharePointIntelligent Metadata Enabled Migration with SharePoint
Intelligent Metadata Enabled Migration with SharePoint
 
Taxonomy and tagging – manual tagging does not work!
Taxonomy and tagging – manual tagging does not work!Taxonomy and tagging – manual tagging does not work!
Taxonomy and tagging – manual tagging does not work!
 
Why Use Add ins with SharePoint and SharePoint Online? Webinar
Why Use Add ins with SharePoint and SharePoint Online? WebinarWhy Use Add ins with SharePoint and SharePoint Online? Webinar
Why Use Add ins with SharePoint and SharePoint Online? Webinar
 
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy WebinarThe Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
 
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
 
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
 
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
 
Concept Searching Webinar
Concept Searching WebinarConcept Searching Webinar
Concept Searching Webinar
 
Webinar: Does the SharePoint 2010 Term Store Seem Like Alphabet Soup? Find ...
Webinar:  Does the SharePoint 2010 Term Store Seem Like Alphabet Soup?  Find ...Webinar:  Does the SharePoint 2010 Term Store Seem Like Alphabet Soup?  Find ...
Webinar: Does the SharePoint 2010 Term Store Seem Like Alphabet Soup? Find ...
 
Optimize and Organize Your Content with conceptClassifier for File Shares
SharePoint Fest Chicago Presentation

  • 1. SharePoint and Office 365 Information Governance and Compliance Challenges Don Miller Vice President of Sales Concept Searching donm@conceptsearching.com Twitter @conceptsearch
  • 2. • Company founded in 2002 • Product launched in 2003 • Focus on management of structured and unstructured information • Technology Platform • Delivered as a web service • Automatic concept identification, content tagging, auto-classification, taxonomy management • Only statistical vendor that can extract conceptual metadata • 2009, 2010, 2011, 2012, 2013, 2014 ‘100 Companies that Matter in KM’ KMWorld and Trend Setting product of 2009, 2010, 2011, 2012, 2013 • Authority to Operate enterprise wide US Air Force and enterprise wide NETCON US Army • Locations: US, UK, and South Africa • Client base: Fortune 500/1000 organizations • Microsoft Business-Critical SharePoint program partner, Gold Certification in Application Development • Smart Content Framework™ for Information Governance comprising • Five Building Blocks for success • Product Platforms: conceptClassifier for SharePoint, conceptClassifier for Office 365, conceptClassifier, and Concept Searching Technology The Global Leader in Managed Metadata Solutions
  • 4. Types of Classification Metadata Intrinsic – information that can be extracted directly from an object (file name, size) Administrative/Management – information used to manage the document (author, date created, date to be reviewed) Descriptive – information that describes the object (title, subject, audience) Semantic – ability to extract concepts from within content and generate the metadata (intelligent metadata)
  • 5. Why do you care? • Without effective governance, most technology focused metadata projects will fail (Forrester Research) • Less than 50% of content is correctly indexed, meta tagged, or efficiently searchable • Unstructured data and metadata are increasing at an average annual growth rate of 62% • Corporations will be responsible for the security, privacy, reliability, and compliance of 85% of that information (IDC 2010 Digital Universe Study) • 67% of data loss in records management is due to end user error (Prism International) • 70% of data breaches are due to end user error (Ponemon Institute)
  • 6. Metadata Matters The Challenges of Content Overload • 80% of enterprise data is unstructured (IDC) • 60% of documents are obsolete (eLaw) • 50% of documents are duplicates (Equivio) The Benefits of Automatic Semantic Metadata Generation • Elimination of costs and errors associated with end user tagging • Identification and protection of secure content assets from unauthorized access and portability in accordance with compliance procedures • Automatic in-place identification and tagging of documents of record • Normalization of content across functional and geographical boundaries • Integration with the enterprise search • Ability to apply policy consistently across diverse repositories
  • 7. A manual metadata approach will fail 95%+ of the time. Why is metadata so hard to get right? Metadata will continue to be a problem due to inconsistent human behavior. Issue: organizational impact • Inconsistent: less than 50% of content is correctly indexed, meta-tagged, or efficiently searchable, rendering it unusable to the organization (IDC) • Subjective: highly trained information specialists will agree on meta tags between 33% and 50% of the time (C. Cleverdon) • Cumbersome and expensive: the average cost of manually tagging one item runs from $4 to $7 per document and does not factor in the accuracy of the meta tags or the repercussions from mistagged content (Hoovers) • Malicious compliance: end users select the first value in the list (Perspectives on Metadata, Sarah Courier) • No perceived value for end user: What’s in it for me? The end user creates a document and does not see the value for the organization or the risks associated with litigation and non-conformance to policies • What have you seen?
  • 8. Unique Approach: Concept Searching has a unique approach to ensure success • Concept Searching’s unique statistical concept identification underpins all technologies • Multi-word suggestion is explicitly more valuable than single term suggestion algorithms • conceptClassifier for SharePoint will generate conceptual metadata by extracting multi-word terms that identify ‘triple heart bypass’ as a concept, as opposed to single keywords • Metadata can be used by any search engine index or any application/process that uses metadata • Concept Searching provides Automatic Concept Term Extraction • Diagram: taken as single keywords, ‘Triple’ suggests baseball or three, ‘Heart’ suggests organ or center, and ‘Bypass’ suggests highway or avoid
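To make the multi-word point concrete, here is a minimal sketch of phrase extraction by simple counting. It is not Concept Searching's statistical algorithm, which the slides do not describe; the stopword list, the function names `candidate_phrases` and `top_phrases`, and the sample sentences are all assumptions made for illustration.

```python
import re
from collections import Counter

# Assumed, deliberately tiny stopword list for the illustration.
STOPWORDS = {"a", "an", "and", "the", "of", "to", "in", "was", "had",
             "for", "after", "is", "on"}

def candidate_phrases(text, max_len=3):
    """Return 2..max_len word phrases built from runs of non-stopword tokens."""
    # Punctuation is kept as its own token so that it breaks a run.
    tokens = re.findall(r"[a-z]+|[.,;:!?]", text.lower())
    phrases, run = [], []
    for tok in tokens + [""]:  # empty sentinel flushes the final run
        if tok.isalpha() and tok not in STOPWORDS:
            run.append(tok)
            continue
        for n in range(2, max_len + 1):
            for i in range(len(run) - n + 1):
                phrases.append(" ".join(run[i:i + n]))
        run = []
    return phrases

def top_phrases(docs, max_len=3, min_count=2):
    """Count candidate phrases across documents; keep the recurring ones."""
    counts = Counter()
    for doc in docs:
        counts.update(candidate_phrases(doc, max_len))
    return [(p, c) for p, c in counts.most_common() if c >= min_count]

docs = [
    "The patient had a triple heart bypass.",
    "Recovery after triple heart bypass is slow.",
    "The highway bypass was closed.",
]
print(top_phrases(docs))
```

On these three sentences the recurring phrase 'triple heart bypass' surfaces as a unit, while 'bypass' in the highway sentence is not merged with it, which is the ambiguity the slide's diagram points at.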
  • 9. OK, we have our metadata, what’s next? Auto-classification
  • 10. Automatic Document Classification • Supervised – some external mechanism, such as human feedback, provides information on the correct classification • Unsupervised – also known as document clustering, where the classification has no reference to external information • Semi-supervised – where only some of the documents are labeled by an external mechanism, such as human intervention, and the rest are unlabeled
  • 12. Auto-classification Systems – What do they do? • Document Preparation: split into language blocks (paragraphs, headings), formatting, layout • Parsing: entity extraction; NLP (parts of speech, phrases); terms, variants • Weighting: frequency; location in text, phrase; proximity; combination; format of text • Classification: if threshold reached; can influence search results • This is where rules vs statistics come into play… Not all classification solutions are created equal!
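The weighting and threshold stages above can be sketched in a few lines. This is a toy scorer under stated assumptions, not any vendor's engine: the location weights, the category clue terms, and the names `score` and `classify` are all hypothetical, and only frequency and location in text are modeled (proximity, combination, and format are left out).

```python
import re

# Assumed weights: a term found in a heading counts three times a body hit.
LOCATION_WEIGHT = {"heading": 3.0, "body": 1.0}

def score(blocks, clue_terms):
    """Sum frequency * term weight * location weight over all blocks.

    blocks: list of (location, text) pairs from document preparation.
    clue_terms: dict mapping a term or phrase to its weight.
    """
    total = 0.0
    for location, text in blocks:
        lowered = text.lower()
        for term, weight in clue_terms.items():
            pattern = r"\b" + re.escape(term.lower()) + r"\b"
            hits = len(re.findall(pattern, lowered))
            total += hits * weight * LOCATION_WEIGHT.get(location, 1.0)
    return total

def classify(blocks, categories, threshold):
    """Return every category whose score reaches the threshold."""
    scores = {name: score(blocks, terms) for name, terms in categories.items()}
    return sorted(name for name, s in scores.items() if s >= threshold)

categories = {
    "Cardiology": {"heart bypass": 2.0, "cardiac": 1.0},
    "Roads": {"highway": 1.0, "bypass": 0.5},
}
blocks = [
    ("heading", "Triple heart bypass outcomes"),
    ("body", "Cardiac patients recover after heart bypass surgery."),
]
print(classify(blocks, categories, threshold=5.0))
```

A document is assigned to a category only "if threshold reached", so the single word 'bypass' contributes a little to Roads but not enough to classify the document there.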
  • 13. We still have one more missing piece! Taxonomies
  • 14. Types of Taxonomies List, Picklist, Controlled Vocabulary, Authority Files – list of lead or preferred terms, selected by the end user, may or may not have relationships among the terms, can include a synonym ring Synonym Lists - the use of synonyms allows one concept to be instantiated as the same as the other, but still allows a term to be preferred over another Hierarchical – each content item resides in only one category, referred to as a ‘tree’ • Piano • Musical Instrument
  • 15. Types of Taxonomies Polyhierarchical, Faceted, Thesauri – content items can exist in more than one category, more structured controlled vocabulary, provides information about each term and its relationship to other terms, features of a hierarchical taxonomy plus associative relationships • Piano • Musical Instrument • Stringed Instrument • Percussion Instrument Ontology – multiple taxonomies with additional relationships added to specify concepts within a domain Sources: Marlene Rockmore – The Taxonomy Blog, and Heather Hedden, author of ‘The Accidental Taxonomist’
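The difference between a hierarchical 'tree' and a polyhierarchy can be shown as a small data structure. This is a sketch only: the `Taxonomy` class is hypothetical, and placing Piano under both Stringed Instrument and Percussion Instrument is an assumed reading of the slide's example list.

```python
from collections import defaultdict

class Taxonomy:
    """Terms with parent links; a strict tree allows one parent per term."""

    def __init__(self, polyhierarchical=False):
        self.polyhierarchical = polyhierarchical
        self.parents = defaultdict(set)

    def add(self, term, parent):
        existing = self.parents[term]
        if not self.polyhierarchical and existing and parent not in existing:
            raise ValueError(f"{term!r} already has a parent in a strict hierarchy")
        existing.add(parent)

    def ancestors(self, term):
        """Collect every category reachable by following parent links."""
        seen, stack = set(), list(self.parents.get(term, ()))
        while stack:
            node = stack.pop()
            if node not in seen:
                seen.add(node)
                stack.extend(self.parents.get(node, ()))
        return seen

poly = Taxonomy(polyhierarchical=True)
poly.add("Stringed Instrument", "Musical Instrument")
poly.add("Percussion Instrument", "Musical Instrument")
poly.add("Piano", "Stringed Instrument")
poly.add("Piano", "Percussion Instrument")
print(sorted(poly.ancestors("Piano")))
```

With `polyhierarchical=False` the second `add` for Piano raises an error, which is the "each content item resides in only one category" rule of the hierarchical type.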
  • 16. Let’s talk about SharePoint, Office 365, OneDrive, Delve and Information Governance
  • 17. Why Information Governance in SharePoint and/or Office 365? • A single semantic framework regardless of where content resides • Maximizes value of enterprise information assets • Reduces liabilities surrounding lack of processes • Provides a way to manage unstructured and semi-structured data in a cloud or hybrid environment • Reduces organizational risk • Avoids the cost of non-compliance • Addresses cyber security and data protection • Improves decision making and organizational performance The goal of information governance is to optimize the value of information, while simultaneously minimizing the associated risks and costs
  • 18. What is the challenge with SharePoint? • Search: SharePoint 2013 search is much improved with regard to features, but the same old problem of user tagging compromises the quality and relevancy of search results, 85% of relevant documents are never retrieved in search (IDC) • Security at the content level: 83% of data harm/damage is due to user mistakes and accidents and only 1% to malicious internal user behavior • The probability of a material data breach in an organization with 10,000 records is 22% • Average cost to the organization is $3.3 million (Ponemon Institute) • Records Management: 88% of organizations are challenged by regulatory change (Robert Half International), less than 50% of content is correctly indexed, meta tagged, or efficiently searchable (IDC), average cost of manually tagging one record is $4-$7 • Migration: mass moves impact search, eDiscovery, and content management • Duplicates, content that should be archived, and just plain garbage are never addressed, which can be a storage issue
  • 19. What is the challenge with Office 365? See Previous Slide • And… • How do you manage security? • What is required for applications? Multi-factor authentication, encryption, ‘enterprise ready’? • The average organization loads 85.6GB of high risk applications to the cloud (Skyhigh Networks) • How do you replicate your business processes in Office 365? • How do you integrate with SharePoint applications that are not cloud ready? • How do you manage all content and keep it in synch? • How do you determine what content goes where? • How do you audit for compliance? • How do you identify and manage risk?
  • 20. Office 365 Technology Challenges • Office 365 limits the type of solutions that can be installed • No full trust solutions, sandbox only • Reduced SharePoint and Web APIs
  • 21. Auto-Classification Challenges in Office 365 • The restrictions in Office 365 pose specific challenges to an auto-classification application • Metadata updates cannot invoke a system update: A problem if you want to update MMP without corrupting the Modified By user and date • Term Store APIs are restricted: Rename Term and Delete Term are not supported and Term GUIDs cannot be specified
  • 22. What is the challenge with OneDrive? What are your users doing, despite availability of enterprise tools? • 89% of 5,187 full-time employees use consumer file sync and storage tools at work, despite the security risks, 25% use three or more consumer/commercial products to get work done • 44% rely on email and memory sticks (Ovum) • Your content is only as safe as your least common denominator user • Do you know what is being saved to OneDrive? • What is your tolerance for risk or loss of confidential information? • In a BYOD world do you know what OneDrive is being synced to? • How do you handle a lost device that is synced to OneDrive? • Several organizations are turning off OneDrive for Business because there is no way to guarantee that what is being posted there is compliant with governance, enterprise policies, and directives • Users may be unaware their My Documents are being auto-synced to OneDrive • Is this the right approach? So, what’s in your OneDrive?
  • 23. What is the Risk with SharePoint, Office 365, and OneDrive? • Loss of confidential information • Proposals • Pricing information • Financial forecasts • Negative reports • Security information and protocols • Trade secrets • Legal liability • PII/PIA/HIPAA • Customer/Confidential information • Corporate Reputation • Can you survive a loss of consumer faith? Can your boss?
  • 24. Isn’t OOTB good enough? • Short answer, No • You don’t know what is being synced or loaded to OneDrive, in fact, your users may not realize what is being synced, so when they save that super secret quarterly earnings report to their My Documents… • Little integration between on-premises and hybrid environments • Cloud is still risky business, unless managed in accordance with enterprise policy • Lack of controls to prevent use and access to ‘non-approved’ cloud services and applications • MSFT has a solution that will notify you that something is amiss, but… • That might work if you only have 20, 30, maybe 100 personnel and OneDrives but can you realistically monitor 90,000 or more in that fashion? • The solution must include additional automation and options such as DRM • The solution must intercept content before it is available in OneDrive
  • 25. What about Microsoft’s Recent DLP Announcement? • Only addresses the cloud • Not hybrid • Not on-premises • The average number of cloud services used by an enterprise came in at 738, 10 times more than what IT typically expects from its employees …and you are going to hire how many more people? • Does not really do much more than a well-crafted search query • Do you have the time to go look at 90,000 OneDrives and ensure Office 365 is not the repository for confidential information? …and you are going to hire how many more people? • Does not stop unauthorized cloud services from being placed in Office 365 or content in OneDrive, which means it is available until an admin goes in and identifies and removes it. That might be hours, days, or never • So, if you were a large pharma company and the FDA wrote a nasty report on your new miracle drug, that report might be in the wild for hours, days, or even a week before it is secured. Is that a risk you can live with?
  • 26. What about Microsoft’s Delve? • We all know what Delve is – right? • Search and discovery tool that automatically delivers the most interesting, most useful and most relevant information from across all of Office 365 using machine learning and artificial intelligence • Delve will ‘guess’ what you are looking for based on your activities • Office Graph is the search engine technology using artificial intelligence, based on Yammer’s Enterprise Graph technology, developed by the previous FAST group located in Oslo • Currently supports Exchange, Yammer, OneDrive for Business, SharePoint Online
  • 27. To use or not? • Only addresses the cloud • Not hybrid • Not on-premise • Works best across distributed teams and workgroups • Personal tool, dependent on end user adoption • Enforcement issues • Caveat: If you don't allow access to the Office Graph, you also disable solutions that are built on top of it, such as Delve, and you remove Delve from the Office 365 global navigation • Does this replace Office 365 search? • No, augments it • Is SharePoint Integration planned? • No • Security concerns • Whistleblower protection • Personal security violations • Accuracy? Artificial Intelligence is great but…
  • 28. Augment the tools Microsoft gives you… Why? • A value-added third-party application able to generate and use intelligent metadata can deliver significant benefits to an organization in terms of productivity, governance, and compliance across SharePoint, Office 365, and OneDrive • What makes this different? • Intelligence to identify and trap content as it is migrated from file shares during the onboarding process • Automation to move the content to a secure location for evaluation • The option to invoke and kick off Information Rights Management • The option and ability to expand this to include Documents of Record • Aligns with governance policies across the organization, file shares, SharePoint on-premise, and SharePoint Online • Most important: intercept and secure the content before it becomes available in OneDrive, and identify unauthorized cloud services or applications that have been loaded into Office 365
  • 29. What can an Enterprise Metadata Enabled Approach Achieve? • Immediate • Supports and facilitates development of your enterprise metadata and content management strategy • Improves search and findability • Intelligent content migration into O365/OneDrive • Auto-classifies all content, regardless of whether IT is aware of its existence • Elimination of end user tagging • Elimination of silos of information • Ongoing • Supports your records management requirements without a hybrid farm, automatic in-place record declaration • Protects your confidential assets, in real-time, removes from search, disables download • Standardizes business processes across all environments • Enterprise policy and governance enforcement, across on-premise and the cloud • What can you solve? • Search • Records management • Data privacy • Migration • Security • eDiscovery • Content management • Collaboration • Business social networking • Text analytics
  • 31. A Few Questions to Ask to Get You Started • How often should a drive or repository be indexed for new content? • Does the system need to perform in real-time? • Should old content be re-evaluated to determine whether it belongs in a different category? • How are classification errors resolved? • Should the user have the ability to override the classification assignment? • How long should deployment and ongoing management take? • How much end user involvement can be eliminated? • How does the system handle vocabulary and/or language ambiguities?
  • 33. Show me the ROI • Create enterprise automated metadata framework/model • Average return on investment is a minimum of 38% and runs as high as 600% (IDC) • Migration: can individuals ensure the right documents are migrated and the sensitive ones are removed? • Apply consistent meaningful metadata to enterprise content • Incorrect meta tags cost an organization $2,500 per user per year, in addition to potential costs for non-compliance (IDC) • Guide users to relevant content with taxonomy navigation • Savings of $8,965 per year per user based on $80,000 salary (Chen & Dumais) • 100% ‘recall’ of content, 35% faster access to content ‘precision’ • Use automatic conceptual metadata generation to improve records management • Eliminate inconsistent end user tagging at $4-$7 per record (Hoovers) • Improve compliance processes, eliminate potential privacy exposures
  • 34. Real World Savings Pique Solutions • The Business Solutions • Search • Records Management • Migration • Data Security • eDiscovery/Litigation Support, FOIA • Information Governance • Text Analytics • Social Tagging • Collaboration • Content Management • Metadata Management
  • 35. Intelligent Metadata Enabled Solutions in Office 365, OneDrive for Business, or a non-Microsoft Cloud Environment
  • 36. Enterprise Search “By itself the search function has limited value. The real value of search and information access technologies is in the ongoing efforts needed to establish effective taxonomies, to index and classify content of all kinds, in order to provide meaningful results.” Tom Eid, Research Vice President Gartner Group
  • 37. Metadata Drives Precision Search • Keyword search captures only 33% of relevant information. Consistent, meaningful metadata ensures all relevant information related to key words will be returned. • Users can’t navigate to information. Taxonomies provide consistent guided navigation for end users to extract relevant information even in external content. Taxonomy navigation is 36%-48% faster and more efficient than lists. • Vocabulary normalization across diverse geographies and cultures causes issues and inhibits sharing of knowledge and expertise due to nomenclature. • Knowledge Workers’ Challenges • 15% of their time is spent duplicating information • 25% of their time is spent searching • 40% cannot easily find the information they require to do their job • The cost to a 500 employee company is $2.4 million per year in inefficiencies and lost productivity (Gartner Group)
  • 38. Case Study – Intelligent Search • Situation: • Not-for-profit organization that contributes to the prevention and cure of cancer • More than 30,000 users • Outpatient treatment programs that record more than 328,300 visits a year • Challenge: • Portal to enable patients to access information relevant to their specific health situations • Accurate, medically sound, and secure information necessary • Aggregate content from internal and external sources • Solution: • conceptClassifier for SharePoint platform • SharePoint 2010 Microsoft FAST Search • Integrated solution with partner Aeturnum • Benefits: • Accuracy of search • Relevance of results • Confidence in data • Control and trust • “With more than 30,000 current users, the MyMoffitt Patient Portal has seen significant growth, and of the new patients that come to Moffitt, 87% register for a patient portal account. All developments and enhancements are about improving the patient experience.” Jennifer Camps, Director of Portal Technologies and Data Management, Moffitt Cancer Center
  • 39. Data Privacy and Cyber Security • Works in conjunction with security applications, or as a stand-alone application • Protects and secures content assets from search and portability such as • Personally Identifiable Information (PII) • Protected Health Information (PHI) • OPSEC • Or any metadata that is deemed confidential by the organization • Identification in real time as content is created or ingested • Same approach as records management • The taxonomy standardizes the process of identifying all possible privacy data exposures – digital and handwritten • Data Breaches and Exposures Challenges • Average cost of a data breach is $6.3 million and ranges from $225 thousand to $35 million • Average cost per exposed record is $197 and ranges from $90-$305 per record • 70% of breaches were due to a mistake or malicious intent by an organization’s own staff • Healthcare provider - $7 million, TJX Companies - $256 million, ValueClick - $2.9 million
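The idea of identifying privacy data as content is created or ingested can be illustrated with a very small sketch. This is not how conceptClassifier works (the slides describe a taxonomy-driven approach); it is plain pattern matching, and the pattern names, the `find_sensitive` and `should_quarantine` helpers, and the two regular expressions are all hypothetical, chosen only to show the shape of an ingest-time check.

```python
import re

# Hypothetical, illustrative patterns only. Real PII/PHI detection needs
# far broader rules and validation than these two expressions.
PATTERNS = {
    "us_ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "email": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b"),
}


def find_sensitive(text):
    """Return a dict mapping pattern name to the matches found in text."""
    hits = {}
    for name, pattern in PATTERNS.items():
        matches = pattern.findall(text)
        if matches:
            hits[name] = matches
    return hits


def should_quarantine(text):
    """True if any pattern matched, i.e. the content should be held back
    from search and download until it has been reviewed."""
    return bool(find_sensitive(text))
```

A check like this would run before content becomes searchable or portable; anything flagged would be routed to a secure location for evaluation rather than published.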
  • 40. Case Study - Search, Data Privacy, Records Management, Migration • Situation: • Budget of $6.9 Billion • Over 60,000 users • Runs 75 hospitals and clinics providing care to more than 2.6 million beneficiaries • Challenge: • Data Privacy • Intelligent Migration • Before and after • Records Management • Pilot project: 72,000 Site Collections, 5,300 retention codes, classify 200,000 documents per hour with minimum resources • Solution: • conceptClassifier for SharePoint platform • Benefits: • Automatic tagging based on organizational vocabulary and descriptors • Automatic routing and the ability to change the SharePoint content type • Eliminated manual tagging; removes content from unauthorized access and portability • No security exposures or breaches in 4 years • “Concept Searching’s Taxonomy Manager provides our Subject Matter Experts with a user friendly web interface enabling the development of controlled vocabularies that can be used to filter search results and auto-classify content to folder structures.” J.D. Whitlock, Lt Col, USAF, MSC, CPHIMS, Air Force Medical Service
  • 41. ‘Intelligent’ Migration • Standalone or in conjunction with migration application • Pre-migration: As content is migrated it is analyzed for organizationally defined descriptors and vocabularies, which will automatically classify the content to taxonomies or optionally the SharePoint Term Store • Security rights maintained • Safeguards document confidentiality and identifies security exposures and records that were previously unknown • Index content • File Shares to File Shares, File Share to SharePoint • SharePoint to SharePoint • Custom Action – from any other repository (.NET code and Web services) • Plug in architecture to custom develop content sources and destination sources • Post migration: Taxonomy hierarchy can be used to improve search as content will be organized by concept, making it possible to find relevant content that previously could not be retrieved • Migration Challenges • 84% of data migration projects fail (Bloor) • 72% of organizations delay migration because it is too risky (Bloor) • 70% of projects reported schedule overruns of about 30% while 64% reported average budget overruns of 16% (Hitachi Data Systems) • Survey respondents rely on end users to validate whether their data migration was successful or not (Enterprise Strategy Group)
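The pre-migration step, analyzing content against organizationally defined descriptors and assigning it to taxonomy terms, can be sketched as follows. This is a deliberately simplified assumption: it uses literal phrase matching, not the concept extraction the slides attribute to the product, and the `TAXONOMY` mapping, its term names, and the `classify` function are hypothetical.

```python
# Hypothetical taxonomy: each term maps to descriptor phrases that an
# organization might define. These entries are invented for illustration.
TAXONOMY = {
    "Finance/Earnings": ["quarterly earnings", "revenue forecast"],
    "HR/Personnel": ["performance review", "salary"],
}


def classify(text, taxonomy=TAXONOMY):
    """Return the sorted list of taxonomy terms whose descriptor phrases
    appear in the text (case-insensitive literal matching)."""
    lowered = text.lower()
    return sorted(
        term
        for term, phrases in taxonomy.items()
        if any(phrase in lowered for phrase in phrases)
    )
```

In a migration pipeline, the returned terms would be written as metadata on the document at its destination, so that the post-migration taxonomy hierarchy can drive search and navigation.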
  • 42. Case Study – Intelligent Migration • Situation: • Multiple Clients • Challenge: • Simply moving content to new location did not provide any benefits • Human error and time were too costly • Quantity of content too great • Solution: • conceptClassifier for SharePoint platform • Benefits: • Cleanses irrelevant and unnecessary documents • Dramatically reduces the time for migration • Eliminates manual intervention • Improves the outcome enabling improvements in: • Search • Records management • Data privacy • eDiscovery and litigation support • Text analytics • Automotive Parts Company: The goal was to improve search for 147,000 business users, which required migrating literally millions of documents. conceptClassifier for SharePoint was used for the pre- and post-migration, for enabling concept-based searching with their existing search engine, and for taxonomy-based search after the migration. conceptClassifier for SharePoint identified 66,000 duplicates out of a total of 270,000 documents, representing a 24% reduction in disk space.
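As a rough illustration of duplicate cleansing before migration, the sketch below flags exact duplicates by hashing file content. It is an assumption for illustration only: the slides do not say how conceptClassifier identifies duplicates, and the `find_duplicates` and `reduction_percent` names are hypothetical. The second helper simply reproduces the 66,000 of 270,000 arithmetic from the case study, counted by document rather than by bytes.

```python
import hashlib


def find_duplicates(documents):
    """documents maps a name to its content as bytes.
    Returns the names whose content exactly repeats an earlier document."""
    seen = {}
    duplicates = []
    for name, content in documents.items():
        digest = hashlib.sha256(content).hexdigest()
        if digest in seen:
            duplicates.append(name)
        else:
            seen[digest] = name
    return duplicates


def reduction_percent(duplicate_count, total_count):
    """Share of documents that are duplicates, as a percentage."""
    return 100.0 * duplicate_count / total_count
```

Exact hashing only catches byte-identical copies; near-duplicates (renamed, reformatted, or lightly edited files) would need a different technique.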
  • 44. Office 365 Case Study • Situation: • Global services firm • SharePoint environment • Over 170,000 users located across the globe - Americas, Europe, Middle East, India, Africa (EMEIA), Asia Pacific, Japan • Challenge: • Ability to communicate real-time with end users and clients regardless of where they reside or how they are connected • Information Governance issues • Solution: • conceptClassifier for SharePoint • conceptClassifier for Office 365 • conceptClassifier for OneDrive for Business • conceptTaxonomyWorkflow • Benefits: • Hybrid environment with Information Governance enforced across the global enterprise – migration, data security, search, content management, records management • Improved communication and access to real-time information • High performance and scalability • One core set of technologies – deploy once, use for multiple applications
  • 46. Freebie If you would like to see a demonstration or ask any further technical questions, please come by Booth 18 for a chance to win Bose headphones!
  • 47. Thank You Don Miller Vice President of Sales Concept Searching donm@conceptsearching.com Twitter @conceptsearch