Enviar búsqueda
Cargar
Text prospecting
•
Descargar como ODP, PDF
•
1 recomendación
•
446 vistas
S
singingfish
Seguir
Lightning talk at OSDC 2008, Sydney
Leer menos
Leer más
Tecnología
Denunciar
Compartir
Denunciar
Compartir
1 de 16
Descargar ahora
Recomendados
SQL: Why 'Little Bobby Tables' is funny
SQL: Why 'Little Bobby Tables' is funny
Erik Tank
Briney - Leveling Up Data Management - Sept 8
Briney - Leveling Up Data Management - Sept 8
National Information Standards Organization (NISO)
Paul2 ecn 2012
Paul2 ecn 2012
ECNOfficer
"Hands Off! Best Practices for Code Hand Offs"
"Hands Off! Best Practices for Code Hand Offs"
Naomi Dushay
2019 03 05_biological_databases_part3_v_upload
2019 03 05_biological_databases_part3_v_upload
Prof. Wim Van Criekinge
Information Retrieval - Data Science Bootcamp
Information Retrieval - Data Science Bootcamp
Kais Hassan, PhD
PHP - Introduction to PHP MySQL Joins and SQL Functions
PHP - Introduction to PHP MySQL Joins and SQL Functions
Vibrant Technologies & Computers
Build Your Own World Class Directory Search From Alpha to Omega
Build Your Own World Class Directory Search From Alpha to Omega
Ravi Mynampaty
Recomendados
SQL: Why 'Little Bobby Tables' is funny
SQL: Why 'Little Bobby Tables' is funny
Erik Tank
Briney - Leveling Up Data Management - Sept 8
Briney - Leveling Up Data Management - Sept 8
National Information Standards Organization (NISO)
Paul2 ecn 2012
Paul2 ecn 2012
ECNOfficer
"Hands Off! Best Practices for Code Hand Offs"
"Hands Off! Best Practices for Code Hand Offs"
Naomi Dushay
2019 03 05_biological_databases_part3_v_upload
2019 03 05_biological_databases_part3_v_upload
Prof. Wim Van Criekinge
Information Retrieval - Data Science Bootcamp
Information Retrieval - Data Science Bootcamp
Kais Hassan, PhD
PHP - Introduction to PHP MySQL Joins and SQL Functions
PHP - Introduction to PHP MySQL Joins and SQL Functions
Vibrant Technologies & Computers
Build Your Own World Class Directory Search From Alpha to Omega
Build Your Own World Class Directory Search From Alpha to Omega
Ravi Mynampaty
Nicholas stern at canberra press club
Nicholas stern at canberra press club
singingfish
The dog and the snail
The dog and the snail
Patrizia Tirel
Ws Rosabianca
Ws Rosabianca
Patrizia Tirel
Learning Morse at Interesting 2009
Learning Morse at Interesting 2009
Tim Duckett
Mmf D Slides Bis
Mmf D Slides Bis
Patrizia Tirel
Balto, Lucky Bear!
Balto, Lucky Bear!
Patrizia Tirel
The Cowardly Test-o-Phobe's Guide To Testing
The Cowardly Test-o-Phobe's Guide To Testing
Tim Duckett
MojoMojo - the Elegant wiki, Catalyst-powered
MojoMojo - the Elegant wiki, Catalyst-powered
Dan Dascalescu
RML Rendezvous - Zotero
RML Rendezvous - Zotero
National Network of Libraries of Medicine, Pacific Northwest Region
Unlock user behavior with 87 Million events using Hudi, StarRocks & MinIO
Unlock user behavior with 87 Million events using Hudi, StarRocks & MinIO
nadine39280
Introduction to libre « fulltext » technology
Introduction to libre « fulltext » technology
Robert Viseur
Flow, RefWorks, Mendely, Zotero: Citation Management Tools For Research
Flow, RefWorks, Mendely, Zotero: Citation Management Tools For Research
John Pell
Databases evolution in CulturePlex Lab
Databases evolution in CulturePlex Lab
Javier de la Rosa
Code as Data workshop: Using source{d} Engine to extract insights from git re...
Code as Data workshop: Using source{d} Engine to extract insights from git re...
source{d}
Integrating a Domain Ontology Development Environment and an Ontology Search ...
Integrating a Domain Ontology Development Environment and an Ontology Search ...
Takeshi Morita
RMLL 2013 : Build Your Personal Search Engine using Crawlzilla
RMLL 2013 : Build Your Personal Search Engine using Crawlzilla
Jazz Yao-Tsung Wang
Optimizing Application Architecture (.NET/Java topics)
Optimizing Application Architecture (.NET/Java topics)
Ravi Okade
ProjectHub
ProjectHub
Sematext Group, Inc.
Fedora Overview
Fedora Overview
eposthumus
Jaoo irony
Jaoo irony
Nick Hodge
Accesso ai dati con Azure Data Platform
Accesso ai dati con Azure Data Platform
Luca Di Fino
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application Scenarios
EUCLID project
Más contenido relacionado
Destacado
Nicholas stern at canberra press club
Nicholas stern at canberra press club
singingfish
The dog and the snail
The dog and the snail
Patrizia Tirel
Ws Rosabianca
Ws Rosabianca
Patrizia Tirel
Learning Morse at Interesting 2009
Learning Morse at Interesting 2009
Tim Duckett
Mmf D Slides Bis
Mmf D Slides Bis
Patrizia Tirel
Balto, Lucky Bear!
Balto, Lucky Bear!
Patrizia Tirel
The Cowardly Test-o-Phobe's Guide To Testing
The Cowardly Test-o-Phobe's Guide To Testing
Tim Duckett
Destacado
(7)
Nicholas stern at canberra press club
Nicholas stern at canberra press club
The dog and the snail
The dog and the snail
Ws Rosabianca
Ws Rosabianca
Learning Morse at Interesting 2009
Learning Morse at Interesting 2009
Mmf D Slides Bis
Mmf D Slides Bis
Balto, Lucky Bear!
Balto, Lucky Bear!
The Cowardly Test-o-Phobe's Guide To Testing
The Cowardly Test-o-Phobe's Guide To Testing
Similar a Text prospecting
MojoMojo - the Elegant wiki, Catalyst-powered
MojoMojo - the Elegant wiki, Catalyst-powered
Dan Dascalescu
RML Rendezvous - Zotero
RML Rendezvous - Zotero
National Network of Libraries of Medicine, Pacific Northwest Region
Unlock user behavior with 87 Million events using Hudi, StarRocks & MinIO
Unlock user behavior with 87 Million events using Hudi, StarRocks & MinIO
nadine39280
Introduction to libre « fulltext » technology
Introduction to libre « fulltext » technology
Robert Viseur
Flow, RefWorks, Mendely, Zotero: Citation Management Tools For Research
Flow, RefWorks, Mendely, Zotero: Citation Management Tools For Research
John Pell
Databases evolution in CulturePlex Lab
Databases evolution in CulturePlex Lab
Javier de la Rosa
Code as Data workshop: Using source{d} Engine to extract insights from git re...
Code as Data workshop: Using source{d} Engine to extract insights from git re...
source{d}
Integrating a Domain Ontology Development Environment and an Ontology Search ...
Integrating a Domain Ontology Development Environment and an Ontology Search ...
Takeshi Morita
RMLL 2013 : Build Your Personal Search Engine using Crawlzilla
RMLL 2013 : Build Your Personal Search Engine using Crawlzilla
Jazz Yao-Tsung Wang
Optimizing Application Architecture (.NET/Java topics)
Optimizing Application Architecture (.NET/Java topics)
Ravi Okade
ProjectHub
ProjectHub
Sematext Group, Inc.
Fedora Overview
Fedora Overview
eposthumus
Jaoo irony
Jaoo irony
Nick Hodge
Accesso ai dati con Azure Data Platform
Accesso ai dati con Azure Data Platform
Luca Di Fino
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application Scenarios
EUCLID project
ApacheCon NA 2011 report
ApacheCon NA 2011 report
Koji Kawamura
Graph Databases and Web Frameworks (NodeJS, AngularJS, GridFS, OpenLink Virtu...
Graph Databases and Web Frameworks (NodeJS, AngularJS, GridFS, OpenLink Virtu...
João Rocha da Silva
Zotero And Refworks
Zotero And Refworks
karindalziel
NeXML - phylogenetic data as XML
NeXML - phylogenetic data as XML
Rutger Vos
Oxford Common File Layout (OCFL)
Oxford Common File Layout (OCFL)
Simeon Warner
Similar a Text prospecting
(20)
MojoMojo - the Elegant wiki, Catalyst-powered
MojoMojo - the Elegant wiki, Catalyst-powered
RML Rendezvous - Zotero
RML Rendezvous - Zotero
Unlock user behavior with 87 Million events using Hudi, StarRocks & MinIO
Unlock user behavior with 87 Million events using Hudi, StarRocks & MinIO
Introduction to libre « fulltext » technology
Introduction to libre « fulltext » technology
Flow, RefWorks, Mendely, Zotero: Citation Management Tools For Research
Flow, RefWorks, Mendely, Zotero: Citation Management Tools For Research
Databases evolution in CulturePlex Lab
Databases evolution in CulturePlex Lab
Code as Data workshop: Using source{d} Engine to extract insights from git re...
Code as Data workshop: Using source{d} Engine to extract insights from git re...
Integrating a Domain Ontology Development Environment and an Ontology Search ...
Integrating a Domain Ontology Development Environment and an Ontology Search ...
RMLL 2013 : Build Your Personal Search Engine using Crawlzilla
RMLL 2013 : Build Your Personal Search Engine using Crawlzilla
Optimizing Application Architecture (.NET/Java topics)
Optimizing Application Architecture (.NET/Java topics)
ProjectHub
ProjectHub
Fedora Overview
Fedora Overview
Jaoo irony
Jaoo irony
Accesso ai dati con Azure Data Platform
Accesso ai dati con Azure Data Platform
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application Scenarios
ApacheCon NA 2011 report
ApacheCon NA 2011 report
Graph Databases and Web Frameworks (NodeJS, AngularJS, GridFS, OpenLink Virtu...
Graph Databases and Web Frameworks (NodeJS, AngularJS, GridFS, OpenLink Virtu...
Zotero And Refworks
Zotero And Refworks
NeXML - phylogenetic data as XML
NeXML - phylogenetic data as XML
Oxford Common File Layout (OCFL)
Oxford Common File Layout (OCFL)
Último
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
hans926745
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
Rafal Los
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
Safe Software
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
ThousandEyes
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
Delhi Call girls
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
apidays
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
Michael W. Hawkins
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
The Digital Insurer
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
Puma Security, LLC
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Roshan Dwivedi
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
wesley chun
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
Delhi Call girls
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
Gabriella Davis
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
Enterprise Knowledge
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
Anna Loughnan Colquhoun
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
Delhi Call girls
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
Maria Levchenko
Último
(20)
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
Text prospecting
1.
Text Digging
Prospecting With Zotero ( http://zotero.org ) And Perl (http://catalystframework.org)
2.
Stores documents and
metadata Scrapes from web, academic databases, Google Scholar Zotero
3.
Zotero
4.
Timeline view (MIT
Smile project)
5.
Perl Zotero database
– 43 tables
6.
DBIx::Class::Schema::Loader Firefox speaks
SQLite DBIx::Class Speaks SQLite Zotero DB: 43 Tables DBIC::Schema::Loader infers relationships perfectly
7.
Index Store Zotero
has it's own limited fulltext index Zotero::Meta extends this with Keyphrases (Lingua::EN::Tagger) Entities (Net::Calais or others)
8.
Catalyst, Template, Jquery,
Emastic (css framework) “Pretty” Browsable Index
9.
Browse keywords
10.
Browse related keywords
11.
View Documents
12.
View Text Snippets
13.
Supported Platforms Anywhere
that Perl and Firefox run
14.
Windows Support Hostile
environment - managed desktops - unresponsive support staff Solution: - Portable Firefox - Portable Strawberry Perl
15.
Cat In A
Box On A Stick (insert picture here) (is Dr Seus out of copyright yet?)
16.
Open source release
early 2009 may be licence issues X-( doesn't have a name yet
Descargar ahora