This document summarizes work to develop an improved search filter to more rapidly identify reports of randomized controlled trials (RCTs) in Embase for inclusion in the Cochrane Central Register of Controlled Trials (CENTRAL). Methods included developing and validating a sensitive search filter using reference sets of known RCTs. The updated 2015 filter identified RCTs with over 97.6% sensitivity compared to the previous Cochrane filter. Future work includes exploring text mining and crowdsourcing to further improve identification of RCTs for inclusion in CENTRAL.
Improving Access to RCT Reports from Embase
1. Providing Consultancy &
Research in Health Economics
Julie Glanville, York Health Economics Consortium, UK
Gordon Dooley, Metaxis, UK
Anna Noel-Storr, Cochrane Dementia and Cognitive Improvement Group
Ruth Foxlee, Cochrane Editorial Unit
October 2015
Improving rapid access to reports of
RCTs from Embase: innovative
methods to enhance the Cochrane
Central Register of Controlled Trials
(CENTRAL)
2. Providing Consultancy &
Research in Health Economics
Presentation Overview
Background
Objectives
Methods
Results
The future
3. Background
Cochrane systematic reviews rely on the efficient identification of
research evidence, specifically evidence from randomised
controlled trials (RCTs) and quasi randomised studies.
The largest single source of RCTs is the Cochrane Central Register
of Controlled Trials (CENTRAL)
CENTRAL was mainly populated with records from Medline, but
also contained records from Embase
Collaboration identified the need for improved rapid identification of
trials from Embase for inclusion in CENTRAL
4. The project and its
objectives
The Cochrane Collaboration commissioned the Embase update
project in March 2013
Project is undertaken by a consortium of three organisations
the Cochrane Dementia and Cognitive Improvement Group
Metaxis, UK
York Health Economics Consortium, University of York, UK
Objectives
To identify reports of RCTs and controlled clinical trials from
Embase for more rapid availability in CENTRAL
Today I will report on the development of the bespoke search filter
to identify the trials
5. Methods, 1
We developed and validated a sensitive search filter to identify
reports of RCTs
A reference standard of 10,000 randomly selected relevant Embase
reports of RCTs and quasi-RCTs already available in CENTRAL
was compiled.
published 2000-2010
Used Simstat and WordStat to identify terms, phrases and
grouped terms within that reference standard set of records
which could be tested in filters
6. Methods, 2: techniques for
identifying candidate terms
The frequency of terms which appeared in more than 10 records.
Terms were analysed by their location within a record: title,
abstract, EMTREE headings. Also all terms (independent of their
location within a record) were analysed by frequency.
The WordStat phrase finding option was used to identify phrases
which appeared in more than 10 records.
Case occurrence and term frequency–inverse document frequency
(tf*idf) were tested.
WordStat clustering option to identify terms which form groups, i.e.
words which often appear in close proximity to each other.
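The tf*idf weighting described above can be sketched in a few lines of Python. This is a generic illustration with made-up record fragments, not the project's actual Simstat/WordStat pipeline:

```python
import math
from collections import Counter

def tfidf(records):
    """Compute tf*idf scores for each term in each record.

    tf rises with a term's count in a record; idf discounts terms
    that appear across many records in the set.
    """
    n = len(records)
    tokenized = [rec.lower().split() for rec in records]
    # Document frequency: in how many records does each term appear?
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))
    scores = []
    for tokens in tokenized:
        tf = Counter(tokens)
        scores.append({term: count * math.log(n / df[term])
                       for term, count in tf.items()})
    return scores

# Hypothetical abstract fragments standing in for Embase records.
records = [
    "randomized controlled trial of aspirin",
    "randomized crossover study of placebo",
    "case report of aspirin toxicity",
]
scores = tfidf(records)
# "randomized" appears in two of three records, so it scores lower
# than a term unique to one record, such as "crossover".
```

A term present in every record (here "of") scores zero, which is exactly the offsetting effect described in the notes: very common words are down-weighted regardless of how often they occur within a single record.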
7. Methods, 3: testing and
validation
Draft strategies were tested on a second set of 10,000 randomly
selected Embase RCT records from CENTRAL
The best candidate filter was validated against a third set of 10,000
randomly selected Embase RCT records from CENTRAL
We also assessed the performance of the filter against the previous
Cochrane filter
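Validation against a reference standard reduces to a set intersection: sensitivity is the fraction of known RCT records that the filter retrieves. A minimal sketch, using hypothetical accession numbers rather than real Embase records:

```python
def sensitivity(retrieved_ids, reference_ids):
    """Fraction of reference-standard records that the filter retrieves."""
    reference = set(reference_ids)
    found = reference & set(retrieved_ids)
    return len(found) / len(reference)

# Hypothetical accession numbers: the filter finds 9,760 of a
# 10,000-record reference standard, i.e. 97.6% sensitivity.
reference = [f"rec{i}" for i in range(10_000)]
retrieved = reference[:9_760] + ["recX1", "recX2"]  # plus non-reference hits
print(sensitivity(retrieved, reference))  # 0.976
```

Note that extra retrieved records outside the reference standard do not affect sensitivity; they only affect precision and the number needed to read.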
8. Methods, 4: 2015 revision
Cochrane 2014 strategy was revised and a range of exclusion
terms were added
These were identified from the rejected studies
Subject terms and also animal terms
The impact of the exclusions was tested
The revised strategy was adopted from February 2015 onwards
In summer 2015 we developed an additional filter specifically to
remove animal studies presented as conference papers
9. Results (reference standard 3)
The validated search filter identifies reports of RCTs in
Embase with over 97.6% sensitivity
97.6% in records published in 2002 (reference standard 3)
100% in records published in 2010 (reference standard 3)
Number needed to read
156 (records published in 2001)
400 (records published in 2010)
10. Embase Filter (Ovid
interface) 2014
1. Randomized controlled trial/
2. Controlled clinical study/
3. 1 or 2
4. Random$.ti,ab.
5. randomization/
6. intermethod comparison/
7. placebo.ti,ab.
8. (compare or compared or comparison).ti.
9. ((evaluated or evaluate or evaluating or
assessed or assess) and (compare or
compared or comparing or
comparison)).ab.
10. (open adj label).ti,ab.
11. ((double or single or doubly or singly) adj
(blind or blinded or blindly)).ti,ab.
12. double blind procedure/
13. parallel group$1.ti,ab.
14. (crossover or cross over).ti,ab.
15. ((assign$ or match or matched or
allocation) adj5 (alternate or group$1 or
intervention$1 or patient$1 or subject$1
or participant$1)).ti,ab.
16. (assigned or allocated).ti,ab.
17. (controlled adj7 (study or design or
trial)).ti,ab.
18. (volunteer or volunteers).ti,ab.
19. human experiment/
20. trial.ti.
21. or/4-20
22. 21 not 3
11. Process
An analysis of the records retrieved resulted in a tiered
record assessment process
The most obvious RCT reports are fast-tracked into CENTRAL
Animal studies are set to one side for team assessment
The less obvious RCT records are assessed for relevance by
internet crowdsourcing
Record screening software written by Metaxis
Between two and six people assess whether a record is really a
report of an RCT
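The slides do not specify how the two-to-six crowd assessments are combined into a decision. One plausible escalation rule, purely an assumption for illustration, is to stop as soon as the minimum number of assessors agree unanimously, and otherwise keep collecting votes up to the maximum and fall back to a two-thirds majority:

```python
def consensus(votes, min_votes=2, max_votes=6):
    """Combine yes/no crowd votes into 'RCT', 'Reject' or 'Unsure'.

    Assumed rule (not stated in the source): stop early if the first
    min_votes assessors all agree; otherwise collect up to max_votes
    and require a two-thirds majority, else return 'Unsure'.
    """
    tally = []
    for vote in votes[:max_votes]:
        tally.append(vote)
        if len(tally) >= min_votes and len(set(tally)) == 1:
            return "RCT" if tally[0] else "Reject"
    yes = sum(tally)
    if yes >= 2 * len(tally) / 3:
        return "RCT"
    if (len(tally) - yes) >= 2 * len(tally) / 3:
        return "Reject"
    return "Unsure"

print(consensus([True, True]))                  # two assessors agree: RCT
print(consensus([True, False, False, False]))   # clear majority: Reject
```

Records that never reach a clear majority end up in the 'Unsure' pile, which matches the "Screened Unsure" counts reported later in the deck.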
12. Performance against
original Cochrane filter
The Cochrane 2014 filter found 71,448 records that were not
retrieved by the original Cochrane filter:
1000 of the most recent records were obtained for assessment.
9.1% were possibly reports of CCTs or RCTs
If this percentage is extrapolated to the 71,448 unique records retrieved by
the Cochrane 2014 filter, then around 6,500 extra reports of RCTs might be
identified by this filter
The original filter found 988/1000 records that were not retrieved by
the Cochrane 2014 filter: all of these records were downloaded.
3% of these records were possibly reports of controlled clinical trials
The records found by both filters totalled 33,360.
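The roughly 6,500 figure above is a straightforward extrapolation of the 9.1% sample rate to the full set of unique records:

```python
unique_records = 71_448   # found only by the Cochrane 2014 filter
sample_rct_rate = 0.091   # 9.1% of a 1,000-record sample were possible RCTs/CCTs

# Extrapolate the sampled rate to the whole unique-record set.
estimated_extra_rcts = unique_records * sample_rct_rate
print(round(estimated_extra_rcts))  # ~6502, reported as roughly 6,500
```

The estimate assumes the 1,000 most recent records are representative of all 71,448, which is why the slide hedges with "might be identified".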
13. Cochrane 2015 filter
The following two slides show the search
terms which are excluded from the results
of the Cochrane 2014 filter
1. Cochrane 2014 filter
2. Exclusions (2015)
3. 1 NOT 2
14. Cochrane 2015 filter
exclusions, 1
(random$ adj sampl$ adj7 ("cross section$" or questionnaire$1 or
survey$ or database$1)).ti,ab. not (comparative study/ or controlled
study/ or randomi?ed controlled.ti,ab. or randomly assigned.ti,ab.)
(5813)
Cross-sectional study/ not (randomized controlled trial/ or controlled
clinical study/ or controlled study/ or randomi?ed controlled.ti,ab. or
control group$1.ti,ab.) (100831)
(((case adj control$) and random$) not randomi?ed controlled).ti,ab.
(10405)
(Systematic review not (trial or study)).ti. (44089)
(nonrandom$ not random$).ti,ab. (11950)
"Random field$".ti,ab. (1294)
(random cluster adj3 sampl$).ti,ab. (703)
15. Cochrane 2015 filter
(review.ab. and review.pt.) not trial.ti. (480641)
"we searched".ab. and (review.ti. or review.pt.) (13032)
"update review".ab. (64)
(databases adj4 searched).ab. (11423)
(rat or rats or mouse or mice or swine or porcine or murine or sheep
or lambs or pigs or piglets or rabbit or rabbits or cat or cats or dog
or dogs or cattle or bovine or monkey or monkeys or trout or
marmoset$1).ti. and animal experiment/ (819059)
Animal experiment/ not (human experiment/ or human/) (1669138)
((In vitro or invitro) not (invivo or "in vivo")).ti. (239064)
or/1-14 (2553242)
16. Embase processing
January 2014-end Jan 2015 using Cochrane 2014 filter
February 2015-July 2015 using revised filter
                                      Jan 2014 to    Feb 2015 to
                                      31 Jan 2015    July 2015 inclusive
Total retrieved                            153610          78516
Records sent directly into CENTRAL          54282           9607
Screened RCT or CCT                          4324           4515
Screened Reject                             94095          63900
Screened Unsure                               909            494
17. Study identification:
precision
                                      Jan 2014 to    Feb 2015 to
                                      31 Jan 2015    July 2015 inclusive
Precision: all records                     38.15%         17.99%
Precision: screened records                 4.55%          7.01%
NNR: all RCT/CCT records                     2.62           5.56
NNR: screened records only                  21.97          14.26
January 2014-end Jan 2015 using Cochrane 2014 filter
February 2015-July 2015 using revised filter
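The all-records precision and NNR figures follow directly from the processing counts on the previous slide, taking records sent directly into CENTRAL plus screened RCT/CCT records as the relevant set:

```python
def precision_and_nnr(relevant, total):
    """Precision = relevant / total; NNR (number needed to read) = 1 / precision."""
    precision = relevant / total
    return precision, 1 / precision

# Jan 2014 - Jan 2015: 54,282 direct-to-CENTRAL + 4,324 screened RCT/CCT
# out of 153,610 records retrieved.
p, nnr = precision_and_nnr(54_282 + 4_324, 153_610)
print(f"{p:.2%}, NNR {nnr:.2f}")  # 38.15%, NNR 2.62

# Feb 2015 - Jul 2015: 9,607 + 4,515 out of 78,516.
p, nnr = precision_and_nnr(9_607 + 4_515, 78_516)
print(f"{p:.2%}, NNR {nnr:.2f}")  # 17.99%, NNR 5.56
```

The revised 2015 filter trades lower overall precision for a smaller screening burden per identified trial among the screened records, as the NNR rows show.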
18. Summary
Many next steps including exploring text mining options
We have achieved improved currency of Embase record
availability in CENTRAL
There will be fewer irrelevant and duplicate records
Searchers will be able to identify more RCTs more
accurately than previously by a rapid search of
CENTRAL
19. We need help!
Please visit our project website
http://www.metaxis.com/embasepublic/
Feel free to join the crowd!
http://www.metaxis.com/embase/login.php
20. Providing Consultancy &
Research in Health Economics
http://tinyurl.com/yhec-facebook
http://twitter.com/YHEC1
http://www.minerva-network.com/
Thank you
julie.glanville@york.ac.uk
Telephone: +44 1904 324832
Website: www.yhec.co.uk
Editor's notes
The frequency of terms which appeared in more than 10 records. Terms were analysed by their location within a record: title, abstract, EMTREE headings. Also all terms (independent of their location within a record) were analysed by frequency.
The WordStat phrase finding option was used to identify phrases which appeared in more than 10 records.
Case occurrence and term frequency–inverse document frequency (tf*idf) were tested. Case occurrence is the frequency of presence of terms in the body of records. The tf*idf statistic reflects how important a word is to a record in a set of records. The tf*idf value increases proportionally to the number of times a word appears in the record, but is offset by the frequency of the word in the set of records. This helps to take account of the fact that some words are more common than others. The highest frequency terms (Randomized controlled trial/ and Controlled study/), which provided the highest number of relevant records, were identified and removed from the analysis. The yields of terms from the case occurrence analysis and the tf*idf analysis were then compared to identify whether different terms would be highlighted by each approach.
WordStat clustering option to identify terms which form groups, i.e. words which often appear in close proximity to each other.
Each of these analyses generated candidate terms which were then tested in candidate filters, to ascertain how many of the gold standard records they could identify in Ovid Embase. All of the gold standard records were identified in Embase by searching for their unique identifier.
In March 2013 the contract to identify Embase records was awarded to a consortium made up of Metaxis Ltd, the Cochrane Dementia and Cognitive Improvement Group, and York Health Economics Consortium (YHEC). Searches covering January 2011 to December 2013 identified 33,564 unique Embase records and these were published in CENTRAL, January 2014 Issue 1. All these records were identified from a search in Embase (via Ovid SP) using the Emtree terms Randomized Controlled Trial or Controlled Clinical Trial. It is estimated that two thirds of records eligible for CENTRAL (according to CERT guidance) from the backlog have been captured and fed into CENTRAL by this search; work to identify the remaining third (i.e. records not indexed with the RCT or CCT term) is ongoing. The estimates are based on a 'gold standard' set of records (made up of large random samples of 1000 Embase records already in CENTRAL across all years). The record set added in the January 2014 Issue 1 did not include conference publications; work on these is also ongoing.
Records based on a newly developed highly sensitive search strategy will be fed into CENTRAL from January 2014 on a monthly basis. The search strategy currently is:
1 Random$.ti,ab.
2 randomization/
3 intermethod comparison/
4 placebo.ti,ab.
5 (compare or compared or comparison).ti.
6 ((evaluated or evaluate or evaluating or assessed or assess) and (compare or compared or comparing or comparison)).ab.
7 (open adj label).ti,ab.
8 ((double or single or doubly or singly) adj (blind or blinded or blindly)).ti,ab.
9 double blind procedure/
10 parallel group$1.ti,ab.
11 (crossover or cross over).ti,ab.
12 ((match or matched or allocation) adj5 (alternate or group$1 or intervention$1 or patient$1 or subject$1 or participant$1)).ti,ab.
13 (assigned or allocated).ti,ab.
14 (controlled adj7 (study or design or trial)).ti,ab.
15 (volunteer or volunteers).ti,ab.
16 human experiment/
17 trial.ti.
18 or/1-17
19 18 NOT tier 1 results [RCT/ OR CCT/]
20 19 not conference abstract.pt.
21 (mammal/ or marine species/ or nonhuman/ or bird/ or animal experiment/ or exp rodent/ or cattle/) not human/
22 20 not 21
Animal studies
Hi, using the animal studies you sent me I have devised the following. It performs just a little better than the one Anna devised. However, adding in the NOT human/ does mean that it fails to spot many animal studies which Embase has also tagged with HUMAN. My strategy is line 17, Anna's is line 18. Mine finds all of Anna's and some extras. The test on the 33 records you sent me on Sunday shows that the filter removes 9 but misses 24; these are studies tagged with Human as well.
Not sure whether we want to propose another two-tier approach: use line 17 and then run it again without the 'not human/' to give those records a quick eyeball rather than submitting them to the reviewers? If we did that, my strategy removes 32 in total and 1 slips through (a paper about food chemistry).
Ideally we need to test this some more on more result sets.
1 exp experimental organism/ 302378
2 animal tissue/ 726327
3 animal cell/ 697222
4 exp animal disease/ 139836
5 exp carnivore disease/ 25207
6 exp bird/ 97046
7 exp experimental animal welfare/ 2288
8 exp animal husbandry/ 34998
9 animal behavior/ 52234
10 exp animal cell culture/ 8989
11 exp mammalian disease/ 78395
12 exp mammal/ 11005108
13 exp marine species/ 3468
14 nonhuman/ 2884891
15 animal.hw. 2256671
16 or/1-15 12120404
17 16 not human/ 2718150
18 (mammal/ or marine species/ or nonhuman/ or bird/ or animal experiment/ or exp rodent/ or cattle/) not human/ 2421425
19 17 not 18 296725
20 18 not 17 0
21 ("2014092492" or "2014102544" or "2014093898" or "23179110" or "2014105508" or "23402514" or "23263675" or "2014093590" or "2014032170" or "2014086649" or "2014086497" or "2014090976" or "2014098222" or "2014100105" or "2014103920" or "23547003" or "23531829" or "23531823" or "2014034485" or "24043704" or "2014037245" or "2014106987" or "2014103562" or "2014091297" or "2014090223" or "2014094000" or "23775276" or "2014083973" or "2014034971" or "2014099859" or "2014081346" or "2014034212" or "2014081091").an. 33
22 17 and 21 9
23 21 not 22 24