Se ha denunciado esta presentación.
Se está descargando tu SlideShare. ×

Score-based Approach for Anaphora Resolution in Drug-Drug Interaction Documents

Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio

Eche un vistazo a continuación

1 de 52 Anuncio

Más Contenido Relacionado

Similares a Score-based Approach for Anaphora Resolution in Drug-Drug Interaction Documents (20)

Más de Grupo HULAT (20)

Anuncio

Score-based Approach for Anaphora Resolution in Drug-Drug Interaction Documents

  1. 1. Score-based Approach for Anaphora Resolution in Drug-Drug Interaction Documents Isabel Segura Bedmar, Mario Crespo, César de Pablo Sánchez CS Department, Universidad Carlos III de Madrid 24th June 2009 Saarbrücken, Germany NLDB 2009
  2. 2. What is a Drug-Drug Interaction?
  3. 3.  Medication errors kill 7,000 patients per annum in USA.  16% of them are DDI  High incidence in certain patient groups Things can get complicated...
  4. 4. How do healthcare professionals avoid drug-drug interactions?
  5. 5. Drug interaction Resources
  6. 6. How Information Extraction helps? Aspirin may decrease the effects of probenecid, sulfinpyrazone, and phenylbutazone. DDI ( ASPIRIN , PROBENECID) DDI ( ASPIRIN , SULFINPYRAZONE ) DDI ( ASPIRIN , PHENYLBUTAZONE)
  7. 7. How Anaphora Resolution helps? Triamterene, metformin and amiloride should be co-administered with care as they might increase dofetilide levels.
  8. 8. Triamterene, metformin and amiloride should be co-administered with care as they might increase dofetilide levels. DDI ( DOFETILIDE , TRIAMTERENE) DDI ( DOFETILIDE , METFORMIN ) DDI ( DOFETILIDE , AMILORIDE)
  9. 9. Levofloxacin, a fluoroquinolone, is one of the most commonly prescribed antibiotics in clinical practice. Several case reports have indicated that this drug may signicantly potentiate the anticoagulation effect of warfarin.
  10. 10. Levofloxacin, a fluoroquinolone, is one of the most commonly prescribed antibiotics in clinical practice. Several case reports have indicated that this drug may signicantly potentiate the anticoagulation effect of warfarin. DDI ( LEVOFLOXACIN , WARFARIN )
  11. 11. Precision Recall
  12. 12. Build a corpus to study anaphora resolution in DDI A score-based method to resolve anaphora using semantic info
  13. 13. Building a corpus for DDI DrugBank HTML To Text Wrapper Corpus TXT Drug Name Recognition MetaMap UMLS: Text analysis String Matching Algorithm 2006AA UMLSKS WHOINN affixes DDI Extraction XML annotated with drugs and other biomedical concepts XML annotated with drugs interactions
  14. 14. DrugBank HTML To Text Wrapper Corpus TXT
  15. 15. DrugBank HTML To Text Wrapper Corpus TXT
  16. 16. Building a corpus for DDI DrugBank HTML To Text Wrapper Corpus TXT Drug Name Recognition MetaMap UMLS: Text analysis String Matching Algorithm 2006AA UMLSKS WHOINN affixes DDI Extraction XML annotated with drugs and other biomedical concepts XML annotated with drugs interactions
  17. 17. ...and enriching it for AR Drug Drug Interaction Annotattion XML annotated with drugs and other biomedical concepts XML annotated with drugs interactions Anaphora annotation XML annotated with resolved anaphoric expression 49 articles selected ● 40 sentences on average ● 716 words on average 18.035 phrases with 689 different drugs 331 anaphoric expresions
  18. 18. Personal (it) Reflexives (itself) Relatives (which) Distributives (both) Demonstratives (these) Indefinites (some) 0 20 40 60 80 100 120 140 23 1 120 8 12 8 Distribution of Pronominal Anaphora
  19. 19. Definite Possessives Distributives Demonstratives Indefinites 0 10 20 30 40 50 60 70 37 52 11 58 8 Distribution of Nominal Anaphora
  20. 20. Build a corpus to study anaphora resolution in DDI A score-based method to resolve anaphora using semantic info
  21. 21. Our approach
  22. 22. Our approach Identification of anaphoric expressions
  23. 23. Our approach Identification of anaphoric expressions Selection of candidate antecedents
  24. 24. Our approach Identification of anaphoric expressions Selection of candidate antecedents Scoring
  25. 25. Anaphoric expressions
  26. 26. Anaphoric expressions Pronominal Identify pronouns Exclude first and second personal forms Detect pleonastic it
  27. 27. Anaphoric expressions Pronominal Identify pronouns Exclude first and second personal forms Detect pleonastic it If it is not possible to discontinue the diuretic, the starting dose of trandolapril should be reduced.
  28. 28. Anaphoric expressions Nominal Select PP, NP and UNK Restrict to: ● pharmacological substance (phsu) ● Antibiotic (antb) ● clinical drugs (clnd)
  29. 29. Anaphoric expressions Nominal Select PP, NP and UNK Restrict to: ● pharmacological substance (phsu) ● Antibiotic (antb) ● clinical drugs (clnd) Levofloxacin, a fluoroquinolone, is one of the most commonly prescribed antibiotics in clinical practice. Several case reports have indicated that this drug may significantly potentiate the anticoagulation effect of warfarin.
  30. 30. Anaphoric expressions Nominal Select PP, NP and UNK Restrict to: ● pharmacological substance (phsu) ● Antibiotic (antb) ● clinical drugs (clnd) Possessive + drug properties
  31. 31. Anaphoric expressions Nominal Select PP, NP and UNK Restrict to: ● pharmacological substance (phsu) ● Antibiotic (antb) ● clinical drugs (clnd) Possessive + drug properties Although beta-adrenergic blockers or calcium channel blockers and digoxin may be useful in combination to control atrial brillation,their additive effects on AV node conduction can result in advanced or complete heart block.
  32. 32. Anaphoric expressions Pronominal Identify pronouns Exclude first and second personal forms Detect pleonastic it Nominal Select PP, NP and UNK Restrict to: ● pharmacological substance (phsu) ● Antibiotic (antb) ● clinical drugs (clnd) Possesives + drug properties
  33. 33. Candidate antecedents
  34. 34. Candidate antecedents Pronominal Number agreement (lexical number and coodinative structures)
  35. 35. Candidate antecedents Pronominal Number agreement (lexical number and coodinative structures) As fluoxetine, sertraline and paroxetine inhibit P450 2D6, they may vary in the extent of inhibition.
  36. 36. Candidate antecedents Nominal Number agreement (lexical number and coodinative structures) Semantic type agreement (phsu, antb, cnld)
  37. 37. Candidate antecedents Nominal Number agreement (lexical number and coodinative structures) Semantic type agreement (phsu, antb, cnld) Loracarbef is a carbacephem antibiotic. This antibiotic kills sensitive bacteria by interfering with formation of the bacteria's cell wall while it is growing.
  38. 38. Candidate antecedents Pronominal Number agreement (lexical number and coodinative structures) Nominal Number agreement (lexical number and coodinative structures) Semantic type agreement (phsu, antb, cnld)
  39. 39. Score
  40. 40. Score Pronominal
  41. 41. Score Nominal
  42. 42. Candidate antecedents Pronominal Nominal
  43. 43. Results
  44. 44. Baseline Several studies have focused on Dapsone showing it can interact with many drugs Levofloxacin, a fluoroquinolone, is one of the most commonly prescribed antibiotics in clinical practice. Several case reports have indicated that this drug may signicantly potentiate the anticoagulation effect of warfarin.
  45. 45. Personal Reflexives Relatives Distributives Demonstratives Indefinites 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 26% 100% 82% 18% 0% 16% 52% 100% 96% 80% 14% 61% Baseline Score Pronominal Anaphora F-score
  46. 46. Definite Possessives Distributives Demonstratives Indefinites 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0% 47% 23% 2% 0% 47% 67% 57% 34% 34% Baseline Score Nominal Anaphora F-score
  47. 47. Pronominal Nominal Total 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 66% 18% 44% 85% 50% 69% Baseline Score Global results F-score
  48. 48. Conclusions ● Domain specific resources are needed – Parsing with MMTx – Semantic information ● Future work – Centering and syntactic heuristics – More semantics: WHOINN rules – Evaluation in Medline corpora
  49. 49. Does Anaphora Resolution really helps to detect DDI?
  50. 50. Drug Drug Interaction detection is a promising application for IE and NLP
  51. 51. Castano et al., 2002 Liang and Lin, 2005 Kim et al, 2005 46 abstracts 120 sentences (PP Interactions) Medstract + 100 abstracts Gasperin & Brisco, 2008 5 full articles Biomedical anaphora approaches Scoring approach Centering TheoryScoring approach Probabilistic Bayes Model F = 0.74 F=0.64 pronominal, F=0.59 nominal F=0.92 pronominal, F= 0.78 nominal 28-56% precision, 35-54% recall

Notas del editor

  • sa:
    A drug-drug interaction occurs when one drug influences the
    level or activity of another drug.
    Drug-interactions can be beneficial, in fact, the polytherapy is a common practice of the medicine, in order, to achieve effective treatments.
    (for example, the combination of several antiretrovirals in order to achieve a more pontente antiretroviral for treating of VIH).
    However,...
    -------
    A drug - driug interaction occurs when one drug influences the level of activity of another drug
    Hay otros tipos de interacciones, pero en este trabajo nos centraremos en las drug interactions.
    Drug-food/beverage interactions. For example, mixing alcohol with some drugs may cause you to feel tired or slow your reactions.
    Drug-condition interactions may occur when an existing medical condition makes certain drugs potentially harmful. For example, if you have high blood pressure you could experience an unwanted reaction if you take a nasal decongestant.
    Son un problema real en la seguridad del paciente.
    Explicar por qué las interacciones no se detectan en las pruebas clínicas:
    -Se suelen excluir de las pruebas clínicas: ancianos, niños, embarazadas.
    -Tamaño pequeño de la muestra de personas y duración limitada (no se ven los consecuencias en un mayor tiempo).
    -No son capaces de predecir la situación real: sociedad que toma varios tipos de fármacos de forma concurrentes.
  • Isa:
    However, sometimes are very dangerous and range in severity, including prolonged morbidity and even death.
    Therefore, they have an important impact on patient safety due
    to the fact that they can be quite dangerous, and because
    of their relatively high incidence among certain population
    groups, such as geriatric, hepatic or polydrug patients.
    Several studies have shown that drug interactions accounted for 16.6% of adverse
    drug reactions causing hospitalization, and being thus a direct
    cause of the increase of health care costs.
    I would like to resalt that the Medication errors are among the most common medical errors, and are the cause of the 8% of the deaths in USA.
    ------------------------------------
    The interactions can range in severity, including prolonged morbidity and even death. The
    estimated incidence of drug-drug interactions that have a clinical signicance ranges from 3% to
    20%, depending on how many drugs are taken [Nies, 2001]. The frequency of drug interaction
    increases disproportionally with the increase in the number of drugs in combination. For example,
    only 5% of patients with fewer than six drugs manifested clinical signs of drug interaction;
    while 40% of patients given 16 drugs experienced an adverse drug interaction [Naguib et al.,
    1997].
    In addition, the drug interactions can greatly increase healthcare costs. A study revealed an
  • How to keep up to date?
    Drug interactions are frequently reported in journals of pharmacology
    medical literature is the most effective source
    - assistance reading
    - automatic completion
  • The development of automatic methods for collecting, maintaining and interpreting information on drug-drug interactions is crucial to achieve a real improvement in their early detection. Information Extraction can provide an interesting way to reduce the time spent by health care professionals on reviewing the literature. Nevertheless, only a few approaches have tackled this issue
    This is a very simple example of drug interactions occurring in a text from DrugBank.
  • The first sentence show the three probable interactions with the drug dofetilide.
    In the second example, the drug interaction spans between the two sentences. In the first sentence, the drug 'Levofloxacin” is defined, while, in the second sentence, is described its interaction with warfarin.
  • The first sentence show the three probable interactions with the drug dofetilide.
    In the second example, the drug interaction spans between the two sentences. In the first sentence, the drug 'Levofloxacin” is defined, while, in the second sentence, is described its interaction with warfarin.
  • Todos se centran en el dominio biológico (genes y proteínas).
    El propósito de Kim también es comprobar si la resolución de anaforas mejora la extracción de información en el campo de las interacciones entre proteínas.
    Los primeros tres enfoques son linguisticos y el último machine learning
    El tamaño de los corpus es muy pequeño.
    Revisión de los principales corpus en el dominio biomédico:
    GENIA
    Anotación lingüística y semántica de proteínas e interacciones
    93293
    2000 resúmenes MedLine codificados en XM
    Uno de los más utilizados
    1414

×