SlideShare una empresa de Scribd logo
1 de 13
Descargar para leer sin conexión
Chemoinformatics in action:
     some question for audience

 Yuriy Sushko, Sergii Novotarskyi
Practical example
Story:
A company that produces or intends
   to produce some particular
   compound (drug, make up, paint,
   glue, toilet refresher, whatever..) is
   obliged to test, if this compound is
   toxic for human and how toxic it is.
   What are the options to check
   this?



                                            Teuthrin, Cyclopropanecarboxylic acid
Practical example

       Bioassay                      Computer modeling

                                    In silico: using QSAR (QSPR) based
                                    on machine learning to predict
In vivo and in vitro assays with
                                    properties of interest without direct
mice, dogs, rats or other species
                                    experiment.
Option 1: Bioassay
Classical and currently widely used method
  for measuring toxicity is bioassay with
  mice, rats, dogs or other species.

What are advantages and disadvantages
Option 1: Bioassay
For bioassay we would typically need:
• Dozens of mice for checking several concentrations of
  tested compound
• In some assays we need to wait for next generation
• We may need to test against several organisms (rat,
  mouse) and dierent administration routes (oral, skin, IV
  injection)
• Test can take upto several months
• Test would cost upto dozens of thousands dollars
     What if we need to measure toxicity for 100 000 compounds?
Option 2: Modeling
What are the steps required to build
 predictive model for physicochemical or
 biological property?

• Prepare dataset of experimental data
• Choose and calculate molecular
  descriptors
• Apply machine learning method
Molecular descriptors
What is descriptor? Most simple examples?

Descriptor is some numerical property of chemical
  compound.

•   Simplest constitutional descriptors: MW, NA, nDB, ..
•   Molecular properties: LogP, hydrophilic factor, ..
•   Randic molecular profiles
•   Various topological and 3D indices and profiles
Molecular descriptors
         2.54
         4.25
         -5.71
         3.26
         0.57
         -0.07



         1.45
         6.34
         8.28
         2.78
         -5.67
         -2.33



         1.45
         7.34
         8.35
         1.64
         -5.56
         -4.45
Machine learning
What kind of machine learning methods do
  you know?
• Linear regression
• K nearest neighbors (KNN)
• Partial Least Regression
• Neural networks
• Support Vector Machines
Some additional facts
Popular formats for representing molecules
  in databases
• SDF
• SMILES
• INCHI
SDF — a plain text file
benzene
ACD/Labs0812062058                                                                     header
 6   6 0 0 0 0 0       0     0 0     1 V2000
    1.9050  -0.7932        0.0000   C   0 0    0   0   0   0   0   0   0   0   0   0
    1.9050  -2.1232        0.0000   C   0 0    0   0   0   0   0   0   0   0   0   0
    0.7531  -0.1282        0.0000   C   0 0    0   0   0   0   0   0   0   0   0   0
    0.7531  -2.7882        0.0000   C   0 0    0   0   0   0   0   0   0   0   0   0   atom information
   -0.3987  -0.7932        0.0000   C   0 0    0   0   0   0   0   0   0   0   0   0
   -0.3987  -2.1232        0.0000   C   0 0    0   0   0   0   0   0   0   0   0   0
  2 1 1 0 0 0 0
  3 1 2 0 0 0 0
  4 2 2 0 0 0 0
  5 3 1 0 0 0 0                                                                        bond information
  6 4 1 0 0 0 0
  6 5 2 0 0 0 0
 M END
 $$$$
> <Unique_ID>
XCA3464366

> <ClogP>
5.825
                                                                                       tags
> <Vendor>
Sigma

> <Molecular Weight>
499.611
SMILES — a string representation

                     C1=CC=C(C=C1)Br



                     CC(F)F




                     COC(C(Cl)Cl)(F)
                     F
InChI — one more approach
 InChI (international chemical identifier) — a standart, developed by IUPAC
    for a textual identifier of chemical substances


              InChI: InChI=1S/C6H5Br/c7-6-4-2-1-3-5-6/h1-5H
              InChIKey: QARVLSVVCXYDNA-UHFFFAOYSA


             InChI: InChI=1S/C2H4F2/c1-2(3)4/h2H,1H3
             InChIKey: NPNPZTNLOVBDOC-UHFFFAOYSA


              InChI: InChI=1S/C3H4Cl2F2O/c1-8-3(6,7)2(4)5/h2H,1H3
              InChIKey: RFKMCNOHBTXSMU-UHFFFAOYSA

Más contenido relacionado

Destacado

2012 mda navarra marcelo ranzini_ casa vilassar
2012 mda navarra marcelo ranzini_ casa vilassar2012 mda navarra marcelo ranzini_ casa vilassar
2012 mda navarra marcelo ranzini_ casa vilassar
mdanavarra
 

Destacado (14)

Evaluation question 1 Nikitha
Evaluation question 1 Nikitha Evaluation question 1 Nikitha
Evaluation question 1 Nikitha
 
Maria Machlowska i Elżbieta Sądel - "Appium: automatyzacja testów w Mobile"
Maria Machlowska i Elżbieta Sądel - "Appium: automatyzacja testów w Mobile"Maria Machlowska i Elżbieta Sądel - "Appium: automatyzacja testów w Mobile"
Maria Machlowska i Elżbieta Sądel - "Appium: automatyzacja testów w Mobile"
 
INOBACION TECNICA Y DESARROLLO SOSTENIBLE
INOBACION TECNICA Y DESARROLLO SOSTENIBLE INOBACION TECNICA Y DESARROLLO SOSTENIBLE
INOBACION TECNICA Y DESARROLLO SOSTENIBLE
 
Exj 5
Exj 5Exj 5
Exj 5
 
Smith middle school Guthrie
Smith middle school GuthrieSmith middle school Guthrie
Smith middle school Guthrie
 
Selenium
SeleniumSelenium
Selenium
 
Social media strategies in the workplace
Social media strategies in the workplaceSocial media strategies in the workplace
Social media strategies in the workplace
 
2012 mda navarra marcelo ranzini_ casa vilassar
2012 mda navarra marcelo ranzini_ casa vilassar2012 mda navarra marcelo ranzini_ casa vilassar
2012 mda navarra marcelo ranzini_ casa vilassar
 
The Internet Is Dying
The Internet Is DyingThe Internet Is Dying
The Internet Is Dying
 
SORACOM UG Miyagi #1 | IoT通信プラットフォーム SORACOM のご紹介と最新情報
SORACOM UG Miyagi #1 | IoT通信プラットフォーム SORACOM のご紹介と最新情報SORACOM UG Miyagi #1 | IoT通信プラットフォーム SORACOM のご紹介と最新情報
SORACOM UG Miyagi #1 | IoT通信プラットフォーム SORACOM のご紹介と最新情報
 
Samsung galaxy s® iii
Samsung galaxy s® iiiSamsung galaxy s® iii
Samsung galaxy s® iii
 
Quiz prelims
Quiz prelimsQuiz prelims
Quiz prelims
 
Brochure Salvo website
Brochure Salvo websiteBrochure Salvo website
Brochure Salvo website
 
here is the Shift
here is the Shifthere is the Shift
here is the Shift
 

Similar a Chemoinformatics in Action

Robots, Small Molecules & R
Robots, Small Molecules & RRobots, Small Molecules & R
Robots, Small Molecules & R
Rajarshi Guha
 
Chemistry data: Distortion and dissemination in the Internet Era
Chemistry data: Distortion and dissemination in the Internet EraChemistry data: Distortion and dissemination in the Internet Era
Chemistry data: Distortion and dissemination in the Internet Era
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Prediction of pKa from chemical structure using free and open source tools
Prediction of pKa from chemical structure using free and open source toolsPrediction of pKa from chemical structure using free and open source tools
Prediction of pKa from chemical structure using free and open source tools
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 

Similar a Chemoinformatics in Action (20)

ACS Meeting New Orleans 2013 (CINF)
ACS Meeting New Orleans 2013 (CINF)ACS Meeting New Orleans 2013 (CINF)
ACS Meeting New Orleans 2013 (CINF)
 
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
 
A Global Commons for Scientific Data: Molecules and Wikidata
A Global Commons for Scientific Data: Molecules and WikidataA Global Commons for Scientific Data: Molecules and Wikidata
A Global Commons for Scientific Data: Molecules and Wikidata
 
Data integration and building a profile for yourself as an online scientist
Data integration and building a profile for yourself as an online scientistData integration and building a profile for yourself as an online scientist
Data integration and building a profile for yourself as an online scientist
 
The anatomy of a chemical reaction: Dissection by machine learning algorithms
The anatomy of a chemical reaction: Dissection by machine learning algorithmsThe anatomy of a chemical reaction: Dissection by machine learning algorithms
The anatomy of a chemical reaction: Dissection by machine learning algorithms
 
Reading and Writing Molecular File Formats for Data Exchange of Small Molecul...
Reading and Writing Molecular File Formats for Data Exchange of Small Molecul...Reading and Writing Molecular File Formats for Data Exchange of Small Molecul...
Reading and Writing Molecular File Formats for Data Exchange of Small Molecul...
 
|QAB> : Quantum Computing, AI and Blockchain
|QAB> : Quantum Computing, AI and Blockchain|QAB> : Quantum Computing, AI and Blockchain
|QAB> : Quantum Computing, AI and Blockchain
 
EUGM 2013 - Anh Kiet Tran Minh (CNRS): French Academic Compound Library: the ...
EUGM 2013 - Anh Kiet Tran Minh (CNRS): French Academic Compound Library: the ...EUGM 2013 - Anh Kiet Tran Minh (CNRS): French Academic Compound Library: the ...
EUGM 2013 - Anh Kiet Tran Minh (CNRS): French Academic Compound Library: the ...
 
OPERA: A free and open source QSAR tool for predicting physicochemical proper...
OPERA: A free and open source QSAR tool for predicting physicochemical proper...OPERA: A free and open source QSAR tool for predicting physicochemical proper...
OPERA: A free and open source QSAR tool for predicting physicochemical proper...
 
Nirs
NirsNirs
Nirs
 
151 performance of a localized fiber optic
151 performance of a localized fiber optic151 performance of a localized fiber optic
151 performance of a localized fiber optic
 
151 performance of a localized fiber optic
151 performance of a localized fiber optic151 performance of a localized fiber optic
151 performance of a localized fiber optic
 
Substructure Search Face-off
Substructure Search Face-offSubstructure Search Face-off
Substructure Search Face-off
 
Robots, Small Molecules & R
Robots, Small Molecules & RRobots, Small Molecules & R
Robots, Small Molecules & R
 
An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...
 
An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...
 
Chemistry data: Distortion and dissemination in the Internet Era
Chemistry data: Distortion and dissemination in the Internet EraChemistry data: Distortion and dissemination in the Internet Era
Chemistry data: Distortion and dissemination in the Internet Era
 
Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
 
Prediction of pKa from chemical structure using free and open source tools
Prediction of pKa from chemical structure using free and open source toolsPrediction of pKa from chemical structure using free and open source tools
Prediction of pKa from chemical structure using free and open source tools
 
20160219 - M. Agostini - Nuove tecnologie per lo studio del DNA tumorale libe...
20160219 - M. Agostini - Nuove tecnologie per lo studio del DNA tumorale libe...20160219 - M. Agostini - Nuove tecnologie per lo studio del DNA tumorale libe...
20160219 - M. Agostini - Nuove tecnologie per lo studio del DNA tumorale libe...
 

Más de SSA KPI

Germany presentation
Germany presentationGermany presentation
Germany presentation
SSA KPI
 
Grand challenges in energy
Grand challenges in energyGrand challenges in energy
Grand challenges in energy
SSA KPI
 
Engineering role in sustainability
Engineering role in sustainabilityEngineering role in sustainability
Engineering role in sustainability
SSA KPI
 
Consensus and interaction on a long term strategy for sustainable development
Consensus and interaction on a long term strategy for sustainable developmentConsensus and interaction on a long term strategy for sustainable development
Consensus and interaction on a long term strategy for sustainable development
SSA KPI
 
Competences in sustainability in engineering education
Competences in sustainability in engineering educationCompetences in sustainability in engineering education
Competences in sustainability in engineering education
SSA KPI
 
Introducatio SD for enginers
Introducatio SD for enginersIntroducatio SD for enginers
Introducatio SD for enginers
SSA KPI
 

Más de SSA KPI (20)

Germany presentation
Germany presentationGermany presentation
Germany presentation
 
Grand challenges in energy
Grand challenges in energyGrand challenges in energy
Grand challenges in energy
 
Engineering role in sustainability
Engineering role in sustainabilityEngineering role in sustainability
Engineering role in sustainability
 
Consensus and interaction on a long term strategy for sustainable development
Consensus and interaction on a long term strategy for sustainable developmentConsensus and interaction on a long term strategy for sustainable development
Consensus and interaction on a long term strategy for sustainable development
 
Competences in sustainability in engineering education
Competences in sustainability in engineering educationCompetences in sustainability in engineering education
Competences in sustainability in engineering education
 
Introducatio SD for enginers
Introducatio SD for enginersIntroducatio SD for enginers
Introducatio SD for enginers
 
DAAD-10.11.2011
DAAD-10.11.2011DAAD-10.11.2011
DAAD-10.11.2011
 
Talking with money
Talking with moneyTalking with money
Talking with money
 
'Green' startup investment
'Green' startup investment'Green' startup investment
'Green' startup investment
 
From Huygens odd sympathy to the energy Huygens' extraction from the sea waves
From Huygens odd sympathy to the energy Huygens' extraction from the sea wavesFrom Huygens odd sympathy to the energy Huygens' extraction from the sea waves
From Huygens odd sympathy to the energy Huygens' extraction from the sea waves
 
Dynamics of dice games
Dynamics of dice gamesDynamics of dice games
Dynamics of dice games
 
Energy Security Costs
Energy Security CostsEnergy Security Costs
Energy Security Costs
 
Naturally Occurring Radioactivity (NOR) in natural and anthropic environments
Naturally Occurring Radioactivity (NOR) in natural and anthropic environmentsNaturally Occurring Radioactivity (NOR) in natural and anthropic environments
Naturally Occurring Radioactivity (NOR) in natural and anthropic environments
 
Advanced energy technology for sustainable development. Part 5
Advanced energy technology for sustainable development. Part 5Advanced energy technology for sustainable development. Part 5
Advanced energy technology for sustainable development. Part 5
 
Advanced energy technology for sustainable development. Part 4
Advanced energy technology for sustainable development. Part 4Advanced energy technology for sustainable development. Part 4
Advanced energy technology for sustainable development. Part 4
 
Advanced energy technology for sustainable development. Part 3
Advanced energy technology for sustainable development. Part 3Advanced energy technology for sustainable development. Part 3
Advanced energy technology for sustainable development. Part 3
 
Advanced energy technology for sustainable development. Part 2
Advanced energy technology for sustainable development. Part 2Advanced energy technology for sustainable development. Part 2
Advanced energy technology for sustainable development. Part 2
 
Advanced energy technology for sustainable development. Part 1
Advanced energy technology for sustainable development. Part 1Advanced energy technology for sustainable development. Part 1
Advanced energy technology for sustainable development. Part 1
 
Fluorescent proteins in current biology
Fluorescent proteins in current biologyFluorescent proteins in current biology
Fluorescent proteins in current biology
 
Neurotransmitter systems of the brain and their functions
Neurotransmitter systems of the brain and their functionsNeurotransmitter systems of the brain and their functions
Neurotransmitter systems of the brain and their functions
 

Último

Último (20)

Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 

Chemoinformatics in Action

  • 1. Chemoinformatics in action: some question for audience Yuriy Sushko, Sergii Novotarskyi
  • 2. Practical example Story: A company that produces or intends to produce some particular compound (drug, make up, paint, glue, toilet refresher, whatever..) is obliged to test, if this compound is toxic for human and how toxic it is. What are the options to check this? Teuthrin, Cyclopropanecarboxylic acid
  • 3. Practical example Bioassay Computer modeling In silico: using QSAR (QSPR) based on machine learning to predict In vivo and in vitro assays with properties of interest without direct mice, dogs, rats or other species experiment.
  • 4. Option 1: Bioassay Classical and currently widely used method for measuring toxicity is bioassay with mice, rats, dogs or other species. What are advantages and disadvantages
  • 5. Option 1: Bioassay For bioassay we would typically need: • Dozens of mice for checking several concentrations of tested compound • In some assays we need to wait for next generation • We may need to test against several organisms (rat, mouse) and dierent administration routes (oral, skin, IV injection) • Test can take upto several months • Test would cost upto dozens of thousands dollars What if we need to measure toxicity for 100 000 compounds?
  • 6. Option 2: Modeling What are the steps required to build predictive model for physicochemical or biological property? • Prepare dataset of experimental data • Choose and calculate molecular descriptors • Apply machine learning method
  • 7. Molecular descriptors What is descriptor? Most simple examples? Descriptor is some numerical property of chemical compound. • Simplest constitutional descriptors: MW, NA, nDB, .. • Molecular properties: LogP, hydrophilic factor, .. • Randic molecular profiles • Various topological and 3D indices and profiles
  • 8. Molecular descriptors 2.54 4.25 -5.71 3.26 0.57 -0.07 1.45 6.34 8.28 2.78 -5.67 -2.33 1.45 7.34 8.35 1.64 -5.56 -4.45
  • 9. Machine learning What kind of machine learning methods do you know? • Linear regression • K nearest neighbors (KNN) • Partial Least Regression • Neural networks • Support Vector Machines
  • 10. Some additional facts Popular formats for representing molecules in databases • SDF • SMILES • INCHI
  • 11. SDF — a plain text file benzene ACD/Labs0812062058 header 6 6 0 0 0 0 0 0 0 0 1 V2000 1.9050 -0.7932 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0 1.9050 -2.1232 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0 0.7531 -0.1282 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0 0.7531 -2.7882 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0 atom information -0.3987 -0.7932 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0 -0.3987 -2.1232 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0 2 1 1 0 0 0 0 3 1 2 0 0 0 0 4 2 2 0 0 0 0 5 3 1 0 0 0 0 bond information 6 4 1 0 0 0 0 6 5 2 0 0 0 0 M END $$$$ > <Unique_ID> XCA3464366 > <ClogP> 5.825 tags > <Vendor> Sigma > <Molecular Weight> 499.611
  • 12. SMILES — a string representation C1=CC=C(C=C1)Br CC(F)F COC(C(Cl)Cl)(F) F
  • 13. InChI — one more approach InChI (international chemical identifier) — a standart, developed by IUPAC for a textual identifier of chemical substances InChI: InChI=1S/C6H5Br/c7-6-4-2-1-3-5-6/h1-5H InChIKey: QARVLSVVCXYDNA-UHFFFAOYSA InChI: InChI=1S/C2H4F2/c1-2(3)4/h2H,1H3 InChIKey: NPNPZTNLOVBDOC-UHFFFAOYSA InChI: InChI=1S/C3H4Cl2F2O/c1-8-3(6,7)2(4)5/h2H,1H3 InChIKey: RFKMCNOHBTXSMU-UHFFFAOYSA