1) Historically, chemistry relied on a few "trusted sources" for property data like melting points, but technology now allows a more transparent and data-driven approach using multiple sources.
2) An analysis of melting point data for several compounds from various sources found inconsistencies, outliers, and errors that highlighted issues with solely relying on a single source.
3) Making assumptions and data sources open allows for transparency, validation of data, and faster/better scientific progress as evidenced by several examples of collaborative melting point analysis and predictive models.
1. The Impact of Openness on the Quality of Chemistry Data and Implications for Organic Synthesis Drexel Chemistry Mini Symposium Jean-Claude Bradley Associate Professor of Chemistry Drexel University September 29, 2011
19. What is the melting point of 4-benzyltoluene? American Petroleum Institute5 C PHYSPROP-30 C PHYSPROP 125 C peer reviewed journal (2008)97.5 C government database-30 C government database4.58 C
20. The quest to resolve the melting point of 4-benzyltoluene: liquid at room temp and can be frozen <-30C (Evan Curtin)
21. Open Lab Notebook page measuring the melting point of 4-benzyltoluene
26. There are NO FACTS, only measurements embedded within assumptions Open Notebook Science maintains the integrity of data provenance by making assumptions explicit
27. Open Random Forest modeling of Open Melting Point data using CDK descriptors (Andrew Lang) R2 = 0.78, TPSA and nHdon most important
40. Comparison of model with triple validated measurements Straight chain carboxylic acids from 1 to 10 carbons Straight chain alcohols from 1 to 10 carbons
41. Cyclic primary amines from 3 to 6 carbons (cyclobutylamine flagged for validation – only single source available)