
Use and interpret data, models, and theories with integrity, completeness, and accuracy, and make use of the latest technological innovations ethically, responsibly, and fairly.
Overview
In the chemical sciences, data, standards, nomenclature, terminology, and symbols should be original, authentic, accessible, and accurate; models and theories should not distort findings; and the source of data and information should be easy to trace by including clear and accurate metadata (i.e. a description of what the data is, where it comes from, and how it is organised). The challenges raised by the growth in the number of large datasets and the need for automated handling of them make data accuracy and integrity increasingly important.
Integrity and accuracy are critical to all aspects of the chemical sciences, from conceiving an idea, planning the study, collecting and analysing data, and reporting and disseminating the findings, to archiving the data and making it available and useful to others. Documenting these workflows and keeping track of where the results come from are essential for ensuring the integrity and accuracy of data, especially when it is shared or reused.
Examples
Guiding Future Action
To ensure the integrity and accuracy of scientific research in chemistry and chemistry-adjacent areas, all chemists—regardless of their level of experience—should consider doing the following:
- develop a thorough knowledge of current nomenclature and terminology conventions in chemistry, along with the ways that chemical compounds and their data are represented and transmitted
- help establish digital data standards in chemistry in conjunction with the greater community
- commit to testing and demonstrating the reproducibility of their chemical data and its interpretation
- help address the challenges of automated processing and analysis of large data sets
- contribute to the development of standards and methods for the validation of chemical data
- learn how chemical compounds and their data can be made more FAIR
- understand how the CARE and TRUST Principles could be applied to their research

Research data management plans should be living documents that are revisited throughout the lifecycle of a research project. These plans will be dependent on the nature and types of data repositories that are available or chosen, and their security characteristics. The CoreTrustSeal (About – CoreTrustSeal), a certification that a repository meets international standards for being trustworthy, may be useful to researchers and organisations when considering these issues.
In chemistry, the accurate and complete documentation of measurements and results is critical for determining whether research findings can be used for further analysis and modelling. Users of chemistry information need to determine if the variance observed in reported results is of potential scientific interest or an artefact of measurement uncertainty. Multiple sources of random or systematic errors can impact measurement results. The extent to which sources of error can be identified and quantified can provide a level of confidence that is required for many further applications. Formal and robust methods for critical evaluation of reported data are developed and applied by many expert organisations, including IUPAC in conjunction with BIPM (Bureau International des Poids et Mesures—International Bureau of Weights and Measures) and others, to provide the global community with high-quality property values for practical use.5
Questions to Guide Discussion
- How and when is the detail of formal chemical nomenclature best acquired? What are the new challenges that arise during digital manipulation and transmission of chemical structure and identity?
- How do the conventions cope with automated manipulations of large data sets containing chemical structures and other data (and their units)? How might your representations be misunderstood?
- Which standards are currently missing and how are they connected to existing ones?
- What does reproducibility actually mean? How should it be reported? Best results, average, or typical results?
- How do computers and software programs “know” how to interpret data? How much do we rely on context when interpreting a set of data? What does context mean for a computer?
- How can you tell if a data set is faulty or corrupted in some way? Will a computer be able to do that? What are the implications for artificial intelligence of having such data sets?
- How could you envisage your data being used by other chemists? What about nonchemists, scientists from other fields, or even nonscientists?
- How can we determine the need for sensitive data to remain closed? Who would make that distinction? How does data restriction factor into transparency?
- How will the CARE and TRUST Principles impact you personally? Do they apply similarly in all countries/regions?
References
- Wilkinson, M. D.; Dumontier, M.; Aalbersberg, I. J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.-W.; Bonino da Silva Santos, L.; Bourne, P. E.; Bouwman, J.; Brookes, A. J.; Clark, T.; Crosas, M.; Dillo, I.; Dumon, O.; Edmunds, S.; Evelo, C. T.; Finkers, R.; Gonzalez-Beltran, A.; Gray, A. J. G.; Groth, P.; Goble, C.; Grethe, J. S.; Heringa, J.; ’t Hoen, P. A. C.; Hooft, R.; Kuhn, T.; Kok, R.; Kok, J.; Lusher, S. J.; Martone, M. E.; Mons, A.; Packer, A. L.; Persson, B.; Rocca-Serra, P.; Roos, M.; van Schaik, R.; Sansone, S.-A.; Schultes, E.; Sengstag, T.; Slater, T.; Strawn, G.; Swertz, M. A.; Thompson, M.; van der Lei, J.; van Mulligen, E.; Velterop, J.; Waagmeester, A.; Wittenburg, P.; Wolstencroft, K.; Zhao, J.; Mons, B. The FAIR Guiding Principles for Scientific Data Management and Stewardship. Sci. Data 2016, 3, 160018. https://doi.org/10.1038/sdata.2016.18.
- Lin, D.; Crabtree, J.; Dillo, I.; Downs, R. R.; Edmunds, R.; Giaretta, D.; De Giusti, M.; L’Hours, H.; Hugo, W.; Jenkyns, R.; Khodiyar, V.; Martone, M. E.; Mokrane, M.; Navale, V.; Petters, J.; Sierman, B.; Sokolova, D. V.; Stockhause, M.; Westbrook, J. The TRUST Principles for Digital Repositories. Sci. Data 2020, 7 (1), 144. https://doi.org/10.1038/s41597-020-0486-7.
- Carroll, S. R.; Garba, I.; Figueroa-Rodríguez, O. L.; Holbrook, J.; Lovett, R.; Materechera, S.; Parsons, M.; Raseroka, K.; Rodriguez-Lonebear, D.; Rowe, R.; Sara, R.; Walker, J. D.; Anderson, J.; Hudson, M. The CARE Principles for Indigenous Data Governance. Data Sci. J. 2020, 19 (1). https://doi.org/10.5334/dsj-2020-043.
- Cohen, E. R.; Cvitas, T.; Frey, J. G.; Holmstrom, B.; Kuchitsu, K.; Marquardt, R.; Mills, I.; Pavese, F.; Quack, M.; Stohner, J.; Strauss, H.; Takami, M.; Thor, A. J. Quantities, Units, and Symbols in Physical Chemistry, 3rd ed.; IUPAC Green Book; RSC Publishing: Cambridge, UK, 2007. ISBN 978-0-85404-433-7. https://doi.org/10.1039/9781847558039.
- Connelly, N. G.; Damhus, T.; Hartshorn, R. M.; Hutton, A. T. Nomenclature of Inorganic Chemistry: IUPAC Recommendations 2005; IUPAC Red Book; RSC Publishing: Cambridge, UK, 2005. ISBN 0-85404-438-8.
- Favre, H. A.; Powell, W. H. Nomenclature of Organic Chemistry: IUPAC Recommendations and Preferred Names 2013; IUPAC Blue Book; RSC Publishing: Cambridge, UK, 2014. ISBN 978-0-85404-182-4. https://doi.org/10.1039/9781849733069.
- Jones, R. G.; Wilks, E. S.; Metanomski, W. V.; Kahovec, J.; Hess, M.; Stepto, R.; Kitayama, T. Compendium of Polymer Terminology and Nomenclature: IUPAC Recommendations 2008, 2nd ed.; IUPAC Purple Book; RSC Publishing: Cambridge, UK, 2009. ISBN 978-0-85404-491-7. https://doi.org/10.1039/9781847559425.
- Hibbert, D. B. Compendium of Terminology in Analytical Chemistry; IUPAC Orange Book; The Royal Society of Chemistry: Cambridge, UK, 2023. ISBN 978-1-78262-947-4. https://doi.org/10.1039/9781788012881.
- Férard, G.; Dybkaer, R.; Fuentes-Arderiu, X. Compendium of Terminology and Nomenclature of Properties in Clinical Laboratory Sciences: Recommendations 2016, 2nd ed.; IFCC/IUPAC Silver Book; RSC Publishing: Cambridge, UK, 2017. https://doi.org/10.1039/9781782622451.
- McNaught, A. D.; Wilkinson, A. IUPAC Compendium of Chemical Terminology: Gold Book, 2nd ed.; IUPAC: Research Triangle Park, NC, 1997. https://doi.org/10.1351/goldbook.
- Shaw, D. G.; Bruno, I. J.; Chalk, S. J.; Hefter, G. T.; Hibbert, D. B.; Hutchinson, R. A.; Magalhães, M. C. F.; Magee, J. W.; McEwen, L. R.; Rumble, J. R.; Russell, G. T.; Waghorne, W. E.; Walczyk, T.; Wallington, T. J. Chemical Data Evaluation: General Considerations and Approaches for IUPAC Projects and the Chemistry Community (IUPAC Technical Report). Pure Appl. Chem. 2023, 95 (10), 1107–1120. https://doi.org/10.1515/pac-2022-0802

