This is an old revision of this page, as edited by Egon Willighagen (talk | contribs) at 07:26, 11 October 2020 (Updated the CDK info). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
Revision as of 07:26, 11 October 2020 by Egon Willighagen (talk | contribs) (Updated the CDK info)(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)An editor has nominated this article for deletion. You are welcome to participate in the deletion discussion, which will decide whether or not to retain it.Feel free to improve the article, but do not remove this notice before the discussion is closed. For more information, see the guide to deletion. Find sources: "List of cheminformatics toolkits" – news · newspapers · books · scholar · JSTOR%5B%5BWikipedia%3AArticles+for+deletion%2FCheminformatics+toolkits%5D%5DAFD |
This article's use of external links may not follow Misplaced Pages's policies or guidelines. Please improve this article by removing excessive or inappropriate external links, and converting useful links where appropriate into footnote references. (September 2020) (Learn how and when to remove this message) |
Cheminformatics toolkits are software development kits that allow cheminformaticians to develop custom computer applications for use in virtual screening, chemical database mining, and structure-activity studies. Toolkits are often used for experimentation with new methodologies. Their most important functions deal with the manipulation of chemical structures and comparisons between structures. Programmatic access is provided to properties of individual bonds and atoms.
Functionality
Toolkits provide the following functionality:
- Read and save structures in various chemistry file formats.
- Determine if one structure is a substructure of another (substructure matching).
- Determine if two structures are equal (exact matching).
- Identification of substructures common to structures in a set (maximal common substructure, MCS).
- Disassemble molecules, splitting into fragments.
- Assemble molecules from elements or submolecules.
- Apply reactions on input reactant structures, resulting in output of reaction product structures.
- Generate molecular fingerprints. Fingerprints are bit-vectors where individual bits correspond to the presence or absence of structural features. The most important use of fingerprints is in indexing of chemistry databases.
List of notable cheminformatics toolkits
Name | License | APIs | Home Page | Notes |
---|---|---|---|---|
CDK | Open source | Java | https://cdk.github.io/ | |
ChemmineR | Open source | R, C++ | http://manuals.bioinformatics.ucr.edu/home/chemminer | |
Enalos KNIME nodes | Open source | KNIME | http://tech.knime.org/community/enalos-nodes | |
Enalos+ KNIME nodes | Proprietary | KNIME | http://enalosplus.novamechanics.com/ | |
Indigo | Open source | C, C#, Java, Python | http://lifescience.opensource.epam.com/indigo | |
MolEngine | Proprietary | .NET | http://www.scilligence.com | |
Molecular Operating Environment (MOE) | Proprietary | Scientific Vector Language | https://web.archive.org/web/20160909172415/http://www.chemcomp.com/MOE-Cheminformatics_and_QSAR.htm | |
OpenBabel | Open source | C, Python, Ruby | http://openbabel.org/ | , |
Helium | Open source | C++ | https://web.archive.org/web/20140407082845/http://www.moldb.net/helium.html | |
RDKit | Open source | Python, C++, Java, Knime | http://www.rdkit.org/ | |
Rcpi | Open source | R | https://bioconductor.org/packages/Rcpi | |
frowns | Open source | Python | http://frowns.sourceforge.net/ | |
OUCH | Open source | Haskell | http://www.pharmash.com/posts/2010-08-02-ouch.html | |
chemf | Open source | Scala | https://github.com/stefan-hoeck/chemf | |
3D-e-Chem | Open source | Python, Java, Knime | https://3d-e-chem.github.io/ | |
SMSD | Creative Commons Attribution | Java | http://www.ebi.ac.uk/thornton-srv/software/SMSD/ | |
Accord SDK | Proprietary | VBA, .NET, PL/SQL | http://accelrys.com/products/datasheets/accord-software-development-kit.pdf | |
CACTVS | Proprietary, free for academia, personal use, public web services | Tcl, C, C++, Python, Knime | http://www.xemistry.com/academic | |
Daylight | Proprietary | C, C++, Java, Fortran | http://www.daylight.com/products/toolkit.html | |
OEChem | Proprietary, free for academia | C++, Python, C#, Java | http://eyesopen.com/ | |
Marvin, JChem | Proprietary, free for academia | Java, .NET, Javascript | http://www.chemaxon.com | |
ChemDoodle API | Proprietary | Java, Javascript | http://www.ichemlabs.com | |
PerlMol | Open source | Perl | https://web.archive.org/web/20120315121757/http://www.perlmol.org/ | |
ADMET Predictor, MedChem Studio, MedChem Designer | Proprietary, free for qualifying academics | C++, KNIME, Pipeline Pilot | http://www.simulations-plus.com | |
CDD Vault | Proprietary, free for CDD Public read-only data | CDD Vault | https://www.collaborativedrug.com/cdd-vault | |
MolecularGraph.jl | MIT License | Julia | https://github.com/mojaie/MolecularGraph.jl |
References
- Jean-Loup Faulon; Andreas Bender (April 2010). Handbook of Chemoinformatics Algorithms. Chapman & Hall. ISBN 978-1420082920.
- Johann Gasteiger (November 2003). Chemoinformatics. Wiley-VCH. ISBN 3527306811.
- Steinbeck C, C.; Han Y; Kuhn S; Horlacher O; Luttmann E; Willighagen E (2003). "The Chemistry Development Kit". J Chemical Inf. Comput. Sci. 43 (2): 493–500. doi:10.1021/ci025584y. PMC 4901983. PMID 12653513.
- Steinbeck C, Christoph; Hoppe C.; Kuhn S.; Floris M.; Guha R.; Willighagen E.L. (2006). "Recent Developments of the Chemistry Development Kit (CDK) - An Open-Source Java Library for Chemo- and Bioinformatics". Curr. Pharm. Des. 12 (17): 2111–2120. doi:10.2174/138161206777585274. hdl:2066/35445. PMID 16796559.
- Willighagen, Egon L.; Mayfield, John W.; Alvarsson, Jonathan; Berg, Arvid; Carlsson, Lars; Jeliazkova, Nina; Kuhn, Stefan; Pluskal, Tomáš; Rojas-Chertó, Miquel; Spjuth, Ola; Torrance, Gilleain; Evelo, Chris T.; Guha, Rajarshi; Steinbeck, Christoph (December 2017). "The Chemistry Development Kit (CDK) v2.0: atom typing, depiction, molecular formulas, and substructure searching". Journal of Cheminformatics. 9 (1): 33. doi:10.1186/s13321-017-0220-4.
{{cite journal}}
: CS1 maint: unflagged free DOI (link) - Cao, Y; Charisi, A; Cheng, LC; Jiang, T; Girke, T (2008). "ChemmineR: A Compound Mining Framework for R." Bioinformatics. 24 (15): 1733–1734. doi:10.1093/bioinformatics/btn307. PMC 2638865. PMID 18596077.
- Wang, Y; Backman, TW; Horan, K; Girke, T (2013). "fmcsR: Mismatch Tolerant Maximum Common Substructure Searching in R." Bioinformatics. 29 (21): 2792–4. doi:10.1093/bioinformatics/btt475. PMID 23962615.
- Requires Knime (http://www.knime.org/)
- Requires KNIME (http://www.knime.org/)
- reads and writes all chemical file formats.
- O’Boyle N; Banck M; James C; Morley C; Vandermeersch T; Hutchison G (2011). "Open babel: an open chemical". Journal of Cheminformatics. 3 (33): 33. doi:10.1186/1758-2946-3-33. PMC 3198950. PMID 21982300.
{{cite journal}}
: CS1 maint: unflagged free DOI (link) - Cao DS, Xiao N, Xu QS, Chen AF (Jan 2015). "Rcpi: R/Bioconductor package to generate various descriptors of proteins, compounds and their interactions". Bioinformatics. 31 (2): 279–281. doi:10.1093/bioinformatics/btu624. PMID 25246429.
- McGuire R, Verhoeven S, Vass M, Vriend G, de Esch IJ, Lusher SJ, Leurs R, Ridder L, Kooistra AJ, Ritschel T, de Graaf C (2017). "3D-e-Chem-VM: Structural cheminformatics research infrastructure in a freely available Virtual Machine". J. Chem. Inf. Model. 57 (2): 115–121. doi:10.1021/acs.jcim.6b00686. PMC 5342320. PMID 28125221.
- Kooistra AJ, Vass M, McGuire R, Leurs R, de Esch IJ, Vriend G, Verhoeven S, de Graaf C (2018). "3D-e-Chem: Structural Cheminformatics Workflows for Computer-Aided Drug Discovery". ChemMedChem. 13 (6): 614–626. doi:10.1002/cmdc.201700754. PMC 5900740. PMID 29337438.
- S. Asad Rahman, Syed; M. Bashton; G. L. Holliday; R. Schrader; J. M. Thornton (2009). "Small Molecule Subgraph Detector (SMSD) Toolkit". Journal of Cheminformatics. 1 (12): 12. doi:10.1186/1758-2946-1-12. PMC 2820491. PMID 20298518.
{{cite journal}}
: CS1 maint: unflagged free DOI (link) - Novel web-based tools combining chemistry informatics, biology and social networks for drug discovery. Hohman M, Gregory K, Chibale K, Smith PJ, Ekins S, Bunin B Drug Discov Today. 2009 Mar;14(5-6):261-70.