Cheminformatics Tutorials - Herong's Tutorial Examples - v2.03, by Herong Yang
RDKit Performance on Substructure Search
This chapter provides introductions and tutorial examples on substructure search performance with RDKit. Topics include introduction of large molecule dataset; substructure search on one molecule at a time taking a long time; substructure search on with a molecule object library taking a large memory; substructure search by reading molecule SMILES library performing the best.
These sections are omitted from this Web preview version. To view the full content, see information on how to obtain the full version this book.
Large Molecule Datasets for Performance Testing
CPU Info and Usage for Performance Testing
Molecule Construction Performance with RDKit
Substructure Search Performance with RDKit
Substructure Library Search Performance with RDKit
Binary Library Search Performance with RDKit
SMILES Library Search Performance with RDKit
Trusted SMILES Library Search Performance with RDKit
Library File for Substructure Search with RDKit
Takeaways:
Table of Contents
SMILES (Simplified Molecular-Input Line-Entry System)
Open Babel: The Open Source Chemistry Toolbox
Using Open Babel Command: "obabel"
Generating SVG Pictures with Open Babel
Substructure Search with Open Babel
Similarity Search with Open Babel
Fingerprint Index for Fastsearch with Open Babel
Stereochemistry with Open Babel
Command Line Tools Provided by Open Babel
RDKit: Open-Source Cheminformatics Software
rdkit.Chem.rdchem - The Core Module
rdkit.Chem.rdmolfiles - Molecular File Module
rdkit.Chem.rdDepictor - Compute 2D Coordinates
rdkit.Chem.Draw - Handle Molecule Images
Molecule Substructure Search with RDKit
rdkit.Chem.rdmolops - Molecule Operations
Daylight Fingerprint Generator in RDKit
Morgan Fingerprint Generator in RDKit
►RDKit Performance on Substructure Search
Introduction to Molecular Fingerprints
OCSR (Optical Chemical Structure Recognition)
AlphaFold - Protein Structure Prediction