RDKit Performance on Substructure Search

This chapter provides introductions and tutorial examples on substructure search performance with RDKit. Topics include introduction of large molecule dataset; substructure search on one molecule at a time taking a long time; substructure search on with a molecule object library taking a large memory; substructure search by reading molecule SMILES library performing the best.


These sections are omitted from this Web preview version. To view the full content, see information on how to obtain the full version this book.

Large Molecule Datasets for Performance Testing

CPU Info and Usage for Performance Testing

Molecule Construction Performance with RDKit

Substructure Search Performance with RDKit

Substructure Library Search Performance with RDKit

Binary Library Search Performance with RDKit

SMILES Library Search Performance with RDKit

Trusted SMILES Library Search Performance with RDKit

Library File for Substructure Search with RDKit


Takeaways:

Table of Contents

 About This Book

 SMILES (Simplified Molecular-Input Line-Entry System)

 Open Babel: The Open Source Chemistry Toolbox

 Using Open Babel Command: "obabel"

 Generating SVG Pictures with Open Babel

 Substructure Search with Open Babel

 Similarity Search with Open Babel

 Fingerprint Index for Fastsearch with Open Babel

 Stereochemistry with Open Babel

 Command Line Tools Provided by Open Babel

 RDKit: Open-Source Cheminformatics Software

 rdkit.Chem.rdchem - The Core Module

 rdkit.Chem.rdmolfiles - Molecular File Module

 rdkit.Chem.rdDepictor - Compute 2D Coordinates

 rdkit.Chem.Draw - Handle Molecule Images

 Molecule Substructure Search with RDKit

 rdkit.Chem.rdmolops - Molecule Operations

 Daylight Fingerprint Generator in RDKit

 Morgan Fingerprint Generator in RDKit

RDKit Performance on Substructure Search

 Introduction to Molecular Fingerprints

 OCSR (Optical Chemical Structure Recognition)

 AlphaFold - Protein Structure Prediction

 Resources and Tools

 Cheminformatics Related Terminologies

 References

 Full Version in PDF/EPUB