Cheminformatics Tutorials - Herong's Tutorial Examples - v2.01, by Herong Yang
Molecule Similarity Based on Fingerprints with RDKit
This section provides a tutorial example on calculating molecule simularity based on fingerprints using RDKit library.
RDKit library provides its own fingerprint generation function that can be used to calculate similarity score between two molecules.
f = rdkit.Chem.rdmolops.RDKFingerprint(m) - Returns the fingerprint using RDKit topological algorithm from a molecule.
s = rdkit.DataStructs.FingerprintSimilarity(f1 f2) - Returns the calculated similarity score between two fingerprints.
Here is a Jupyter Notebook example of calculating molecule similarity with RDKit library.
from rdkit import Chem from rdkit import DataStructs ms = [Chem.MolFromSmiles('CCCO'), Chem.MolFromSmiles('CCOC')] fs = [Chem.RDKFingerprint(x) for x in ms] s = DataStructs.FingerprintSimilarity(fs[0], fs[1]) i = Chem.Draw.MolsToGridImage(ms, subImgSize=(120,120)) display(i, 'Similarity: '+str(s))
Table of Contents
SMILES (Simplified Molecular-Input Line-Entry System)
Open Babel: The Open Source Chemistry Toolbox
Using Open Babel Command: "obabel"
Generating SVG Pictures with Open Babel
Substructure Search with Open Babel
Similarity Search with Open Babel
Fingerprint Index for Fastsearch with Open Babel
Stereochemistry with Open Babel
Command Line Tools Provided by Open Babel
RDKit: Open-Source Cheminformatics Software
rdkit.Chem.rdchem - The Core Module
rdkit.Chem.rdmolfiles - Molecular File Module
rdkit.Chem.rdDepictor - Compute 2D Coordinates
rdkit.Chem.Draw - Handle Molecule Images
Molecule Substructure Search with RDKit
►rdkit.Chem.rdmolops - Molecule Operations
What Is rdkit.Chem.rdmolops Module
►Molecule Similarity Based on Fingerprints with RDKit
Molecule Core and Sidechains Decomposition with RDKit
R-Group Decomposition with RDKit
Daylight Fingerprint Generator in RDKit
Morgan Fingerprint Generator in RDKit
RDKit Performance on Substructure Search
Introduction to Molecular Fingerprints
OCSR (Optical Chemical Structure Recognition)
AlphaFold - Protein Structure Prediction