Molecule Similarity Based on Fingerprints with RDKit

This section provides a tutorial example on calculating molecule simularity based on fingerprints using RDKit library.

RDKit library provides its own fingerprint generation function that can be used to calculate similarity score between two molecules.

f = rdkit.Chem.rdmolops.RDKFingerprint(m) - Returns the fingerprint using RDKit topological algorithm from a molecule.

s = rdkit.DataStructs.FingerprintSimilarity(f1 f2) - Returns the calculated similarity score between two fingerprints.

Here is a Jupyter Notebook example of calculating molecule similarity with RDKit library.

from rdkit import Chem
from rdkit import DataStructs
ms = [Chem.MolFromSmiles('CCCO'), Chem.MolFromSmiles('CCOC')]
fs = [Chem.RDKFingerprint(x) for x in ms]
s = DataStructs.FingerprintSimilarity(fs[0], fs[1])
i = Chem.Draw.MolsToGridImage(ms, subImgSize=(120,120))
display(i, 'Similarity: '+str(s))
Molecule Similarity Based on RDKit Fingerprints
Molecule Similarity Based on RDKit Fingerprints

Table of Contents

 About This Book

 SMILES (Simplified Molecular-Input Line-Entry System)

 Open Babel: The Open Source Chemistry Toolbox

 Using Open Babel Command: "obabel"

 Generating SVG Pictures with Open Babel

 Substructure Search with Open Babel

 Similarity Search with Open Babel

 Fingerprint Index for Fastsearch with Open Babel

 Stereochemistry with Open Babel

 Command Line Tools Provided by Open Babel

 RDKit: Open-Source Cheminformatics Software

 rdkit.Chem.rdchem - The Core Module

 rdkit.Chem.rdmolfiles - Molecular File Module

 rdkit.Chem.rdDepictor - Compute 2D Coordinates

 rdkit.Chem.Draw - Handle Molecule Images

 Molecule Substructure Search with RDKit

rdkit.Chem.rdmolops - Molecule Operations

 What Is rdkit.Chem.rdmolops Module

Molecule Similarity Based on Fingerprints with RDKit

 Molecule Core and Sidechains Decomposition with RDKit

 R-Group Decomposition with RDKit

 Daylight Fingerprint Generator in RDKit

 Morgan Fingerprint Generator in RDKit

 RDKit Performance on Substructure Search

 Introduction to Molecular Fingerprints

 OCSR (Optical Chemical Structure Recognition)

 AlphaFold - Protein Structure Prediction

 Resources and Tools

 Cheminformatics Related Terminologies

 References

 Full Version in PDF/EPUB