PubChem linkage & properties

This page summarizes linked PubChem records (via pc_id) for compounds in our database, and shows distributions for common physicochemical descriptors.
Updated live from PostgreSQL
Linked records
130772
from 143920 PubChem rows
Linkage rate
%
compounds with a PubChem link
Distinct PubChem IDs used
145049
unique pc_id referenced

Linkage to PubChem

Share of compounds with a valid PubChem link.

Field completeness in linked PubChem rows

Percent of linked records with a non-empty value.

XlogP distribution

Histogram of PubChem XlogP values (linked records).

Exact mass distribution

Histogram of exact mass (linked records).

tPSA distribution

Histogram of topological polar surface area.

Complexity distribution

Histogram of PubChem Complexity scores.

Exact mass vs XlogP (density bubbles)

2D binned density (bubble size ∝ √count).

Complexity vs tPSA (latest linked sample)

A recent slice of linked records (ordered by fetch time).

XlogP spread by covalent unit count

Candlestick-style summary (P25–P75 box, whiskers to P10/P90).

Top molecular formulas

Most frequent formulas among linked PubChem records.
Formula Count