get_uniprot_mapping_df

get_uniprot_mapping_df(version: VersionHint | None = None, *, prefix: Sequence[str] | None = None) pandas.DataFrame[source]

Download and parse the latest ChEMBL-UniProt target mapping TSV file.

Parameters:
  • version – The version number of ChEMBL to get. If none specified, uses latest() to look up the latest.

  • prefix – The directory inside pystow to use

Returns:

A dataframe with four columns:

  1. uniprot_id

  2. chembl_target_id

  3. name, the name from ChEMBL

  4. type, which can have one of the following values:

    • CHIMERIC PROTEIN

    • NUCLEIC-ACID

    • PROTEIN COMPLEX

    • PROTEIN COMPLEX GROUP

    • PROTEIN FAMILY

    • PROTEIN NUCLEIC-ACID COMPLEX

    • PROTEIN-PROTEIN INTERACTION

    • SELECTIVITY GROUP

    • SINGLE PROTEIN