get_uniprot_mapping_df

get_uniprot_mapping_df(version=None, *, prefix=None)[source]

Download and parse the latest ChEMBL-UniProt target mapping TSV file.

Parameters:
  • version (Optional[str]) – The version number of ChEMBL to get. If none specified, uses latest() to look up the latest.

  • prefix (Optional[Sequence[str]]) – The directory inside pystow to use

Return type:

DataFrame

Returns:

A dataframe with four columns:

  1. uniprot_id

  2. chembl_target_id

  3. name, the name from ChEMBL

  4. type, which can have one of the following values:

    • CHIMERIC PROTEIN

    • NUCLEIC-ACID

    • PROTEIN COMPLEX

    • PROTEIN COMPLEX GROUP

    • PROTEIN FAMILY

    • PROTEIN NUCLEIC-ACID COMPLEX

    • PROTEIN-PROTEIN INTERACTION

    • SELECTIVITY GROUP

    • SINGLE PROTEIN