pandasPGS: a Python package for easy retrieval of Polygenic Score Catalog data

Blog Article

Background: The Polygenic Score (PGS) Catalog is a public database dedicated to storing polygenic risk scores. To date, the database includes 5,022 polygenic risk scores associated with 656 different traits. Although the PGS Catalog offers an official representational state transfer (REST) application programming interface (API), there is no ready-made data client tailored to any specific programming language. Researchers must therefore invest time in learning the structure of the REST API and implementing a client in their programming language of choice before they can integrate PGS data into their analytical workflows.

Methods: In this work we introduce pandasPGS, a Python package that provides programmatic access to PGS Catalog data.

When a researcher calls one of its functions, pandasPGS automatically selects the appropriate uniform resource locator (URL) based on the function name and parameters, requests the data, and merges the paginated results. pandasPGS also provides data pre-processing functions: according to the structure of the retrieved data, it can convert the data into several hierarchical pandas.DataFrame objects, which are convenient for further analysis.

Results: This tool allows researchers to easily analyze PGS Catalog data using Python.

It reduces the time researchers need to spend learning the REST API of the PGS Catalog. The source code is available at https://github.com/tianzelab/pandaspgs, and the API documentation is available at https://tianzelab.github.io/pandaspgs/.
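
To make the retrieval steps described under Methods more concrete, here is a minimal sketch of the general technique: build a URL from an endpoint name and parameters, follow pagination links while merging the results, and split nested records into parent and child tables. This is not the pandasPGS implementation or its API. The helper names (build_url, fetch_all, split_tables), the base URL, the example endpoint and parameter, and the assumed page layout with "next" and "results" keys are all illustrative assumptions, and the demonstration runs on fake in-memory pages rather than real HTTP responses.

```python
from urllib.parse import urlencode

# Assumed base URL for illustration only; check the official REST API docs.
BASE_URL = "https://www.pgscatalog.org/rest"


def build_url(endpoint, **params):
    """Map a (hypothetical) endpoint name plus keyword parameters to a URL."""
    query = urlencode({k: v for k, v in params.items() if v is not None})
    url = f"{BASE_URL}/{endpoint.strip('/')}"
    return f"{url}?{query}" if query else url


def fetch_all(url, fetch_json):
    """Follow 'next' links and merge every page's 'results' into one list.

    fetch_json is any callable that takes a URL and returns a decoded page,
    assumed here to look like {"next": <url or None>, "results": [...]}.
    """
    merged = []
    while url:
        page = fetch_json(url)
        merged.extend(page.get("results", []))
        url = page.get("next")
    return merged


def split_tables(records, id_key, child_key):
    """Split nested records into a parent table and a child table.

    Both tables are lists of row dicts; each child row carries the parent's
    id so the two tables can be joined again later.
    """
    parents, children = [], []
    for rec in records:
        parents.append({k: v for k, v in rec.items() if k != child_key})
        for child in rec.get(child_key, []):
            children.append({id_key: rec[id_key], **child})
    return parents, children


# Offline demonstration: two fake pages stand in for real HTTP responses.
first_url = build_url("score/search", trait_id="EFO_0000001")  # hypothetical query
fake_pages = {
    first_url: {
        "next": "page2",
        "results": [{"id": "PGS_FAKE_1", "samples": [{"n": 100}]}],
    },
    "page2": {
        "next": None,
        "results": [{"id": "PGS_FAKE_2", "samples": [{"n": 50}, {"n": 75}]}],
    },
}
records = fetch_all(first_url, fake_pages.__getitem__)
scores, samples = split_tables(records, id_key="id", child_key="samples")
print(len(scores), len(samples))  # prints: 2 3
```

Each list of row dicts can be passed to pandas.DataFrame to obtain one table per level of the hierarchy. In real use, fetch_json would be replaced by a function that performs the HTTP request and decodes the JSON body. Passing the fetcher in as an argument keeps the pagination logic testable without network access.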
