PT - JOURNAL ARTICLE AU - Yeeleng S. Vang AU - Xiaohui Xie TI - HLA class I binding prediction via convolutional neural networks AID - 10.1101/099358 DP - 2017 Jan 01 TA - bioRxiv PG - 099358 4099 - http://biorxiv.org/content/early/2017/01/10/099358.short 4100 - http://biorxiv.org/content/early/2017/01/10/099358.full AB - Many biological processes are governed by protein-ligand interactions. Of such is the recognition of self and nonself cells by the immune system. This immune response process is regulated by the major histocompatibility complex (MHC) protein which is encoded by the human leukocyte antigen (HLA) complex. Understanding the binding potential between MHC and peptides is crucial to our understanding of the functioning of the immune system, which in turns will broaden our understanding of autoimmune diseases and vaccine design.We introduce a new distributed representation of amino acids, named HLA-Vec, that can be used for a variety of downstream proteomic machine learning tasks. We then propose a deep convolutional neural network architecture, named HLA-CNN, for the task of HLA class I-peptide binding prediction. Experimental results show combining the new distributed representation with our HLA-CNN architecture acheives state-of-the-art results in the vast majority of the latest two Immune Epitope Database (IEDB) weekly automated benchmark datasets. Codes are available at https://github.com/uci-cbcl/HLA-bind.