
Course Materials - Spring 2025

ML711: Intermediate Music AI

TB = Textbook or required reading; REF = Reference or supplemental reading


Type    Title    eBook    Call Number



C. Roads, The computer music tutorial. MIT Press, 1996.


MT56 .R6 2023

D.R. Hofstadter, Gödel, Escher, Bach: An Eternal Golden Braid, Vintage Books, 1979.

 Open Access
QA9.8 .H63 1999

L. Bernstein, The unanswered question: six talks at Harvard. Harvard University Press, 1976.

Open Access

On order
Kingma, D. P., & Welling, M. (2013). Auto-encoding variational Bayes. arXiv preprint arXiv:1312.6114. NA NA
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 30.  Open Access NA  
Roberts, A., Engel, J., Raffel, C., Hawthorne, C., & Eck, D. (2018, July). A hierarchical latent vector model for learning long-term structure in music. In International conference on machine learning (pp. 4364-4373). PMLR. Open Access NA  
Huang, C. Z. A., Vaswani, A., Uszkoreit, J., Shazeer, N., Simon, I., Hawthorne, C., ... & Eck, D. (2018). Music transformer. arXiv preprint arXiv:1809.04281. Open Access NA  
Dai, S., Zhang, Z., & Xia, G. G. (2018). Music style transfer: A position paper. arXiv preprint arXiv:1803.06841. NA  
Yang, R., Wang, D., Wang, Z., Chen, T., Jiang, J., & Xia, G. (2019). Deep music analogy via latent representation disentanglement. arXiv preprint arXiv:1906.03626. NA  
Wang, Z., Zhang, Y., Zhang, Y., Jiang, J., Yang, R., Zhao, J., & Xia, G. (2020). PianoTree VAE: Structured representation learning for polyphonic music. arXiv preprint arXiv:2008.07118. NA
Wang, Z., Wang, D., Zhang, Y., & Xia, G. (2020). Learning interpretable representation for controllable polyphonic music generation. arXiv preprint arXiv:2008.07122. NA  
Wang, Z., Chen, K., Jiang, J., Zhang, Y., Xu, M., Dai, S., ... & Xia, G. (2020). POP909: A pop-song dataset for music arrangement generation. arXiv preprint arXiv:2008.07142.
Open Access
Zhao, J., & Xia, G. (2021). AccoMontage: Accompaniment arrangement via phrase selection and style transfer. arXiv preprint arXiv:2108.11213. NA
Yi, L., Hu, H., Zhao, J., & Xia, G. (2022). AccoMontage2: A complete harmonization and accompaniment arrangement system. arXiv preprint arXiv:2209.00353. NA
Zhao, J., Xia, G., & Wang, Y. (2023). Q&A: Query-Based Representation Learning for Multi-Track Symbolic Music re-Arrangement. arXiv preprint arXiv:2306.01635. NA  
Zhao, J., Xia, G., & Wang, Y. (2023). AccoMontage-3: Full-Band Accompaniment Arrangement via Sequential Style Transfer and Multi-Track Function Prior. arXiv preprint arXiv:2310.16334. NA  
Wang, Z., Xu, D., Xia, G., & Shan, Y. (2022, May). Audio-to-symbolic arrangement via cross-modal music representation learning. In ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 181-185). IEEE.
Open Access
Lin, L., Kong, Q., Jiang, J., & Xia, G. (2021). A unified model for zero-shot music source separation, transcription and synthesis. arXiv preprint arXiv:2108.03456. NA