Publications
- Franzosi, R., Dong, W., Hu, Z., Dai, W., Cha, M., Piloto, R., & Wang, G. (2024). Automatic information extraction of the narrative elements who, what, when, and where [Manuscript submitted for publication]. Social Science Computer Review.
- Yang, R., Tong, J., Wang, H., Huang, H., Hu, Z., Li, P., ... & Hong, C. (2025). Enabling inclusive systematic reviews: incorporating preprint articles with large language model-driven evaluations. Journal of the American Medical Informatics Association, ocaf137.
- LeRoy, N. J., Campbell Jr, D. R., Stadick, S., Khoroshevskyi, O., Park, S. H., Hu, Z., & Sheffield, N. C. (2025). Fast, memory-efficient genomic interval tokenizers for modern machine learning. arXiv preprint arXiv:2511.01555.
Presentations
- Huang, H., Tong, J., Hu, Z., Li, Y., Pencina, M., Chen, Y., & Hong, C. Enabling Inclusive Systematic Reviews: Incorporating Preprint Articles based on Semantic Learning and Large Language Model. Poster presentation at the ENAR Spring Meeting, Baltimore, MD, USA. March 2024.
- Xue, B., Khoroshevskyi, O., Stolarczyk, M., Mosquera, J. V., Campbell, D., Hu, Z., Tambe, S., LeRoy, N., Gharavi, E., Duzlevski, O., & Sheffield, N. C. BEDbase: A web application and API for genomic region sets. Poster presentation at the Biological Data Science Conference, Cold Spring Harbor, NY, USA. November 2024.