A transformer-based representation-learning model with unified processing of multimodal input for clinical diagnostics
Description
During the diagnostic process, clinicians leverage multimodal information, such as the chief complaint, medical images and laboratory test results. Deep-learning models for aiding diagnosis have yet to meet this requirement of leveraging multimodal information.
