Changzhou University
Interactive segmentation, Vision transformer, Prior tokenization, Cross attention mechanism, Register method