https://arxiv.org/abs/2601.04768

LANGSAE EDITING: Improving Multilingual Information Retrieval via Post-hoc Language Identity Removal

Dense retrieval in multilingual settings often searches over mixed-language collections, yet multilingual embeddings encode language identity alongside semantics. This language signal can inflate similarity for same-language pairs and crowd out relevant ev...