Submitted by _Arsenie_Boca_ t3_10vwm8k in MachineLearning
I am looking for papers that inject information into LMs directly using embeddings (without formatting information as text). I find it notoriously hard to search for these paper because they could come from various different domains, so I thought asking here might be a good option to reach people from many different domains.
Some examples I already found are from the domain of knowledge graph augmented LMs: ERNIE https://arxiv.org/abs/1904.09223 K-BERT https://arxiv.org/abs/1909.07606
Prefix Tuning / Prompt Tuning are also somewhat similar to the idea, but they dont depend on any external information.
Can you think of other papers that inject additional information into LMs via embeddings?
wittfm t1_j7jvhoc wrote
Maybe this can help https://www.youtube.com/live/FKsARHV3ZTI they mention the SeFit method which seems similar to what you are looking for.