LLM-Blender | Dongfu Jiang (姜东甫)

LLM-Blender is an ensembling framework that combines the outputs of multiple open-source LLMs, aiming for results that are consistently better than those of any single model. It has two components: PairRanker, which compares candidate outputs pairwise to rank them, and GenFuser, which generatively merges the top-ranked candidates into a single output.

Key contributions:

PairRanker: cross-attention encoder for fine-grained pairwise output comparison
GenFuser: seq2seq model that fuses top-ranked candidates into a single output
MixInstruct benchmark for large-scale pairwise evaluation
State-of-the-art performance across diverse instruction-following tasks
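
The rank-then-fuse flow can be illustrated with a minimal Python sketch. It is not the LLM-Blender implementation: `toy_prefer` is a hypothetical stand-in for the learned PairRanker comparison, counting wins is just one simple way to aggregate pairwise results, and `build_fuser_input` only assembles the kind of text a seq2seq fuser might receive (the prompt layout is an assumption).

```python
from itertools import combinations
from typing import Callable, List


def toy_prefer(instruction: str, a: str, b: str) -> bool:
    """Hypothetical comparator: prefers the candidate with more words.
    A real system would call a trained pairwise ranking model here."""
    return len(a.split()) >= len(b.split())


def rank_by_pairwise_wins(
    instruction: str,
    candidates: List[str],
    prefer: Callable[[str, str, str], bool],
) -> List[int]:
    """Compare every pair of candidates once and return candidate
    indices ordered from most to fewest pairwise wins."""
    wins = [0] * len(candidates)
    for i, j in combinations(range(len(candidates)), 2):
        if prefer(instruction, candidates[i], candidates[j]):
            wins[i] += 1
        else:
            wins[j] += 1
    # sorted() is stable, so ties keep their original order.
    return sorted(range(len(candidates)), key=lambda k: -wins[k])


def build_fuser_input(instruction: str, top_candidates: List[str]) -> str:
    """Assemble an illustrative text input for a seq2seq fuser.
    The layout is an assumption made for this sketch."""
    parts = [f"Instruction: {instruction}"]
    for n, cand in enumerate(top_candidates, start=1):
        parts.append(f"Candidate {n}: {cand}")
    return "\n".join(parts)


instruction = "Name a primary color."
candidates = ["Red", "Red is a primary color", "Red is primary"]

order = rank_by_pairwise_wins(instruction, candidates, toy_prefer)
top_k = [candidates[i] for i in order[:2]]  # keep the two best candidates
fuser_input = build_fuser_input(instruction, top_k)
print(order)
print(fuser_input)
```

With N candidates this performs N(N-1)/2 comparisons, which is the main cost of pairwise ranking compared with scoring each candidate independently.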

Links: GitHub · Paper