RAGBench Benchmark for evaluating RAG performance.This benchmark uses the rungalileo/ragbench dataset to evaluate
retrieval-augmented generation (RAG) systems. It measures context
relevancy and faithfulness metrics as described in
https://arxiv.org/abs/2407.11005.Parameters:
processes (int, optional): Number of processes for parallel processing.
subset (str, optional): Dataset subset to use (e.g., “hotpotqa”).
split (str, optional): Dataset split to use (e.g., “test”).