  1. aclanthology.org

    Jan 12, 2025 - Zhang, Chen; D'Haro, Luis; Tang, Chengguang; Shi, Ke; Tang, Guohua; Li, Haizhou. "xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark." In Bouamor, Houda; Pino, Juan; Bali, Kalika (eds.), Findings of the Association for Computational Linguistics: EMNLP 2023, December 2023 ...
  2. Oct 13, 2023 - Recent advancements in reference-free learned metrics for open-domain dialogue evaluation have been driven by the progress in pre-trained language models and the availability of dialogue data with high-quality human annotations. However, current studies predominantly concentrate on English dialogues, and the generalization of these metrics to other languages has not been fully examined. This ...
  3. ... to the absence of a multilingual dialogue evaluation benchmark. To address the issue, we introduce xDial-Eval, built on top of open-source English dialogue evaluation datasets. xDial-Eval includes 12 turn-level and 6 dialogue-level English datasets, comprising 14930 annotated turns and 8691 annotated dialogues respectively.
  4. ar5iv.labs.arxiv.org

    This paper introduces xDial-Eval, a multilingual dialogue evaluation benchmark featuring 14930 annotated turns and 8691 dialogues in 10 languages. Both automatic and human evaluation validate the high quality of xDial-Eval. Additionally, we examine the performance of BERT-based metrics and emerging LLMs on this benchmark.
  5. mendeley.com

    (2023) Zhang et al. Findings of the Association for Computational Linguistics: EMNLP 2023. Recent advancements in reference-free learned metrics for open-domain dialogue evaluation have been driven by the progress in pre-trained language models and the availability of dialogue data with high-qual...
  6. aclanthology.org

    This is largely due to the absence of a multilingual dialogue evaluation benchmark. To address the issue, we introduce xDial-Eval, built on top of open-source English dialogue evaluation datasets. xDial-Eval includes 12 turn-level and 6 dialogue-level English datasets, comprising 14930 annotated turns and 8691 annotated dialogues respectively.
  7. researchgate.net

    Jan 1, 2023 - Request PDF | On Jan 1, 2023, Chen Zhang and others published xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark | Find, read and cite all the research you need on ResearchGate
  8. gengo.sotaro.io

    Currently, human evaluation is the most reliable way to holistically judge the quality of a dialogue. Approach: they propose to take English dialogue evaluation metrics and generalize them to other languages. Outcome: the proposed metrics outperform OpenAI's ChatGPT in terms of average Pearson correlation over all datasets and languages (a minimal sketch of this correlation computation follows the results list).
  9. Bibliographic details on xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark. CoRR abs/2310.08958 (2023).
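Result 8 reports the paper's headline comparison as an average Pearson correlation between automatic metric scores and human annotations. The sketch below only illustrates that computation under stated assumptions: the dataset names and score values are hypothetical placeholders (not xDial-Eval data), and the unweighted average over datasets is one plausible reading of "average Pearson correlations over all datasets and languages". scipy.stats.pearsonr is a real SciPy function.

    from scipy.stats import pearsonr

    # metric_scores[name] and human_ratings[name] are parallel lists:
    # one automatic metric score and one human rating per dialogue turn.
    # All names and numbers below are made up for illustration.
    metric_scores = {
        "dataset-a (en)": [0.81, 0.42, 0.67, 0.90],
        "dataset-a (zh)": [0.75, 0.39, 0.71, 0.88],
    }
    human_ratings = {
        "dataset-a (en)": [4.0, 2.0, 3.5, 4.5],
        "dataset-a (zh)": [4.5, 2.5, 3.0, 4.0],
    }

    # Per-dataset Pearson r, then an unweighted average over datasets.
    correlations = {
        name: pearsonr(metric_scores[name], human_ratings[name])[0]
        for name in metric_scores
    }
    average_r = sum(correlations.values()) / len(correlations)

    for name, r in sorted(correlations.items()):
        print(f"{name}: r = {r:.3f}")
    print(f"average r = {average_r:.3f}")

A higher average r means the automatic metric's scores track human judgments more closely, which is the sense in which the proposed metrics are said to outperform ChatGPT.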
