ELSA Benchmark Datasets site:arxiv.org at DuckDuckGo

Only showing results from arxiv.org
Clear filter to show all search results
Clear Filter
arxiv.org
Clear filter to show all search results Hide site from these results
Share feedback about this site
arXiv.org
https://arxiv.org › pdf › 2406.01551
ELSA: Evaluating Localization of Social Activities in Urban Streets
propose a new benchmark dataset for Evaluating Localization of Social Activities (ELSA) in urban street images. ELSA draws on theoretical frameworks in urban sociology and design. While majority of action recognition datasets are collected in controlled settings, we use in-the-wild street-level imagery, where the size of
arxiv.org
Clear filter to show all search results Hide site from these results
Share feedback about this site
arXiv.org
https://arxiv.org › pdf › 2406.01551v2
arXiv:2406.01551v2 [cs.CV] 5 Dec 2024
et al. [10] introduced JRDB-Act, a video dataset for group-based social activities in university campus scenes. In con-trast, ELSA focuses on the more challenging task of local-izing social activities in images, where models must infer activities from a snapshot without the temporal cues avail-able in videos. 3. ELSA: A Benchmark for Evaluating ...
arxiv.org
Clear filter to show all search results Hide site from these results
Share feedback about this site
arXiv.org
https://arxiv.org › html › 2406.01551
ELSA: Evaluating Localization of Social Activities - arXiv.org
Jun 3, 2024In response to these challenges, we propose ELSA, a new benchmark dataset for Evaluating Localization of Social Activities in Urban Streets, using closed as well as open-vocabulary object detection models in recognizing and localizing human activity in urban streets from still images. We employ a multi-labeling scheme and define 34 unique ...
arxiv.org
Clear filter to show all search results Hide site from these results
Share feedback about this site
arXiv.org
https://arxiv.org › abs › 2410.24100
Benchmark Data Repositories for Better Benchmarking
Oct 31, 2024In machine learning research, it is common to evaluate algorithms via their performance on standard benchmark datasets. While a growing body of work establishes guidelines for -- and levies criticisms at -- data and benchmarking practices in machine learning, comparatively less attention has been paid to the data repositories where these datasets are stored, documented, and shared. In this ...
arxiv.org
Clear filter to show all search results Hide site from these results
Share feedback about this site
arXiv.org
https://arxiv.org › html › 2410.24100v1
Benchmark Data Repositories for Better Benchmarking - arXiv.org
Oct 31, 2024Abstract. In machine learning research, it is common to evaluate algorithms via their performance on standard benchmark datasets. While a growing body of work establishes guidelines for—and levies criticisms at—data and benchmarking practices in machine learning, comparatively less attention has been paid to the data repositories where these datasets are stored, documented, and shared.
arxiv.org
Clear filter to show all search results Hide site from these results
Share feedback about this site
arXiv.org
https://arxiv.org › abs › 2410.10687
Building a Multivariate Time Series Benchmarking Datasets Inspired by ...
Oct 14, 2024Time series analysis has become increasingly important in various domains, and developing effective models relies heavily on high-quality benchmark datasets. Inspired by the success of Natural Language Processing (NLP) benchmark datasets in advancing pre-trained models, we propose a new approach to create a comprehensive benchmark dataset for time series analysis. This paper explores the ...
arxiv.org
Clear filter to show all search results Hide site from these results
Share feedback about this site
arXiv.org
https://arxiv.org › pdf › 1806.02557.pdf
PDF Emoji-Powered Representation Learning for Cross-Lingual Sentiment ...
We evaluate the performance of ELSA on a benchmark Amazon review dataset that has been used in various cross-lingual sentiment classification studies [50, 58, 62]. The benchmark dataset covers nine tasks combined from three target languages (i.e., Japanese, French, and German) and three domains (i.e., book, DVD, and mu-sic).
arxiv.org
Clear filter to show all search results Hide site from these results
Share feedback about this site
arXiv.org
https://arxiv.org › abs › 2412.13102
AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
Dec 17, 20243) Dynamic. The domains and languages covered by AIR-Bench are constantly augmented to provide an increasingly comprehensive evaluation benchmark for community developers. We develop a reliable and robust data generation pipeline to automatically create diverse and high-quality evaluation datasets based on real-world corpora.
arxiv.org
Clear filter to show all search results Hide site from these results
Share feedback about this site
arXiv.org
https://arxiv.org › pdf › 2103.15296
Abstract arXiv:2103.15296v2 [cs.CV] 3 Jan 2022
•Our model, Elsa, achieves the best or comparable de-tection performance over multiple benchmark datasets, despite leveraging only a small set of labeled samples. • Elsa is robust to challenging settings, where the train-ing set contains unknown anomalies (Scenario-2 in experiments). Such property is important in real-world environments. 2.
arxiv.org
Clear filter to show all search results Hide site from these results
Share feedback about this site
arXiv.org
https://arxiv.org › pdf › 2304.14241.pdf
PDF arXiv:2304.14241v1 [cs.CL] 27 Apr 2023
datasets that are based on the Norwegian Review Corpus - NoReC (Velldal et al.,2018) - ﬁts this bill. While Section3.2describes how we build on NoReC to create an exploratory dataset for ELSA, we ﬁrst describe the different levels of existing an-notations in NoReC below. 3.1 NoReC NoReC is a multi-domain dataset of full-text pro-
Can’t find what you’re looking for?
Help us improve DuckDuckGo searches with your feedback

Can’t find what you’re looking for?

See What’s DuckDuckNew