Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
MediaText: a media industry-based dataset for scene text detetcion - MaRDI portal

Deprecated: Use of MediaWiki\Skin\SkinTemplate::injectLegacyMenusIntoPersonalTools was deprecated in Please make sure Skin option menus contains `user-menu` (and possibly `notifications`, `user-interface-preferences`, `user-page`) 1.46. [Called from MediaWiki\Skin\SkinTemplate::getPortletsTemplateData in /var/www/html/w/includes/Skin/SkinTemplate.php at line 691] in /var/www/html/w/includes/Debug/MWDebug.php on line 372

Deprecated: Use of QuickTemplate::(get/html/text/haveData) with parameter `personal_urls` was deprecated in MediaWiki Use content_navigation instead. [Called from MediaWiki\Skin\QuickTemplate::get in /var/www/html/w/includes/Skin/QuickTemplate.php at line 131] in /var/www/html/w/includes/Debug/MWDebug.php on line 372

MediaText: a media industry-based dataset for scene text detetcion

From MaRDI portal



DOI10.5281/zenodo.12796380Zenodo12796380MaRDI QIDQ6699604

Dataset published at Zenodo repository.

Author name not available (Why is that?)

Publication date: 22 July 2024

Copyright license: No records found.



Media-Text Media-Text dataset comprising images of banners, posters, covers and another images characterised for media industry. Full paper is available here: Media-Text: a Media Industry-Based Dataset for Scene Text Detection DATASET DESCRIPTION 400 images 7 744 annotated text instances 973 annotations have been marked as illegible for the task of text recognition 659 texts have been markes as do not care (###) for scene text detection. Images are represented by 193 unique resolutions. Annotation Format - Each image has corresponding gt_*.txt file, which contains annotations in bounding box format (defined by 4 courners), transcription, and bool flag which determines that text is illegible for OCR. Proposed format is similar to ICDAR15 annotations. x1, x2, ..., x4, y4, transcription, OCR Flag Example:37,68,198,49,214,181,52,200,LADIES,False ACKNOWLEDGMENT This work was supported by the Silesian University of Technology (SUT) through the subsidy for maintaining and developing research potential grant in 2024 for young researchers, No. 2/070/BKM24/0058, and by the Ministry of Science and Higher Education "Implementation Doctorate" No. DWD/5/0511/2021. Thanks to the graphic department of media-press group for the preparation and possibility of sharing graphics thematically related to the prepared dataset. LICENSE Annotations created by authors are licesned under CC-BY-4.0 license.Images from the Open-Image-V7 dataset and are licensed according to their source information. Source information is defined in a file metadata.csv file that defines all the metadata of each file (File name corresponds to the ImageID column). Images whose name corresponds to the media_press pattern are provided for academic use. CITING THE RELATED WORKS Please cite the related works in your publications if it helps your research: ``` @inproceedings{inproceedings, author = {Kalisz, Seweryn and Marczyk, Michał and Polanska, Joanna}, booktitle = {Modelling and simulation 2024. The 2024 European Simulation and Modelling Conference} editor = {Manuel Graa; J. David Nuez-Gonzalez} year = {2024}, month = {10}, pages = {138-144}, publisher = {EUROSIS-ETI}, title = {Media-Text: a Media Industry-Based Dataset for Scene Text Detection} } ```






This page was built for dataset: MediaText: a media industry-based dataset for scene text detetcion