Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
A collection of text embeddings of the arXiv corpus by title and abstract - MaRDI portal

A collection of text embeddings of the arXiv corpus by title and abstract (Q6673194)

From MaRDI portal





Dataset published at Zenodo repository.
Language Label Description Also known as
English
A collection of text embeddings of the arXiv corpus by title and abstract
Dataset published at Zenodo repository.

    Statements

    0 references
    A popular online repository of arXiv is home to numerous preprints in many scientific domains. Other than playing a role of disseminating up-to-date knowledge in pertaining domains, arXiv is an interesting complex system by itself from text analytics point of view. In this repository, we provide a collection of text embedding outputs for (almost) all papers from the arXiv corpus by their titles and abstracts in order to provide multi-faceted characteristics of scientific knowledge.
    0 references
    8 August 2023
    0 references
    0 references
    0 references
    2023-08-08
    0 references

    Identifiers

    0 references