Concept information
Preferred term
Spark NLP
Definition(s)
- Open-source text processing library built on Apache Spark.
Broader concept(s)
Bibliographic citation(s)
- Kocaman, V., & Talby, D. (2021). Spark NLP: Natural language understanding at scale. Software Impacts, 8, 100058. doi:10.1016/j.simpa.2021.100058
based on
has design country
- United States
has for input language
- Abkhazian
- Afrikaans
- Albanian
- Alemannic
- Alsatian
- Amharic
- Arabic
- Aragonese
- Armenian
- Asturian
- Asuri
- Azerbaijani
- Bantu
- Bashkir
- Basque
- Bavarian
- Belarusian
- Bemba
- Bengali
- Bihari
- Bishnupriya Manipuri
- Bislama
- Bosnian
- Bulgarian
- Burmese
- Catalan
- Cebuano
- Central Bikol
- Chamorro
- Chechen
- Chichewa
- Chinese
- Church Slavonic
- Chuukese
- Chuvash
- Coptic
- Corsican
- Croatian
- Czech
- Danish
- Dutch
- Efik
- Enawenê-Nawê
- English
- Erzya
- Esperanto
- Estonian
- Ewe
- Faroese
- Finnish
- Flemish
- Fon
- French
- Ga
- Galician
- Ganda
- Georgian
- German
- Gilbertese
- Gothic
- Greek
- Gujarati
- Gun
- Haitian Creole
- Hausa
- Hebrew
- Hiri Motu
- Hungarian
- Icelandic
- Igbo
- Indonesian
- Interlingua
- Irish
- Isoko
- Italian
- Japanese
- Javanese
- Kabyle
- Kalaallisut
- Kannada
- Kanuri
- Kaonde
- Kazakh
- Khmer
- Kinyarwanda
- Kirundi
- Konkani
- Korean
- Kurdish
- Kwangali
- Kwanyama
- Kyrgyz
- Lahnda
- Lao
- Latin
- Latvian
- Lewotobi
- Ligurian
- Limburgish
- Lingala
- Lithuanian
- Lombard
- Louisiana Creole
- Low German
- Lozi
- Luba-Kasai
- Luba-Katanga
- Lunda
- Luo
- Luvale
- Luxembourgish
- Macedonian
- Maithili
- Malagasy
- Malay
- Malayalam
- Maldivian
- Maltese
- Manx
- Marathi
- Marshallese
- Mauritian Creole
- Mazanderani
- Minangkabau
- Mingrelian
- Mirandese
- Mizo
- Mongolian
- Montenegrin
- Ndonga
- Neapolitan
- Nepali
- Nigerian Pidgin
- Niuean
- Northern Sámi
- Northern Sotho
- North Frisian
- Norwegian
- Nyaneka
- Occitan
- Odia
- Oromo
- Ossetian
- Palatine German
- Pangasinan
- Papiamento
- Paraguayan Guaraní
- Pashto
- Persian
- Pijin
- Pohnpeian
- Polish
- Portuguese
- Punjabi
- Quechuan
- Romanian
- Romansh
- Russian
- Ruund
- Sabanê
- Samoan
- Sango
- Sardinian
- Sayula Popoluca
- Scots
- Scottish Gaelic
- Serbian
- Serbo-Croatian
- Seychellois Creole
- Shona
- Sicilian
- Sindhi
- Sinhala
- Slovak
- Slovenian
- Somali
- Sotho
- Spanish
- Sranan Tongo
- Sundanese
- Swahili
- Swampy Cree
- Swazi
- Swedish
- Swiss German
- Tagalog
- Tahitian
- Tai
- Tajik
- Tamil
- Tatar
- Telugu
- Tetela
- Tetum
- Thai
- Tibetan
- Tigrinya
- Tiv
- Tok Pisin
- Tonga
- Tsonga
- Tswana
- Tumbuka
- Turkish
- Turkmen
- Tuvaluan
- Twi
- Tzotzil
- Ukrainian
- Umbundu
- Upper Sorbian
- Urdu
- Uyghur
- Uzbek
- Venda
- Venetian
- Vietnamese
- Volapük
- Wallisian
- Walloon
- Waray
- Welsh
- West Frisian
- Wolaitta
- Wolof
- Xhosa
- Yakut
- Yapese
- Yiddish
- Yoruba
- Yucatec Maya
- Zande
- Zazaki
- Zeelandic
- Zulu
has repository
has download location
implements
is encoded in
has interface
is executed in
- chunking
- coreference resolution
- dependency parsing
- keyword extraction
- language identification
- lemmatization
- machine translation
- named entity recognition
- normalization
- PoS tagging
- question answering
- sentence embedding
- sentiment analysis
- stemming
- text categorization
- tokenization
- word embedding
- word segmentation
has for license
In other languages
- French
URI
http://data.loterre.fr/ark:/67375/LTK-XT2FHL42-3