Create the solutions your customers are dreaming of
Taken separately, each module is an efficient tool for processing language, documents or names. Combine them and you benefit from powerful multilingual capabilities for data mining or semantic search solutions.
Whatever your end-customers need (eDiscovery and digital forensics, OSINT or COMINT analysis, competitive and marketing intelligence, e-reputation monitoring, sentiment analysis for more customer insight), SYSTRAN’s LDK enables you to utilize and analyze both structured and unstructured multilingual content, such as user-generated content, social media, Web content and more.
Now you’re fully equipped to create powerful data-mining or intelligence solutions for your innovation-loving customers!
Imports various document formats for processing by other modules, and rebuilds of the document in the original format with modified or annotated content and preserved layout.
Automatically identifies which language documents are written in, through specific word- or sentence-sample detection.
Segmentation and Tokenization
Segments text into sentences and sentences into “tokens” (minimal processing units).
Normalizes text from blogs, emails, chat forums and user-generated content, while correcting common errors and language deviations.
Determines the document “domain” based on predefined models, and displays key “hot topic” words.
Named Entity Recognition
Based on the analysis of the document contents, automatically recognizes and displays person names, locations, numbers, dates and organization names.
Offers monolingual and bilingual dictionary lookups, with additional contextual information such as frequency of meanings, domains and contexts, expressions and examples.
Provides morphological analysis for individual words, returning the list of possible lemmas and parts of speech for an inflected form.
Provides syntactic analysis at the sentence level, with the layers of linguistic analysis: word identification, part-of-speech tagging, and constituent and dependency analysis.
Makes transcription and transliteration of words or entities between languages with different scripts, and detects proper noun origins.
WSD (Word Sense Disambiguation)
Selects best meaning of a word depending on the context.
Supported Platforms are: Windows, Linux, Mac OS, iOS, Android.