Services overview

CLARIN wants to offer a rich number of services. This overview is a start to list and describe services that should be subject of extensive discussion within CLARIN and in particular within WP2/WP5. The overview is not comprehensive and can only be seen as a start. The services are roughly categorized, although each categorization will have its limitations. The categorization will be relevant for establishing a useful taxonomy of language resources and technology. (In this document we make the separation between (data) resources and technology components (tools), although in the wider sense also the latter are resources. However, we want to follow the terminology introduced in CLARIN.)

We would like to ask all CLARIN members to contribute to this list. Just click on the Edit button below a table.

Infrastructure

Service Type Description
Resource Registry Services services that allow users to register their resources if they are accessible via Internet
Tool Registry Services services that allow users to register their tools if they are accessible via Internet
Schema Registry Services services that allow users to register their schemas if they are accessible via Internet
Concept Registry Services services that allow users to register concepts in an ISO DCR compliant form
Relation Registry Services services that allow users to register relations between DCR based concepts in an ISO compliant form
Virtual Collection Services services that allow users to build virtual collections of resources from different repositories and to store them as virtual entities, basically this will be done based on metadata descriptions
Workflow Services services that allow users to combine different web services to chains of operations with exchangeable components, basically this will be done based on metadata descriptions
Persistent ID Services services that allow to associate PIDs with all kinds of resources, to maintain them and to allow resolving them
Distributed Authentication Services services that allow users to access remote resources by making use of their home identity and by using middleware to exchange user credentials
Registry Browse/Search Services services that allow users to find appropriate resources and tools that are accessible via Internet, these can be catalogue browsing, geographic browsing and structured and unstructured search
Registry Editing Services services that allow users to create and manipulate registry entries, in particular metadata fields
Registry Schema Services services that allow users to create their own metadata schemas by making use of accepted vocabularies
Registry Gateway Services services that allow other service providers to harvest the registry content, eventually semantic mapping has to be provided for example to offer DC records
Registry Harvesting Services services that allow CLARIN portals to harvest other metadata providers, in particular in CLARIN we will have a distributed system where harvesting mechanisms will need to be used
Registry Abstraction Services services that allow users to create their own hierarchy of metadata descriptions based on
Profile Mapping Services services that allow users to ask for suitable tools given a set of resources and a function specification

Resources

Service Type Description
Resource Creation Services services that allow users to create data resources of various sort such as annotations and lexica (detailed taxonomy to come from WP5)
Resource Repository Services services that allow others to store resources in a repository that can offer a solid and manageable repository system
Resource Upload Services services that allow users to upload resources into a repository
Resource Archiving Services services that take care of long term preservation of resources in a repository system
Resource Conversion Services services that allow users to convert between different encodings (characters, audio, video, etc) and formats, in general it is hoped that we will have generic formats for various resource types and that conversion is offered to these generic formats
Resource Merging Services services that allow users to merge two resources such as two lexica according to some criteria
Resource Access Services services that allow to access and present simple and complex resources that are stored in a repository via internet
Resource Content Search Services services that allow users to carry out structured and unstructured searches in the content of the resources contained in a virtual collection, these services need to be augmented by options to use resources for semantic interoperability
Resource Commentary Services services that allow users to make commentaries to content stored in repositories
Resource Relating Services services that allow users to relate resources and fragments of resources and to store them in repositories

Tools

Service Type Description
NLP Services a wide range of NLP services (see below)
Speech Services a wide range of speech services (see below)
Multimodal Services a wide range of mm services (see below)
Compute Services services that allow users to execute compute intensive jobs at compute servers
Wrapping Services services that allow users to encapsulate tools and to create web services

NLP

Service Type Description
Named Entity Recognition Services  
Tokenization Services  
POS Tagging Services  
Lemmatizing Services  
Parsing Services Parsers for natural languages, e.g. for discourse, grammar, morphology
Translation Services  
Summarization Services  
Annotation Services  
Document Classification Services A broad range of services, from document routing to spam filtering
Query Expansion Service In order to enrich queries by adding synonims, or other keywords for broadening the search.
Query Translation Service For services that could range from crosslingual searches, to accessing historical dictionaries to get old forms for searching in ancient texts.
Statistical Services A broad range of services that would include getting number of token/type occurrences in a text, relative frequency, distribution measures, and more complex such as Mutual Information measures, tdif, etc.

Speech

Service Type Description
Text-Sound Alignment Services services that allow users to align a transliteration with a corresponding speech signal
Audio Search Services services that allow users to search with pre-defined patterns in audio streams
Speech Recognition Services services that allow users to carry out speech recognition tasks
Annotation Services services that allow users to automatically annotate a given speech signal
Speech Synthesis Services services that allow users to create a synthesized speech based on a text
Intonation Analysis Services services that allow users to extract intonation patterns for a given sound signal
Dialogue manangement Services services to create and manage spoken dialogue systems

Multimodality

Service Type Description
Alignment Services services that allow users to align a video stream with a transliteration (sign language for example)
Image Search Services services that allow users to search for pre-defined patterns in a collection of video resources
Image Recognition Services services that allow users to recognize patterns in a video stream
Video Summarization services services to summarize a video stream in a sequence of key photos
Time Series Analysis Services services that allow users to analyze time series data such as from motion trackers
Sign/Gesture Recognition Services services that allow users to recognize certain signs and gestures in a video stream
Gesture Segmentation Services services that allow users to segment a video stream into
Facial Expression Synthesis Services services that allow users to synthesize facial expressions given on a stream of parameters
Gesture Synthesis Services services that allow users to synthesize gestures based on a stream of parameters

Topic revision: r3 - 2008-03-04 - SantiagoBel
 
This site is powered by the TWiki collaboration platform Powered by PerlCopyright © 2008-2019 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback