EXPloiting Empirical appRoaches to Translation

***Interested in applying for an EXPERT Early Stage Researcher position as advertised on Euraxess and by individual institutions?***

The application process consists of two compulsory stages:

Stage 1: Register and fill in the application form on the EXPERT website. You will receive a reference number and confirmation.

  1. Register for an account and fill in the basic details in the application form.
  2. Your account will be created and you will receive an e-mail containing information about how to log in and set your password.
  3. Follow those instructions and log in.
  4. Take a note of your Reference Number on the page 'Application Form (Stage1)' (use the link at the top right corner of the page).
  5. You can modify/update this form in the future (Edit and Save).

Stage 2: Depending on the position(s) to which you intend to apply, go to the website of the relevant host institution to obtain an electronic application. You will either need to email the application or submit it electronically, according to the rules of the host institution. In either case, please attach or upload a copy of your confirmation from

NB: Only applicants completing BOTH of the above stages will be taken into consideration.

Access for Stage 2 of the application is via individual institutional websites. Please proceed as follows:

Project details:

EXPERT (EXPloiting Empirical appRoaches to Translation) aims to train young researchers, namely Early Stage Researchers (ESRs) and Experienced Researchers (ERs), to promote the research, development and use of hybrid language translation technologies.

Human and automatic translation are an important part of the policy of multilingualism within Europe and EXPERT brings the two together through the development of next-generation technologies to address the needs of both translators and EC policy. EXPERT fits within the EC's 2020 strategic framework to promote (i) a digital agenda for Europe, which proposes to better exploit the potential of ICTs in order to foster innovation, economic growth and progress: EXPERT will improve translation practices and enhance the productivity of relevant actors in the translation market by developing new ICT in the field of translation; and (ii) an agenda for new skills and jobs: EXPERT will help modernize the translation labour market by promoting new job profiles such as human translators and post-editors who will have the skills to make use of the latest ICT and translation technologies, along with automated translation researchers and developers. EXPERT will contribute to the general notion that ICT needs to be language-aware and promote content creation in multiple languages. By training young researchers to become future leaders in this area in Europe, as well as producing training material to be used by a number of other professionals and users, EXPERT will contribute to a strong and effective European leadership in the area.

Translation Memory (TM) and Machine Translation (MT) are the two most common technologies used to support human language translation. TMs are interactive systems which aim to help humans during the translation process, by offering suggestions based on matches with previous translations of similar texts for all or parts of the input text (called segments), leaving the unmatched parts for the human translator. MT systems, on the other hand, aim to fully translate the input texts. Most recent research in the MT field has focused on corpus-based (or empirical) approaches, particularly two variations based on examples of translations to automatically build translation systems: Example-Based (EBMT) and Statistical Machine Translation (SMT). These approaches are cheaper and faster to develop, as compared to rule-based MT, which requires specifying linguistic rules, a costly process usually done manually by experts in both languages.
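
The matching step that a TM performs on each segment can be illustrated with a minimal sketch. It is not the method of any particular TM tool or of EXPERT: the function name `best_tm_match`, the 0.7 threshold and the tiny English-French memory are all hypothetical, and Python's standard `difflib.SequenceMatcher` stands in for whatever similarity measure a real system would use.

```python
from difflib import SequenceMatcher


def best_tm_match(segment, memory, threshold=0.7):
    """Return (source, target, score) for the most similar stored segment.

    `memory` is a list of (source, target) pairs from previous translations.
    Returns None when no stored source reaches `threshold` (an assumed
    cut-off), leaving the segment for the human translator.
    """
    best = None
    for source, target in memory:
        # Character-level similarity in [0, 1]; 1.0 means an exact match.
        score = SequenceMatcher(None, segment.lower(), source.lower()).ratio()
        if score >= threshold and (best is None or score > best[2]):
            best = (source, target, score)
    return best


# Hypothetical translation memory with two previously translated segments.
memory = [
    ("Press the power button.", "Appuyez sur le bouton d'alimentation."),
    ("Close the lid.", "Fermez le couvercle."),
]

exact = best_tm_match("Press the power button.", memory)
fuzzy = best_tm_match("Press the red power button.", memory)
unmatched = best_tm_match("Insert the battery.", memory)
```

Here the first query is an exact match, the second is a fuzzy match whose stored translation would be offered as a suggestion for the translator to edit, and the third finds no sufficiently similar segment and is left to the translator.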

TM technology is mainly used by professional translators, who are experts in both the language pair and the text domain, to translate repetitive documents. MT technology is mostly aimed at the general public: users with little knowledge of the source or target language, who translate texts of general domain and genre and are mostly interested only in getting the gist of the text or a draft translation.

Recently, a number of developments in EBMT and SMT have shown the potential of corpus-based MT approaches for producing fast and low-cost translations, significantly reducing human effort, time and costs. However, according to Allied Business Intelligence, only 1% of the world's translation demand is covered by MT, while the remainder is covered by human translators. The main reason for this figure is that MT tools are not designed to aid professional translators. Shortcomings of MT technologies include user-unfriendly interfaces and a lack of awareness of translators' feedback. Therefore, their great potential has not yet been appropriately exploited. TM systems also have a number of well-known limitations, mainly their poor performance for texts which have not been translated before. Because the two types of translation approaches are seen as serving diverse and non-overlapping target users, little research has been done on integrating these technologies to provide better solutions. In EXPERT we advocate that there is no clear boundary between supposedly fully automatic translation (i.e. MT) and semi-automatic translation (i.e. TM). We consider instead that both variations are tools to help humans (professional translators and end-users) to produce high-quality, reliable, fast and cheap translations. EXPERT will accommodate the requirements of different types of users, by prioritising its research according to evidence about the needs and problems encountered in real-life conditions by users of translation technologies, including both professional translators and readers of translations. The purpose of EXPERT is twofold:

  1. To improve existing corpus-based TM and MT technologies by addressing their well-known shortcomings via the use of more sophisticated levels of linguistic processing, terminology and domain knowledge, along with better consideration of user requirements and feedback, in order to improve translation quality and user satisfaction, and allow quick development for new language pairs, including resource-poor languages.
  2. To create hybrid technologies which incorporate the main features of corpus-based approaches, minimizing human translators' effort and tailoring the system according to their needs, by allowing different levels of "assistance" to be provided in a user-friendly workflow.