24/03/2022

ONE research grant for candidates with Master Degree with reference number BI| 2022/272 is now available under the scope of project MAIA

This job offer has expired


  • ORGANISATION/COMPANY
    INESC ID
  • RESEARCH FIELD
    Engineering
  • RESEARCHER PROFILE
    First Stage Researcher (R1)
  • APPLICATION DEADLINE
    13/05/2022 23:00 - Europe/Athens
  • LOCATION
    Portugal › Lisboa
  • TYPE OF CONTRACT
    Other
  • JOB STATUS
    Other
  • REFERENCE NUMBER
    MAIA - PT2020/45909/20 - BI| 2022/272

OFFER DESCRIPTION

Public notice for research grant

MAIA - PT2020/45909/20

INESC-ID - Instituto de Engenharia de Sistemas e Computadores, Investigação e Desenvolvimento em Lisboa is a R&D institute dedicated to advanced research and development in the fields of Information Technologies, Electronics, Communications, and Energy. INESC-ID has participated in more than 50 research projects funded by the European Union and more than 190 funded by national entities. Until today, our researchers have published more than 700 papers in international journal papers, more than 3000 papers in international conferences, and have registered 15 patents and/or brands.

1 | RESEARCH GRANT TYPE

ONE research grant for candidates with Master Degree with reference number BI| 2022/272 is now available under the scope of project MAIA – MULTILINGUAL AI AGENTS FOR CUSTOMER SERVICE – PROJETO PARCERIA INTERNACIONAL CMU REF. 045909, FUNDED BY THE APPLICABLEFINANCIAL FRAMEWORK, funded by FEDER, PROGRAMA OPERACIONAL REGIONAL DE LISBOA , AGÊNCIA NACIONAL DE

INOVAÇÃO and CMU, under the follow conditions:

2 | DURATION

9 months, starting in June 2022

- Renewable, if the candidate is enrolled in a PhD program - art. 6º, n.4 c)

https://files.dre.pt/2s/2019/12/241000000/0009100105.pdf

subject to suitable performance within the period of the project, not exceeding the maximum period set by FCT for such grants – 4 years (included contract renewals)

- Renewable, if the candidate is enrolled in a non-degree programme – art. 6º, n. 4 a)

https://files.dre.pt/2s/2019/12/241000000/0009100105.pdf

subject to suitable performance within the period of the project, not exceeding the maximum period set by FCT for such grants – 1 year (included contract renewals)

3 | LEGISLATION

A fellowship contract will be celebrated according to:

  1. Law 40/2004 of 18th of August (Scientific Research Fellow Status) and its successive amendments, including the amendments introduced by the Decree Law n. 123/2019 of 28 th of August

    https://dre.pt/web/guest/legislacao-consolidada/-/lc/124281176/201912061112/73740605/diploma/indice?lcq=estatuto+do+bolseiro,

     

  2. Regulations for Research Grants of the Foundation for Science and Technology in force (https://www.fct.pt/apoios/bolsas/docs/RegulamentoBolsasFCT2019.pdf )

     

  3. INESC-ID Lisboa Grant Regulations

https://www.inesc-id.pt/scholarship-regulations/

The fellowship contract is awarded on an exclusive dedication basis – art. 5 of Scientific Research Fellow Status and art. 16 of Regulations for Research Grants of the Foundation for Science and Technology.

4 | MONTHLY AMOUNT

The monthly amount of the grant is 1144.64€ in accordance with the values stipulated in the “Regulations for Research Grants of the Foundation for Science and Technology” in force https://www.fct.pt/apoios/bolsas/docs/Tabela_de_Valores_SMM_2022.pdf ) and shall be rendered through a monthly bank transfer to an account held by the grantee.

5 | OBJECTIVES/WORKPLAN

Chit-chat dialogue systems are currently trained in an end-to-end fashion with large collections of text corpora, resulting in pre-trained models that can be fine-tuned to various dialogue tasks. However, and as stated by several authors, one of the bottlenecks of these systems is that they do not display a consistent profile/persona/personality, a characteristic that is essential in task-oriented dialogues, such as in customer support settings, where, for instance, the (in)formality of the conversation/bot should be constant.

We believe that to improve the current state-of-the-art of chit-chat dialogue systems, more specific annotated datasets are needed. Nevertheless, there is a lack of such datasets. Exceptions are, for instance, the Persona-Chat dataset and subtitles datasets such as the Cornell Movie-Dialogs Corpus and the Friends dataset, that include basic profile information about the speakers. Nevertheless, all these corpora only exist for English, and are limited in the number of profiles/personas involved. Although not usually abundantly annotated, movie/series scripts contain information that could help improve chit-chat models: in movie scripts, each character line identifies the speaker, however the persona of the speaker is usually known. On the other hand, most subtitles datasets do not have this information, but exist in large quantities, for many languages, and are publicly available.

In this work, the candidate will explore how to extract and transfer profile/persona traits from movie/series scripts/subtitles datasets with the purpose of improving chit-chat dialogue systems. We will take advantage of deep learning models, but we will also resort to rule-base systems if needed. Moreover, we will take advantage of recent studies using latent action representations (VAEs, GANs, etc.) to capture persona features and speaker’s characteristics, and thus transfer this learned knowledge to other dialogue tasks.

6 | SCIENTIFIC SUPERVISION

The activity will be supervised by Maria Luísa Torres Ribeiro Marques da Silva Coheur, researcher at INESC-ID and Associate Professor at Instituto Superior Técnico.

INESC-ID will integrate the grantee in the research team of the scientific advisor.

7 | ADMISSION REQUIREMENTS

The candidates should have an MSc in Computer Engineering or related areas.

By the grant start date, the candidate must be enrolled in

  1. a PhD programme – art. 6º, n.1

https://files.dre.pt/2s/2019/12/241000000/0009100105.pdf

or

  1. a non-degree programme – art. 6º, n. 2

https://files.dre.pt/2s/2019/12/241000000/0009100105.pdf

Preferential factors:

preference will be given to candidates:

  • with proficiency in English.
  • with previous experience in NLP research, especially if related with the workplan.

8 | EVALUATION CRITERIA AND COMMITTEE

The selection will be according to the following criteria:

  1. CV – 50%
  2. Previous work in NLP research – 50%

The jury may also decide not to assign the scholarship, if none of candidates meets the required conditions.

Jury

Name

Professional Status

Institutions

President

Helena Gorete Silva Moniz

Researcher / Assistant Professor

INESC-ID | FCULisboa

Member

Maria Luísa Torres R. Marques da Silva Coheur

Researcher / Associate Professor

INESC-ID | Tecnico Ulisboa

Member

João Paulo Baptista de Carvalho

Researcher / Associate Professor

INESC-ID | Tecnico Ulisboa

Substitute member

Isabel Maria Martins Trancoso

Researcher / Full Professor

INESC-ID | Tecnico Ulisboa

Substitute member

Bruno Emanuel da Graça Martins

Researcher / Associate Professor

INESC-ID | Tecnico Ulisboa

9 | COMPLAIN AND APPEAL DEADLINES AND PROCEDURES

The jury has the faculty not to select a candidate who does not prove the requirements mentioned in required education Level and research experience

The admitted and excluded candidates will be notified by email of the final ranking list, including the copy of the Preliminary Report of the jury.

Prior Hearing and Deadline for Final Decision: After being notified, candidates have 10 working days to submit, if applicable, a formal rebuttal.

After that period, the jury notifies the candidates of the Final Report.

Excluded applicants may complain about the jury's final report for 15 working days after notification or appeal the jury's decision to the INESC ID Board of Directors for 30 working days after notification.

 

According to the Portuguese Law, a disabled candidate has a preference when in equal classification, which prevails over any other legal preference. Candidates must declare their respective degree of disability, the type of disability and the means of communication / expression to be used in the selection process, under the law.

10 | FORMALISATION OF APPLICATIONS

Applications are formalised by sending an email to rh@inesc-id.pt with the documents stated bellow and in pdf form.

The application email should clearly state the reference of the research grant and project.

10.1

Single copy of official academic degree certificate in the required education level

a) In the application submission, the candidates from Portuguese education institutions may replace the copy of the official academic degree certificate by a declaration of honour stating that they have the required academic degree.

 

It is mandatory for the approval of the fellowship contract that the selected candidate presents a single copy of the official academic degree certificate, required in education level

 

b) In the application submission, the candidates from foreigner education institutions may replace the copy of the official academic degree certificate by a declaration of honour stating that they have the required academic degree.

  • It is mandatory for the approval of the fellowship contract that the selected candidate presents a single copy of the official diploma recognition, required in education level

 

  • For more information about diploma recognition:

 

 

https://www.dges.gov.pt/en/pagina/degree-and-diploma-recognition

10.2

Detailed list of grades (pdf form);

 

10.3

Proof of enrolment required on 7 a) or 7 b) (pdf form);

In the application submission, the candidates may replace the proof of enrolment by a declaration of honour stating that they are/will be enrolled required in 7 a) or 7 b)

It is mandatory for the approval of the fellowship contract that the selected candidate presents an official copy of the enrolment, required in 7 a) or 7 b)

 

 

10.4

Detailed curriculum vitae (pdf form);

 

10.5

Motivation letter explaining the interest in the position (pdf form);

 

10.6

Name of two personal references (pdf form).

 
 
 
 
 
 
 

11 | Application Dates

From

To

25-03-2022

 

13-05-2022

More Information

Benefits

The monthly amount of the grant is 1144.64€ in accordance with the values stipulated in the “Regulations for Research Grants of the Foundation for Science and Technology” in force https://www.fct.pt/apoios/bolsas/docs/Tabela_de_Valores_SMM_2022.pdf ) and shall be rendered through a monthly bank transfer to an account held by the grantee.

Eligibility criteria

The candidates should have an MSc in Computer Engineering or related areas.

By the grant start date, the candidate must be enrolled in

  1. a PhD programme – art. 6º, n.1

https://files.dre.pt/2s/2019/12/241000000/0009100105.pdf

or

  1. a non-degree programme – art. 6º, n. 2

https://files.dre.pt/2s/2019/12/241000000/0009100105.pdf

Preferential factors:

preference will be given to candidates:

  • with proficiency in English.
  • with previous experience in NLP research, especially if related with the workplan.

Selection process

The selection will be according to the following criteria:

  1. CV – 50%
  2. Previous work in NLP research – 50%

The jury may also decide not to assign the scholarship, if none of candidates meets the required conditions.

Additional comments

a) In the application submission, the candidates from Portuguese education institutions may replace the copy of the official academic degree certificate by a declaration of honour stating that they have the required academic degree.

It is mandatory for the approval of the fellowship contract that the selected candidate presents a single copy of the official academic degree certificate, required in education level

 

b) In the application submission, the candidates from foreigner education institutions may replace the copy of the official academic degree certificate by a declaration of honour stating that they have the required academic degree.

- It is mandatory for the approval of the fellowship contract that the selected candidate presents a single copy of the official diploma recognition, required in education level

- For more information about diploma recognition:

https://www.dges.gov.pt/en/pagina/degree-and-diploma-recognition

Offer Requirements

  • REQUIRED EDUCATION LEVEL
    Engineering: Master Degree or equivalent
  • REQUIRED LANGUAGES
    ENGLISH: Good

Skills/Qualifications

Chit-chat dialogue systems are currently trained in an end-to-end fashion with large collections of text corpora, resulting in pre-trained models that can be fine-tuned to various dialogue tasks. However, and as stated by several authors, one of the bottlenecks of these systems is that they do not display a consistent profile/persona/personality, a characteristic that is essential in task-oriented dialogues, such as in customer support settings, where, for instance, the (in)formality of the conversation/bot should be constant.

We believe that to improve the current state-of-the-art of chit-chat dialogue systems, more specific annotated datasets are needed. Nevertheless, there is a lack of such datasets. Exceptions are, for instance, the Persona-Chat dataset and subtitles datasets such as the Cornell Movie-Dialogs Corpus and the Friends dataset, that include basic profile information about the speakers. Nevertheless, all these corpora only exist for English, and are limited in the number of profiles/personas involved. Although not usually abundantly annotated, movie/series scripts contain information that could help improve chit-chat models: in movie scripts, each character line identifies the speaker, however the persona of the speaker is usually known. On the other hand, most subtitles datasets do not have this information, but exist in large quantities, for many languages, and are publicly available.

In this work, the candidate will explore how to extract and transfer profile/persona traits from movie/series scripts/subtitles datasets with the purpose of improving chit-chat dialogue systems. We will take advantage of deep learning models, but we will also resort to rule-base systems if needed. Moreover, we will take advantage of recent studies using latent action representations (VAEs, GANs, etc.) to capture persona features and speaker’s characteristics, and thus transfer this learned knowledge to other dialogue tasks.

Specific Requirements

The candidates should have an MSc in Computer Engineering or related areas.

By the grant start date, the candidate must be enrolled in

  1. a PhD programme – art. 6º, n.1

https://files.dre.pt/2s/2019/12/241000000/0009100105.pdf

or

  1. a non-degree programme – art. 6º, n. 2

https://files.dre.pt/2s/2019/12/241000000/0009100105.pdf

Preferential factors:

preference will be given to candidates:

  • with proficiency in English.
  • with previous experience in NLP research, especially if related with the workplan.

Work location(s)
1 position(s) available at
INESC ID
Portugal
Lisboa
Lisboa
1000-029
Rua Alves Redol, 9

EURAXESS offer ID: 762412

Disclaimer:

The responsibility for the jobs published on this website, including the job description, lies entirely with the publishing institutions. The application is handled uniquely by the employer, who is also fully responsible for the recruitment and selection processes.

 

Please contact support@euraxess.org if you wish to download all jobs in XML.