Skip to main content
EURAXESS Researchers in motion

Job offer

Apply now
Logo of Science 4 Refugees
7 Jun 2024

Job Information

Organisation/Company
Ecole nationale des chartes
Research Field
History » History of social sciences
Computer science » Other
Language sciences » Philology
Researcher Profile
Recognised Researcher (R2)
Positions
PhD Positions
Country
France
Application Deadline
Type of Contract
Temporary
Job Status
Full-time
Hours Per Week
35
Offer Starting Date
Is the job funded through the EU Research Framework Programme?
FP7 / Ideas-ERC
Is the Job related to staff position within a Research Infrastructure?
Yes

Offer Description

PROJECT LostMA

LostMA is a project funded by the European Research Council, as a Starting Grant, for the period 2024-2029. LostMA aims at understanding how human cultures are constituted and evolve, through the question of the transmission of written cultural artefacts.

The project strives to establish in what measure the transmission (and subsequent preservation or loss) of written artefacts, texts and ideas deviates from pure chance, and, if it deviates, by how much and why. This will be investigated by analysing the way in which texts in manuscript form were copied, transformed or destroyed, in a similar manner to the evolution of living organisms or that of language variants, through processes of innovation/mutation, fixation or extinction. As such, the goal of this project is not only to understand the processes behind the transmission of texts, but also to grasp the extent to which humans are the actors of the transmission of their own culture and how much the survival of texts or the constitution of cultural canons are due to chance.

LostMA will attempt a paradigm-shift in philological methods, by combining artificial intelligence, complexity science and philological expertise. On the mathematical sciences side, the work will range from theoretical to numerical and statistical:stochastic models (birth-and-death processes, branching processes, random trees, …), computer simulations (agent-based models, in particular), machine learning and data analysis will be used to emulate and comprehend the processes and mechanisms of textual

transmission. A case study will be undertaken, regarding chivalric literature in European context. Supported by deep learning methods, large-scale data collection will be made on a corpus of 4000 documents in Romance, Germanic and Celtic languages, with a full-text zoom on approx. 1000 Old French manuscripts. Data will provide observable values to be compared to simulation results, in order to measure deviations from chance, make inferences on non observable values such as loss/survival rates of works and manuscripts, and

understand the dynamics at work behind the transmission of texts.

 

MISSIONS

Context

The research developer will have a transverse position inside the project. He or she will collaborate with most its actors, in particular with the data architect and the two PhD students, under the supervision of the principal investigator.

He or she will work in a very multidisciplinary team, with researchers in philology, digital humanities, machine learning and applied mathematics. The project and the team collaborations will provide the research developer with the possibility of developing or acquiring new skills during the project.

Tasks and work packages



The missions of the research developer will focus mostly on the following work packages :

• WP C.1 : Document processing workflow for medieval manuscripts

• WP C.2 : Machine learning for manuscript analysis



In the context of these work packages, the research developer will be in charge of developing, documenting and maintaining a deep learning based workflow, that will be applied on harvested digitisations of manuscripts from European digital libraries. This will include layout analysis, handwritten text recognition, text segmentation, normalisation and annotation, as well as text reuse detection, alignment and collation, in a multilingual settings (the language envisioned by the project are the medieval Romance, Germanic and

Celtic languages). In addition, he or she will provide engineering support to the tasks of the two following work packages :



• WP A.1 : Stochastic model design and implementation

• WP A.2 : Advanced models design In particular in what regards the release, reusability and sustainable development of the code of the models developed by the post-doc (specific knowledge in stochastic modelling is not required).



Globally, the research developer will be responsible for maintaining, documenting and publishing in a sustainable way the developments made in the project (under the form of Python packages and APIs). He or she will also work closely with the project Data Architect, already recruited, who is in charge of the data sets releases made during the project, and the open data policy.



Deadlines

The research developer will be recruited at month 7 of the project. The main deadlines (according to the project calendar) are:

• Months 7-30 implementation of workflow for text acquisition;

• Months 31-48 implementation of deep learning methods for manuscript analysis;

• Months 49-55 design and implementation of API to facilitate code reuse.

Deliverables

In addition to the release of the code and API, the Research Developer will also be directly involved in:

• project workshops, especially regarding the appropriation of the code and its reuse by the scientific community

• submission/attendance, with team, to major international conferences of the field (such as the Digital Humanities, or the Computational Humanities Research Conferences)

• contribution to the team publications in leading international multidisciplinary, or digital humanities journal.

 

 

Where to apply

Website
https://recrutement.psl.eu/

Requirements

Research Field
Computer science
Education Level
PhD or equivalent
Research Field
History
Education Level
PhD or equivalent
Skills/Qualifications

— MA or PhD in NLP or machine learning, with interdisciplinary applications

— MA or PhD in Computational Philology/in philology, with experience in the use of computational methods

— preferably, prior work experience in research projects and/or research development.

 

YOUR SKILLS

— machine learning (NLP and/or computer vision)

— Python development, in particular machine learning libraries (e.g. Pytorch, Scikit-learn,…) and NLP tools (Kraken, …)

— language models (e.g., BERT)

— sustainable software development (documentation, unit testing, Python packaging)

— API and web services (e.g., FastAPI)

— some degree of familiarity with historical data (especially texts) and/or interdisciplinary experience would be appreciated

— taste for multidisciplinary projects and team work.

Languages
FRENCH
Level
Excellent
Languages
ENGLISH
Level
Excellent
Research Field
Computer science
Years of Research Experience
1 - 4
Research Field
History
Years of Research Experience
1 - 4

Additional Information

Benefits

WE OFFER

— Inclusion in a dynamic and multidisciplinary team, in Paris

— Work on an European research project, with an ambitious scope and computational methodologies beyond the current state-of-the-art

— Opportunity to develop your skills and acquire new one, by being part of the strong and developing community dedicated to the study of past

and current cultures with computational methods inside the École des chartes and PSL.

Holidays: 49 days/year

Catering: contribution to catering expenses,

Health: affiliation with the social security system; contribution to complementary health insurance;

Transportation: contribution to transportation costs up to 75%;

Sport: access to the sports services of PSL University;

Training: access to training from the internal school.

 

Selection process

APPLICATION

Applications (CV and letter of application) should be sent to the Director of the École nationale des chartes, by e-mail to : recrutement@chartes.psl.eu, as well as jean-baptiste.camps@chartes.psl.eu until 5 July 2024. Selected candidates will be invited to attend a (remote or

in person) interview between 10 and 15 July 2024. Contract will start on the 1st September 2024, or as soon as possible afterwards.

Informations on position :

Jean-Baptiste Camps : jean-baptiste.camps@chartes.psl.eu



Administrative information : rh@chartes.psl.eu

Website for additional job details

Work Location(s)

Number of offers available
1
Company/Institute
Ecole nationale des chartes
Country
France
State/Province
Ile-de-France
City
Paris
Postal Code
75002
Street
65 rue Richelieu
Geofield
Number of offers available
1
Company/Institute
Campus Cpndorcet
Country
France
State/Province
Ile-de-France
City
Aubervilliers
Postal Code
93322
Street
14 cours des Humanités
Geofield

Contact

State/Province
FRANCE
City
PARIS
Website
Street
65 RUE RICHELIEU
Postal Code
75002
E-Mail
recrutement@chartes.psl.eu

Share this page