We are seeking a candidate with a background in Computer Science. The candidate will help develop online text-mining and machine-coding tools to code a sample group of state leaders’ discourses as well as non-state armed groups’ manifestos and statements. The texts will be extracted from news archives and online scholarly sources as well as other online sources.

The main natural language processing (NLP) tasks we are dealing with are text classification and information extraction. We will work on text-mining to identify the speeches and public statements of state and armed group leaders as well as extracting and coding some specific information, e.g., norm, ideas, and concepts used in these speeches and statements. The successful candidate is expected to support and work with the research team in developing and maintaining the whole text data management and processing pipeline to fulfill the following tasks:

  • Have high level of programming and technical skills, e.g. being able to work using Linux terminal, programming in Python,
  • Extract data, which is mainly text, from web pages and convert it into a processable format (e.g. extracting texts from .html, .pdf and/or .jpeg files and convert these texts into .txt format),
  • Arrange and follow the updating procedure of these data using version control systems such as Git on a regular basis,
  • Have knowledge and (preferably) experience in state-of-the-art Natural Language Processing, Machine Learning, information extraction, event extraction, entity tagging, coreference and anaphora resolution, and multilingual language processing,
  • Gather already prepared datasets, results, and linguistic resources from other groups and keep record of these materials along with the Project’s own datasets, and perform basic filtering and analysis on these datasets in case of need,
  • Maintain an open source software development policy that includes version control (Git) and preferably continuous integration systems for the team,
  • Have knowledge and preferably experience in web and API development,
  • Be proficient in English.

This could be a full-time or part-time position for 18 months and the successful candidate is expected to start as soon as possible. Please send your application as a single pdf file including a cover letter and CV to belginsan21@gmail.com (with subject: MOBILENCE Post-doc Application). Consideration of candidates will begin immediately and continue until the position is filled. All candidates with a Bachelor’s, Master’s or PhD degree, and continuing Master and PhD Students will be considered for the position.