The School of Informatics is the largest, longest established and highest quality research group in informatics in the UK.

Research within the School is carried out across a number of institutes. The research programmes organised by the School of Informatics encompass a wide range of domains. Currently these include Artificial Life, Bioinformatics, Computational Thinking, Machine Learning, Music Informatics, Processes, Events & Activity, Software Engineering and System Level Integration.

Sub-communities within this community

Collections in this community

Recent Submissions

  • CSTR NAM TIMIT Plus 

    Yamagishi, Junichi; Brown, Georgina; Yang, ChenYu; Clark, Rob; King, Simon
    CSTR NAM TIMIT Plus (Version 0.8) RELEASE May 2012 The Centre for Speech Technology Research University of Edinburgh Copyright (c) 2012 Junichi Yamagishi jyamagis@inf.ed.ac.uk Overview This CSTR NAM TIMIT Plus corpus ...
  • SFARI Genes and where to find them; classification modelling to identify genes associated with Autism Spectrum Disorder from RNA-seq data 

    Simpson, Ian; Navarro, Magdalena
    Abstract Motivation: Autism spectrum disorder (ASD) has a strong, yet heterogeneous, genetic component. Among the various methods that are being developed to study it, one that is gaining popularity is the incorporation ...
  • Datasets of journal paper "Arduino-Based Myoelectric Control: Towards Longitudinal Study of Prosthesis Use" 

    Wu, Hancong; Kianoush, Nazarpour
    This dataset comprises the mean absolute value (MAV) to draw Figure 6 and the normalized MAV to draw Figure 7 in paper "Arduino-Based Myoelectric Control: Towards Longitudinal Study of Prosthesis Use".
  • multimodal TRIPOD 

    Papalampidi, P; Keller, F; Lapata, M
    The data contain multimodal features extracted for the TRIPOD dataset and used in the AAAI 2021 paper "Movie Summarization via Sparse Graph Construction". The data contain 122 pickle files, each one corresponding to a movie ...
  • Biorobot log data for visual navigation with transverse oscillating route following (TORF) 

    Stankiewicz, JT
    The purpose of this work was to develop a flying robot that can navigate using a bee-inspired visual route following approach. To this end we performed several missions in which an aerial robot with an onboard camera and ...
  • Archival Metadata Descriptions from the University of Edinburgh Centre for Research Collections - Extracted October 2020 

    Havens, L; Alex, B; Bach, B; Terras, M; Renton, S; Hosker, R; Centre for Research Collections, The
    The dataset includes metadata descriptions extracted from the Centre for Research Collections' online archival catalog using OAI-PMH EAD harvesting. Metadata descriptions were extracted from four metadata fields: an ...
  • Listening-test materials for "Where do the improvements come from in sequence-to-sequence neural TTS?" 

    Watts, Oliver; Henter, Gustav Eje; Fong, Jason; Valentini-Botinhao, Cassia
    This data release contains listening-test materials associated with the paper "Where do the improvements come from in sequence-to-sequence neural TTS?", presented at SSW10 (the 10th ISCA Speech Synthesis Workshop) in Vienna, ...
  • CAD files for an insect inspired biorobot 

    Stankiewicz, Jan; Webb, Barbara
    Data in support of University of Edinburgh PhD thesis titled "Using a quadcopter to model the visual navigation behaviours of flying insects".
  • IDEAL Household Energy Dataset 

    Goddard, Nigel; Kilgour, Jonathan; Pullinger, Martin; Arvind, D.K; Lovell, Heather; Moore, Johanna; Shipworth, David; Sutton, Charles; Webb, Jan; Berliner, Niklas; Brewitt, Cillian; Dzikovska, Myroslava; Farrow, Edmund; Farrow, Elaine; Mann, Janek; Morgan, Evan; Webb, Lynda; Zhong, Mingjun
    The IDEAL Household Energy Dataset comprises data from 255 UK homes. Alongside electric and gas data from each home the corpus contains individual room temperature and humidity readings and temperature readings from the ...
  • Opinions on Weblinks 

    Albakry, Sara; Vaniea, Kami; Wolters, Maria
    This document provides a concrete version of the survey used in the URL reading experiment conducted in April 2017 and reported in the associated paper to appear at CHI'20 on 25 April, 2020. This documentation serves two ...
  • CSTR VCTK Corpus: English Multi-speaker Corpus for CSTR Voice Cloning Toolkit (version 0.92) 

    Yamagishi, Junichi; Veaux, Christophe; MacDonald, Kirsten
    This CSTR VCTK Corpus includes speech data uttered by 110 English speakers with various accents. Each speaker reads out about 400 sentences, which were selected from a newspaper, the rainbow passage and an elicitation ...
  • ManySStuBs4J Dataset 

    Karampatsis, Rafael-Michael
    The ManySStuBs4J corpus contains simple statement bugs mined from open-source Java projects hosted in GitHub. There are two variations of the dataset. One mined from the 100 Java Maven Projects and one mined from the top ...
  • WikiCatSum 

    Perez-Beltrachini, Laura; Liu, Yang; Lapata, Mirella
    WikiCatSum is a domain specific Multi-Document Summarisation (MDS) dataset. It assumes the summarisation task of generating Wikipedia lead sections for Wikipedia entities of a certain domain (e.g. Companies) from the set ...
  • ASVspoof 2019: The 3rd Automatic Speaker Verification Spoofing and Countermeasures Challenge database 

    Yamagishi, Junichi; Todisco, Massimiliano; Sahidullah, Md; Delgado, Héctor; Wang, Xin; Evans, Nicolas; Kinnunen, Tomi; Lee, Kong Aik; Vestman, Ville; Nautsch, Andreas
    This is a database used for the Third Automatic Speaker Verification Spoofing and Countermeasures Challenge, for short, ASVspoof 2019 (http://www.asvspoof.org) organized by Junichi Yamagishi, Massimiliano Todisco, Md ...
  • A Survey on Developer-Centred Security 

    Tahaei, Mohammad; Vaniea, Kami
    Our research reports a systematic literature review of 49 publications on security studies with software developer participants. These attached files are: - A BibTeX file: includes all 49 references in BibTex format. - ...
  • SUPERSEDED - ManySStuBs4J Dataset 

    Karampatsis, Rafael-Michael
    ## This item has been replaced by the one which can be found at https://doi.org/10.7488/ds/2628 ## The ManySStuBs4J corpus contains simple statement bugs mined from open-source Java projects hosted in GitHub. There are ...
  • Listening-test materials for "Modern speech synthesis for phonetic sciences: a discussion and an evaluation" 

    Malisz, Zofia; Henter, Gustav Eje; Valentini-Botinhao, Cassia; Watts, Oliver; Beskow, Jonas; Gustafson, Joakim
    This data release contains listening-test materials associated with the paper "Modern speech synthesis for phonetic sciences: a discussion and an evaluation", presented at ICPhS 2019 in Melbourne, Australia.
  • Alba speech corpus 

    Valentini-Botinhao, Cassia; Yamagishi, Junichi
    Single speaker read speech corpus of a Scottish accented female native English speaker (Alba). The corpus was recorded in four speaking styles: plain (normal read speech, around 4 hours of recordings), fast (speaking as ...
  • Listening test results of the Voice Conversion Challenge 2018 

    Yamagishi, Junichi; Wang, Xin
    This dataset is associated with a paper and a dataset below: (1) Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Tomi Kinnunen, Zhenhua Ling, "The Voice Conversion Challenge ...
  • UltraSuite Repository - sample data 

    Eshky, Aciel; Ribeiro, Manuel Sam; Cleland, Joanne; Renals, Steve; Richmond, Korin; Roxburgh, Zoe; Scobbie, James; Wrench, Alan
    UltraSuite is a repository of ultrasound and acoustic data from child speech therapy sessions. The current release includes three data collections, one from typically developing children -- Ultrax Typically Developing ...

View all