Skip to content

Research at St Andrews

Language transfer for early warning of epidemics from social media

Research output: Contribution to conferencePaperpeer-review

Open Access permissions

Open

Author(s)

Mattias Appelgren, Patrick Schrempf, Matúš Falis, Satoshi Ikeda, Alison Q O'Neil

School/Research organisations

Abstract

Statements on social media can be analysed to identify individuals who are experiencing red flag medical symptoms, allowing early detection of the spread of disease such as influenza. Since disease does not respect cultural borders and may spread between populations speaking different languages, we would like to build multilingual models. However, the data required to train models for every language may be difficult, expensive and time-consuming to obtain, particularly for low-resource languages. Taking Japanese as our target language, we explore methods by which data in one language might be used to build models for a different language. We evaluate strategies of training on machine translated data and of zero-shot transfer through the use of multilingual models. We find that the choice of source language impacts the performance, with Chinese-Japanese being a better language pair than English-Japanese. Training on machine translated data shows promise, especially when used in conjunction with a small amount of target language data.
Close

Details

Original languageEnglish
Number of pages6
Publication statusPublished - 13 Dec 2019
EventArtificial Intelligence for Humanitarian Assistance and Disaster Response - Vancouver, Canada
Duration: 13 Dec 2019 → …
http://hadr.ai

Workshop

WorkshopArtificial Intelligence for Humanitarian Assistance and Disaster Response
CountryCanada
CityVancouver
Period13/12/19 → …
Internet address

Discover related content
Find related publications, people, projects and more using interactive charts.

View graph of relations

Related by author

  1. Templated text synthesis for expert-guided multi-label extraction from radiology reports

    Schrempf, P., Watson, H., Park, E., Pajak, M., MacKinnon, H., Muir, K. W., Harris-Birtill, D. & O’Neil, A. Q., 24 Mar 2021, In: Machine Learning and Knowledge Extraction. 3, 2, p. 299-317 19 p.

    Research output: Contribution to journalArticlepeer-review

  2. Paying per-label attention for multi-label extraction from radiology reports

    Schrempf, P., Watson, H., Mikhael, S., Pajak, M., Falis, M., Lisowska, A., Muir, K. W., Harris-Birtill, D. & O'Neil, A. Q., 2020, Interpretable and Annotation-Efficient Learning for Medical Image Computing: Third International Workshop, iMIMIC 2020, Second International Workshop, MIL3iD 2020, and 5th International Workshop, LABELS 2020, Held in Conjunction with MICCAI 2020, Lima, Peru, October 4–8, 2020, Proceedings. Cardoso, J., Van Nguyen, H., Heller, N., Henriques Abreu, P., Isgum, I., Silva, W., Cruz, R., Pereira Amorim, J., Patel, V., Roysam, B., Zhou, K., Jiang, S., Le, N., Luu, K., Sznitman, R., Cheplygina, V., Mateus, D., Trucco, E. & Abbasi, S. (eds.). Cham: Springer, p. 277-289 13 p. (Lecture Notes in Computer Science (including subseries Image Processing, Computer Vision, Pattern Recognition, and Graphics); vol. 12446 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  3. Ontological attention ensembles for capturing semantic concepts in ICD code prediction from clinical text

    Falis, M., Pajak, M., Lisowska, A., Schrempf, P., Deckers, L., Mikhael, S., Tsaftaris, S. & O'Neil, A., Nov 2019.

    Research output: Contribution to conferencePaperpeer-review

  4. RadarCat : Radar Categorization for input & interaction

    Yeo, H. S., Flamich, G., Schrempf, P. M., Harris-Birtill, D. C. C. & Quigley, A. J., 16 Oct 2016, Proceedings of the 29th Annual Symposium on User Interface Software and Technology (UIST '16). ACM, p. 833-841 9 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

ID: 264577925

Top