Group communication is often characterized by complex group structures, involving multiple entities, and based on an implicit semantic, reflected in domain specific ontologies. Further, communication is conducted via various means and methods, employing various multimodal data types (text, images, videos). Such communication data often is of high variability, involving structured and unstructured information, sometimes with metadata, sometimes without.
To analyze this heterogeneous data, it is necessary to prepare and correlate the different data sources and then find the hidden, complex structures hidden within. Here we focus mainly on the semantic analysis of text data and plan to investigate automatic multi-language semantic entity extraction and supporting domain-specific ontologies for intelligent searches. We further plan to design advanced visualization methods allowing for the exploration and analysis of the semantic text data as well as combine these results with the other multimodal data in an interactive Visual Analytics application.
- How can entities be extracted from multi-language texts and visualization be used to correlate different entities (using statistics, semantics, rule-based methods)?
- How can domain-specific ontologies be generated, continuously adapted, and support users when searching multi-language texts.
- How can knowledge from the semantic text analysis and all the other multimodal data sources be correlated and put into context to generate a complex knowledge graph, detecting hidden structures?
- Highly motivated
- Experience in text analysis (Document Analysis) and NLP recommended (or readiness to get up to speed
- Excellent programming skills in Python / D3 / Visualization or comparable
It is possible to work only on a specific sub-aspect of the proposed problems and tasks. Feel free to discuss your preferences with us!
- Scope: Bachelor / Master
- Project / Thesis Duration (Bachelor): 3 months + 3 months
- Project / Thesis Duration (Master): 6 months + 6 months
- Start: Planned in February 2020
- Seebacher, D., Fischer, M. T., Sevastjanova, R., Keim, D. A., & El-Assady, M. (2019). Visual Analytics of Conversational Dynamics. In EuroVis Workshop on Visual Analytics (EuroVA).
- Sacha, D., Jentner, W., Zhang, L., Stoffel, F., Ellis, G., & Keim, D. (2017). Applying Visual Interactive Dimensionality Reduction to Criminal Intelligence Analysis. VALCRI White Paper Series, 1.
- El-Assady, M., Sevastjanova, R., Gipp, B., Keim, D. A. und Collins, C. (2017) NEREx: Named-Entity Relationship Exploration in Multi-Party Conversations, Computer Graphics Forum, The Eurographics Association and John Wiley & Sons Ltd., pp. 213-225, 2017.