Publikationen

Hier finden Sie von Know-Center MitarbeiterInnen verfasste wissenschaftliche Publikationen

2018

Bassa Akim, Kröll Mark, Kern Roman

GerIE - An Open InformationExtraction System for the German Language

Journal of Universal Computer Science, 2018

Journal
Open Information Extraction (OIE) is the task of extracting relations fromtext without the need of domain speci c training data. Currently, most of the researchon OIE is devoted to the English language, but little or no research has been conductedon other languages including German. We tackled this problem and present GerIE, anOIE parser for the German language. Therefore we started by surveying the availableliterature on OIE with a focus on concepts, which may also apply to the Germanlanguage. Our system is built upon the output of a dependency parser, on which anumber of hand crafted rules are executed. For the evaluation we created two dedicateddatasets, one derived from news articles and one based on texts from an encyclopedia.Our system achieves F-measures of up to 0.89 for sentences that have been correctlypreprocessed.
2018

Lovric Mario, Stipaničev Draženka , Repec Siniša , Malev Olga , Klobučar Göran

Combined toxic unit: Moving towards a multipath risk assessment strategy of organic contaminants in river sediment

, 2018

Konferenz
2018

Lacic Emanuel, Traub Matthias, Duricic Tomislav, Haslauer Eva, Lex Elisabeth

Gone in 30 Days! Predictions for Car Import Planning

it - Information Technology, De Gruyter Oldenbourg, 2018

Journal
A challenge for importers in the automobile industry is adjusting to rapidly changing market demands. In this work, we describe a practical study of car import planning based on the monthly car registrations in Austria. We model the task as a data driven forecasting problem and we implement four different prediction approaches. One utilizes a seasonal ARIMA model, while the other is based on LSTM-RNN and both compared to a linear and seasonal baselines. In our experiments, we evaluate the 33 different brands by predicting the number of registrations for the next month and for the year to come.
2018

Rexha Andi, Dragoni Mauro , Federici Marco

An Unsupervised Aspect Extraction Strategy For Monitoring Real-Time Reviews Stream

Elsevier, 2018

Journal
One of the most important opinion mining research directions falls in the extraction ofpolarities referring to specific entities (aspects) contained in the analyzed texts. Thedetection of such aspects may be very critical especially when documents come fromunknown domains. Indeed, while in some contexts it is possible to train domainspecificmodels for improving the effectiveness of aspects extraction algorithms, inothers the most suitable solution is to apply unsupervised techniques by making suchalgorithms domain-independent and more efficient in a real-time environment. Moreover,an emerging need is to exploit the results of aspect-based analysis for triggeringactions based on these data. This led to the necessity of providing solutions supportingboth an effective analysis of user-generated content and an efficient and intuitive wayof visualizing collected data. In this work, we implemented an opinion monitoringservice implementing (i) a set of unsupervised strategies for aspect-based opinion miningtogether with (ii) a monitoring tool supporting users in visualizing analyzed data.The aspect extraction strategies are based on the use of an open information extractionstrategy. The effectiveness of the platform has been tested on benchmarks provided by the SemEval campaign and have been compared with the results obtained by domainad aptedtechniques.
2018

Lassnig Markus, Stabauer Petra, Breitfuß Gert, Mauthner Katrin

Geschäftsmodellinnovationen im Zeitalter von Digitalisierung und Industrie 4.0

HMD Praxis der Wirtschaftsinformatik Wirtschaftsinformatik, Stefan Meinhard, Karl-Michael Popp, Springer Fachmedien Wiesbaden, Wiesbaden, 2018

Journal
Zahlreiche Forschungsergebnisse im Bereich Geschäftsmodellinnovationenhaben gezeigt, dass über 90% aller Geschäftsmodelle der letzten50 Jahre aus einer Rekombination von bestehenden Konzepten entstanden sind.Grundsätzlich gilt das auch für digitale Geschäftsmodellinnovationen. Angesichtsder Breite potenzieller digitaler Geschäftsmodellinnovationen wollten die Autorenwissen, welche Modellmuster in der wirtschaftlichen Praxis welche Bedeutung haben.Deshalb wurde die digitale Transformation mit neuen Geschäftsmodellen ineiner empirischen Studie basierend auf qualitativen Interviews mit 68 Unternehmenuntersucht. Dabei wurden sieben geeignete Geschäftsmodellmuster identifiziert, bezüglichihres Disruptionspotenzials von evolutionär bis revolutionär klassifiziert undder Realisierungsgrad in den Unternehmen analysiert.Die stark komprimierte Conclusio lautet, dass das Thema Geschäftsmodellinnovationendurch Industrie 4.0 und digitale Transformation bei den Unternehmenangekommen ist. Es gibt jedoch sehr unterschiedliche Geschwindigkeiten in der Umsetzungund im Neuheitsgrad der Geschäftsmodellideen. Die schrittweise Weiterentwicklungvon Geschäftsmodellen (evolutionär) wird von den meisten Unternehmenbevorzugt, da hier die grundsätzliche Art und Weise des Leistungsangebots bestehenbleibt. Im Gegensatz dazu gibt es aber auch Unternehmen, die bereits radikale Änderungenvornehmen, die die gesamte Geschäftslogik betreffen. Entsprechend wird imvorliegenden Artikel ein Clustering von Geschäftsmodellinnovatoren vorgenommen – von Hesitator über Follower über Optimizer bis zu Leader in Geschäftsmodellinnovationen
2018

d'Aquin Mathie , Kowald Dominik, Fessl Angela, Thalmann Stefan, Lex Elisabeth

AFEL - Analytics for Everyday Learning

Proceedings of the International Projects Track co-located with the 27th International World Wide Web Conference, ACM, Lyon, France, 2018

Konferenz
The goal of AFEL is to develop, pilot and evaluate methods and applications, which advance informal/collective learning as it surfaces implicitly in online social environments. The project is following a multi-disciplinary, industry-driven approach to the analysis and understanding of learner data in order to personalize, accelerate and improve informal learning processes. Learning Analytics and Educational Data Mining traditionally relate to the analysis and exploration of data coming from learning environments, especially to understand learners' behaviours. However, studies have for a long time demonstrated that learning activities happen outside of formal educational platforms, also. This includes informal and collective learning usually associated, as a side effect, with other (social) environments and activities. Relying on real data from a commercially available platform, the aim of AFEL is to provide and validate the technological grounding and tools for exploiting learning analytics on such learning activities. This will be achieved in relation to cognitive models of learning and collaboration, which are necessary to the understanding of loosely defined learning processes in online social environments. Applying the skills available in the consortium to a concrete set of live, industrial online social environments, AFEL will tackle the main challenges of informal learning analytics through 1) developing the tools and techniques necessary to capture information about learning activities from (not necessarily educational) online social environments; 2) creating methods for the analysis of such informal learning data, based on combining feature engineering and visual analytics with cognitive models of learning and collaboration; and 3) demonstrating the potential of the approach in improving the understanding of informal learning, and the way it is better supported; 4) evaluate all the former items in real world large scale applications and platforms.
2018

Kowald Dominik, Seitlinger Paul , Ley Tobias , Lex Elisabeth

The Impact of Semantic Context Cues on the User Acceptance of Tag Recommendations: An Online Study

Companion Proceedings of the 27th International World Wide Web Conference, ACM, Lyon, France, 2018

Konferenz
In this paper, we present the results of an online study with the aim to shed light on the impact that semantic context cues have on the user acceptance of tag recommendations. Therefore, we conducted a work-integrated social bookmarking scenario with 17 university employees in order to compare the user acceptance of a context-aware tag recommendation algorithm called 3Layers with the user acceptance of a simple popularity-based baseline. In this scenario, we validated and verified the hypothesis that semantic context cues have a higher impact on the user acceptance of tag recommendations in a collaborative tagging setting than in an individual tagging setting. With this paper, we contribute to the sparse line of research presenting online recommendation studies.
2018

Koncar Philipp

Synthetic Dataset for Outlier Detection

Zenodo, 2018

This synthetically generated dataset can be used to evaluate outlier detection algorithms. It has 10 attributes and 1000 observations, of which 100 are labeled as outliers. Two-dimensional combinations of attributes form differently shaped clusters. Attribute 0 & Attribute 1: Two circular clusters Attribute 2 & Attribute 3: Two banana shaped clusters Attribute 4 & Attribute 5: Three point clouds Attribute 6 & Attribute 7: Two point clouds with variances Attribute 8 & Attribute 9: Three anisotropic shaped clusters. The "outlier" column states whether an observation is an outlier or not. Additionally, the .zip file contains 10 stratified randomized train test splits (70% train, 30% test).
2018

Lovric Mario

Chemical outlier dataset

Zenodo, 2018

The objects are numbered. The Y-variable are boiling points. Other features are structural features of molecules. In the outlier column the outliers are assigned with a value of 1.The data is derived from a published chemical dataset on boiling point measurements [1] and from public data [2]. Features were generated by means of the RDKit Python library [3]. The dataset was infused with known outliers (~5%) based on significant structural differences, i.e. polar and non-polar molecules. Cherqaoui D., Villemin D. Use of a Neural Network to determine the Boiling Point of Alkanes. J CHEM SOC FARADAY TRANS. 1994;90(1):97–102. https://pubchem.ncbi.nlm.nih.gov/ RDKit: Open-source cheminformatics; http://www.rdkit.org
2018

Lacic Emanuel, Kowald Dominik, Reter-Haas Markus, Slawicek Valentin, Lex Elisabeth

Beyond Accuracy Optimization: On the Value of Item Embeddings for Student Job Recommendation

In Proceedings of the International Workshop on Multi-dimensional Information Fusion for User Modeling and Personalization (IFUP'2018) co-located with the 11th ACM International Conference on Web Search and Data Mining, WSDM'2018, ACM, Los Angeles, USA, 2018

Konferenz
In this work, we address the problem of recommending jobs touniversity students. For this, we explore the impact of using itemembeddings for a content-based job recommendation system. Fur-thermore, we utilize a model from human memory theory to integratethe factors of frequency and recency of job posting interactions forcombining item embeddings. We evaluate our job recommendationsystem on a dataset of the Austrian student job portal Studo usingprediction accuracy, diversity as well as adapted novelty, which isintroduced in this work. We find that utilizing frequency and recencyof interactions with job postings for combining item embeddingsresults in a robust model with respect to accuracy and diversity, butalso provides the best adapted novelty results
2018

Breitfuß Gert, Berger Martin, Doerrzapf Linda

Innovation Milieus for Mobility – Analysis of Innovation Lab Approaches for the Establishment of Urban Mobility Labs in Austria

TRA Vienna 2018 - Transport Research Arena, 2018

Konferenz
The initiative „Urban Mobility Labs“ (UML), driven by the Austrian Ministry of Transport, Innovation and Technology, was started to support the setup of innovative and experimental environments for research, testing, implementation and transfer of mobility solutions. This should happen by incorporating the scientific community, citizens and stakeholders in politics and administration as well as other groups. The emerging structural frame shall enhance the efficiency and effectivity of the innovation process. In this paper insights and in-depth analysis of the approaches and experiences gained in the eight UML exploratory projects will be outlined. These projects were analyzed, systematized and enriched with further considerations. Furthermore, their knowledge growth as user-centered innovation environments was documented during the exploratory phase.
2018

Hasani-Mavriqi Ilire, Kowald Dominik, Helic Denis, Lex Elisabeth

Consensus Dynamics in Online Collaboration Systems

Journal of Computational Social Networks , Ding-Zhu Du and My T. Thai, Springer Open, 2018

Journal
In this paper, we study the process of opinion dynamics and consensus building inonline collaboration systems, in which users interact with each other followingtheir common interests and their social pro les. Speci cally, we are interested inhow users similarity and their social status in the community, as well as theinterplay of those two factors inuence the process of consensus dynamics. Forour study, we simulate the di usion of opinions in collaboration systems using thewell-known Naming Game model, which we extend by incorporating aninteraction mechanism based on user similarity and user social status. Weconduct our experiments on collaborative datasets extracted from the Web. Our ndings reveal that when users are guided by their similarity to other users, theprocess of consensus building in online collaboration systems is delayed. Asuitable increase of inuence of user social status on their actions can in turnfacilitate this process. In summary, our results suggest that achieving an optimalconsensus building process in collaboration systems requires an appropriatebalance between those two factors.
2018

Luzhnica Granit, Veas Eduardo Enrique

Investigating Interactions for Text Recognition using a Vibrotactile Wearable Display

ACM International Conference on Intelligent User Interfaces , Tokyo, 2018

Konferenz
Vibrotactile skin-reading uses wearable vibrotactile displays to convey dynamically generated textual information. Such wearable displays have potential to be used in a broad range of applications. Nevertheless, the reading process is passive, and users have no control over the reading flow. To compensate for such drawback, this paper investigates what kind of interactions are necessary for vibrotactile skin reading and the modalities of such interactions. An interaction concept for skin reading was designed by taking into account the reading as a process. We performed a formative study with 22 participants to assess reading behaviour in word and sentence reading using a six-channel wearable vibrotactile display. Our study shows that word based interactions in sentence reading are more often used and preferred by users compared to character-based interactions and that users prefer gesture-based interaction for skin reading. Finally, we discuss how such wearable vibrotactile displays could be extended with sensors that would enable recognition of such gesture-based interaction. This paper contributes a set of guidelines for the design of wearable haptic displays for text communication.
2018

Lovric Mario, Krebs Sarah, Cemernek David, Kern Roman

BIG DATA IN INDUSTRIAL APPLICATION

XII Meeting of Young Chemical Engineers, Zagreb, Kroatien, 2018

Konferenz
The use of big data technologies has a deep impact on today’s research (Tetko et al., 2016) and industry (Li et al., n.d.), but also on public health (Khoury and Ioannidis, 2014) and economy (Einav and Levin, 2014). These technologies are particularly important for manufacturing sites, where complex processes are coupled with large amounts of data, for example in chemical and steel industry. This data originates from sensors, processes. and quality-testing. Typical application of these technologies is related to predictive maintenance and optimisation of production processes. Media makes the term “big data” a hot buzzword without going to deep into the topic. We noted a lack in user’s understanding of the technologies and techniques behind it, making the application of such technologies challenging. In practice the data is often unstructured (Gandomi and Haider, 2015) and a lot of resources are devoted to cleaning and preparation, but also to understanding causalities and relevance among features. The latter one requires domain knowledge, making big data projects not only challenging from a technical perspective, but also from a communication perspective. Therefore, there is a need to rethink the big data concept among researchers and manufacturing experts including topics like data quality, knowledge exchange and technology required. The scope of this presentation is to present the main pitfalls in applying big data technologies amongst users from industry, explain scaling principles in big data projects, and demonstrate common challenges in an industrial big data project
2018

Lovric Mario

Molecular modeling of the quantitative structure activity relationship in Python – a tutorial (part I)

Journal of Chemists and Chemical Engineers, Croatian Society of Chemical Engineers, Zagreb, 2018

Journal
Today's data amount is significantly increasing. A strong buzzword in research nowadays is big data.Therefore the chemistry student has to be well prepared for the upcoming age where he does not only rule the laboratories but is a modeler and data scientist as well. This tutorial covers the very basics of molecular modeling and data handling with the use of Python and Jupyter Notebook. It is the first in a series aiming to cover the relevant topics in machine learning, QSAR and molecular modeling, as well as the basics of Python programming
2018

Santos Tiago, Kern Roman

Understanding semiconductor production with variational auto-encoders

European Symposium on Artificial Neural Network (ESANN) 2018, 2018

Konferenz
Semiconductor manufacturing processes critically depend on hundreds of highly complex process steps, which may cause critical deviations in the end-product.Hence, a better understanding of wafer test data patterns, which represent stress tests conducted on devices in semiconductor material slices, may lead to an improved production process.However, the shapes and types of these wafer patterns, as well as their relation to single process steps, are unknown.In a first step to address these issues, we tailor and apply a variational auto-encoder (VAE) to wafer pattern images.We find the VAE's generator allows for explorative wafer pattern analysis, andits encoder provides an effective dimensionality reduction algorithm, which, in a clustering application, performs better than several baselines such as t-SNE and yields interpretable clusters of wafer patterns.
2018

Urak Günter, Ziak Hermann, Kern Roman

Source Selection of Long Tail Sources for Federated Search in an Uncooperative Setting

SAC, 2018

Konferenz
The task of federated search is to combine results from multiple knowledge bases into a single, aggregated result list, where the items typically range from textual documents toimages. These knowledge bases are also called sources, and the process of choosing the actual subset of sources for a given query is called source selection. A scenario wherethese sources do not provide information about their content in a standardized way is called uncooperative setting. In our work we focus on knowledge bases providing long tail content, i.e., rather specialized sources offering a low number of relevant documents. These sources are often neglected in favor of more popular knowledge sources, both by today’s Web users as well as by most of the existing source selection techniques. We propose a system for source selection which i) could be utilized to automatically detect long tail knowledge bases and ii) generates aggregated search results that tend to incorporate results from these long tail sources. Starting from the current state-of-the-art we developed components that allowed to adjust the amount of contribution from long tail sources. Our evaluation is conducted on theTREC 2014 Federated WebSearch dataset. As this dataset also favors the most popular sources, systems that include many long tail knowledge bases will yield low performancemeasures. Here, we propose a system where just a few relevant long tail sources are integrated into the list of more popular knowledge bases. Additionally, we evaluated the implications of an uncooperative setting, where only minimal information of the sources is available to the federated search system. Here a severe drop in performance is observed once the share of long tail sources is higher than 40%. Our work is intended to steer the development of federated search systems that aim at increasing the diversity and coverage of the aggregated search result.
2018

Rexha Andi, Kröll Mark, Ziak Hermann, Kern Roman

Authorship Identification of Documents with High Content Similarity

Scientometrics, Wolfgang Glänzel, Springer Link, 2018

Journal
The goal of our work is inspired by the task of associating segments of text to their real authors. In this work, we focus on analyzing the way humans judge different writing styles. This analysis can help to better understand this process and to thus simulate/ mimic such behavior accordingly. Unlike the majority of the work done in this field (i.e., authorship attribution, plagiarism detection, etc.) which uses content features, we focus only on the stylometric, i.e. content-agnostic, characteristics of authors.Therefore, we conducted two pilot studies to determine, if humans can identify authorship among documents with high content similarity. The first was a quantitative experiment involving crowd-sourcing, while the second was a qualitative one executed by the authors of this paper.Both studies confirmed that this task is quite challenging.To gain a better understanding of how humans tackle such a problem, we conducted an exploratory data analysis on the results of the studies. In the first experiment, we compared the decisions against content features and stylometric features. While in the second, the evaluators described the process and the features on which their judgment was based. The findings of our detailed analysis could (i) help to improve algorithms such as automatic authorship attribution as well as plagiarism detection, (ii) assist forensic experts or linguists to create profiles of writers, (iii) support intelligence applications to analyze aggressive and threatening messages and (iv) help editor conformity by adhering to, for instance, journal specific writing style.
2018

Andrusyak Bohdan, Kugi Thomas, Kern Roman

Daily Prediction of Foreign Exchange Rates Based on the Stock Marke

Proceedings of the PEFNet 2017 conference, Jana Stávková, Mendel University Press, Brno, 2018

Konferenz
The stock and foreign exchange markets are the two fundamental financial markets in the world and play acrucial role in international business. This paper examines the possibility of predicting the foreign exchangemarket via machine learning techniques, taking the stock market into account. We compare prediction modelsbased on algorithms from the fields of shallow and deep learning. Our models of foreign exchange marketsbased on information from the stock market have been shown to be able to predict the future of foreignexchange markets with an accuracy of over 60%. This can be seen as an indicator of a strong link between thetwo markets. Our insights offer a chance of a better understanding guiding the future of market predictions.We found the accuracy depends on the time frame of the forecast and the algorithms used, where deeplearning tends to perform better for farther-reaching forecasts
Kontakt Karriere

Hiermit erkläre ich ausdrücklich meine Einwilligung zum Einsatz und zur Speicherung von Cookies. Weiter Informationen finden sich unter Datenschutzerklärung

The cookie settings on this website are set to "allow cookies" to give you the best browsing experience possible. If you continue to use this website without changing your cookie settings or you click "Accept" below then you are consenting to this.

Close