WEBIST 2014 Abstracts


Area 1 - Internet Technology

Full Papers
Paper Nr: 15
Title:

A Multitier Debugger for Web Applications

Authors:

Manuel Serrano

Abstract: Debugging Web applications is difficult because of their distributed nature but also because of the programming languages and tools commonly used to develop them. Taking benefit of the multitier aspect of the Hop programming language, we have built a new debugger for Web applications that copes with the server-side and the client-side of the executions. Its advantage over most debuggers for the Web is that it reports the full stack trace containing all the server-side and client-side frames that have conducted to an error. An error is reported on its actual position on the source code, wherever it occurs on the server or on the client. To help detecting errors as early as possible, the Hop debugger is accompanied with a debugging execution mode where types are checked before data structures are accessed, argument numbers are verified before functions are called, and array bounds are checked before vectors are accessed. Combining the debugger and the debugging mode makes errors of Web applications easier to understand and easier to localize. Hopefully they also become easier to fix.

Paper Nr: 24
Title:

Authentication and Authorisation for Widget-based Applications in a Loosely-coupled Web eLearning Environment

Authors:

Jean-Noël Colin and Minh Tien Hoang

Abstract: In this paper, we discuss the mechanisms used for authentication and authorisation of users in a widget-based web environment that integrates multiple components from multiple providers running at multiple locations. Users are typically teachers and pupils who access our platform either directly or through their usual Learning Management System (LMS). Although the technologies used themselves are not new, we believe that the approach we follow is innovative enough in their integration and use. Our approach is based on a survey ran among european teachers about their practices in terms of user credentials usage and sharing.

Paper Nr: 49
Title:

Annotated Trees and their Applications to XML Compression

Authors:

Tomasz Müldner, Jan Krzysztof Miziołek and Tyler Corbin

Abstract: Permutation based XML-conscious compressors permute the input document to improve the compression ratio and support efficiency of operations, such as queries or updates. One such compressor, XSAQCT, uses the properties of the permuted document, called an annotated tree, to these operations. This paper provides the formal background for the definition of an of D. It also provides an algorithm for creating an annotated tree for the XML document and its reverse algorithm, and discusses a measure of compressibility using an annotated tree. The theoretical and algorithm approaches are followed by the experimental results showing compressibility of annotated trees and a general analysis of semi-structured data and XML compression.

Paper Nr: 67
Title:

Filter-enabled Binary XML Dissemination in Embedded Networks

Authors:

Sebastian Käbisch and Richard Kuntschke

Abstract: Efficient data dissemination in distributed systems is a challenge that can be tackled by sharing common data and processing results among multiple queries. Doing so in an effective manner helps to save network bandwidth and computational resources. This is especially important in embedded networks where such resources are often extremely scarce. Disseminating resource-intensive XML data in embedded networks has been enabled by using binary XML technologies such as W3C’s EXI format. In this paper, we show how filter-enabled binary XML dissemination in embedded networks helps to further reduce resource demands. Thus, through the suitable placement of pre- and post-filters on binary XML data, bandwidth on network connections and computational resources on nodes can be saved. Consequently, more data can be processed with a certain amount of available resources within an embedded network.

Paper Nr: 89
Title:

QoS-Aware Parameterized Semantic Matchmaking Framework for Web Service Composition

Authors:

Salem Chakhar, Alessio Ishizaka and Ashraf Labib

Abstract: The paper presents a parameterized and highly customizable semantic matchmaking framework. The matchmaking approach on which this framework is based distinguishes three types of matching: functional attribute-level matching, functional service-level matching, and non-functional matching. The functional matching permits to eliminate web services that fail to meet the user functional requirements. The non-functional matching permits to categorize web services instances into different ordered QoS classes. A series of algorithms are advertised for the different types of matching. These algorithms are designed to support a customizable matching process that permits the user to control the matched attributes, the order in which attributes are compared, as well as the way the sufficiency is computed for all matching types.

Paper Nr: 115
Title:

Low Complexity Spatial Similarity Measure of GPS Trajectories

Authors:

Radu Mariescu-Istodor, Andrei Tabarcea, Rahim Saeidi and Pasi Fränti

Abstract: We attack the problem of trajectory similarity by approximating the trajectories using a geographical grid based on the MGRS 2D coordinate system. We propose a spatial similarity measure which is computationally feasible for big data collections. The proposed measure is based on cell matching with a similarity metric drawn from Jaccard index. We equip the proposed method with interpolation and dilation to overcome the problems missing data and different sampling frequencies when comparing two trajectories. The proposed measure is implemented online in the framework of Mopsia.

Paper Nr: 117
Title:

SocialSearch - A Social Platform for Web 2.0 Search

Authors:

Claudio Biancalana, Fabio Gasparetti, Alessandro Micarelli and Giuseppe Sansonetti

Abstract: In the last decade, social bookmarking services have gained popularity as a way of annotating and categoriz- ing a variety of different web resources. The idea behind this work is to exploit such services for enhancing traditional query expansion techniques. Specifically, the system we propose relies on three-dimensional co- occurrence matrices, where the further dimension is introduced to represent categories of terms sharing the same semantic property. Such categories, named semantic classes, are related to the folksonomy mined from social bookmarking services such as Delicious, Digg, and StumbleUpon. The paper illustrates a comparative experimental evaluation on real datasets, such as the one collected by the Open Directory Project and the TREC 2004. We also include the results of a specific disambiguation analysis aimed to evaluate the effective- ness of our approach in comparison with state-of-the-art techniques when satisfying queries characterized by polysemic and ambiguous terms.

Paper Nr: 123
Title:

An Overview of Cloud Identity Management-Models

Authors:

Bernd Zwattendorfer, Thomas Zefferer and Klaus Stranacher

Abstract: Unique identification and secure authentication are essential processes in various areas of application, e.g. in e-Government, e-Health, or e-Business. During the past years several identity management-systems and models have evolved. Many organizations and enterprises or even countries for their national eID solutions rely on identity management-systems for securing their applications. Since more and more applications are migrated into the cloud, secure identification and authentication are also vital in the cloud domain. However, cloud identity management-systems need to meet slightly different requirements than traditional identity management-systems and thus cannot be clustered into the same model types or categories. Therefore, in this paper we give an overview of different cloud identity management-models that have already emerged up to now. We further compare these models based on selected criteria, e.g. on practicability and privacy aspects.

Paper Nr: 129
Title:

Interoperability for Web Services based Smart Home Control Systems

Authors:

Hannu Järvinen and Petri Vuorimaa

Abstract: One of the problems in smart home systems today is the lack of interoperability on different levels. While applying closed, non-standard, and complex protocols can cause the problem on a lower level, the architecture and design of a common building control system can cause it on a higher level. We present a solution for enabling the interoperability on the higher level in building automation systems with XML based rules and a Web API. To ensure the interoperability, we define requirements for Web services based building automation control systems. A standard building automation guideline, oBIX, is used to provide interoperability on the low level, and adopted for the rule management and description on the high level. The resulting rule engine architecture and implementation are evaluated against the requirements. The solution provides interoperability using standard Web technologies and supports employing several control systems simultaneously.

Paper Nr: 130
Title:

An Integrated Approach for Designing and Validating REST Web Service Compositions

Authors:

Irum Rauf, Faezeh Siavashi, Dragos Truscan and Ivan Porres

Abstract: We present an integrated approach to design and validate RESTful composite web services. We use the Unified Modeling Language (UML) to specify the requirements, behavior and published resources of each web service. In our approach, a service can invoke other services and exhibit complex and timed behavior while still complying with the REST architectural style. We show how to transform service specifications into UPPAAL timed automata for verification and test generation. The service requirements are propagated to the UPPAAL timed automata during the transformation. Their reachability is verified in UPPAAL and they are used for computing coverage level during test generation. We validate our approach with a case study of a holiday booking web service.

Short Papers
Paper Nr: 7
Title:

On Metrics for Measuring Fragmentation of Federation over SPARQL Endpoints

Authors:

Nur Aini Rakhmawati, Marcel Karnstedt, Michael Hausenblas and Stefan Decker

Abstract: Processing a federated query in Linked Data is challenging because it needs to consider the number of sources, the source locations as well as heterogeneous system such as hardware, software and data structure and distribution. In this work, we investigate the relationship between the data distribution and the communication cost in a federated SPARQL query framework. We introduce the spreading factor as a dataset metric for computing the distribution of classes and properties throughout a set of data sources. To observe the relationship between the spreading factor and the communication cost, we generate 9 datasets by using several data fragmentation and allocation strategies. Our experimental results showed that the spreading factor is correlated with the communication cost between a federated engine and the SPARQL endpoints . In terms of partitioning strategies, partitioning triples based on the properties and classes can minimize the communication cost. However, such partitioning can also reduce the performance of SPARQL endpoint within the federation framework.

Paper Nr: 29
Title:

A Flexible System for a Comprehensive Analysis of Bibliographical Data

Authors:

Sahar Vahdati, Andreas Behrend, Gereon Schüller and Rainer Manthey

Abstract: Scientific literature has become easily accessible by now but a comprehensive analysis of the contents and interrelationships between research papers is often missing. Therefore, a time consuming bibliographical analysis is usually performed by scientists before they can really start their research. This manual process includes the identification of the most important research trends, major papers, auspicious approaches, established conference series as well as the search for most active groups for a specific research topic. In addition, scientists have to collect related academic literature for avoiding reinvention of already published results. Although a large number of literature management systems have been developed in order to support researchers in these tasks, the offered analysis of bibliographical data is still quite limited. In this paper, we identify some of the missing analysis features and show how they could be implemented using data about author affiliations, reference relations and additional metadata, automatically generated from a set of research articles. The resulting prototypical implementation indicates the way towards the design of a general and extendible bibliographic analysis system.

Paper Nr: 78
Title:

From Image to Audio Watermarking Using Self-Inverting Permutations

Authors:

Maria Chroni, Angelos Fylakis and Stavros D. Nikolopoulos

Abstract: The intellectual property infringement in music due to the proliferation of the internet and the ease of creating and distributing identical digital objects has brought watermarking techniques to the forefront of digital rights protection. Towards this direction, a significant number of watermarking techniques have been proposed in recent years in order to create robust and imperceptible audio watermarks. In this work we propose an audio watermarking technique which efficiently and secretly embeds information, or equivalently watermarks, into an audio digital signal. Our technique is based on the main idea of a recently proposed image watermarking technique expanding thus the digital objects that can be efficiently watermarked through the use of self-inverting permutations. More precisely, our audio watermarking technique uses the 1D representation of self-inverting permutations and utilizes marking at specific areas thanks to partial modifications of the audio's Discrete Fourier Transform (DFT); these modifications are made on the magnitude of specific frequency bands. We have evaluated the embedding and extracting algorithms by testing them on various and different in characteristics audio signals that were in WAV format and we have obtained positive results. The algorithms have been developed and tested using the mathematical software package Matlab.

Paper Nr: 96
Title:

Implicit User Profiling in News Recommender Systems

Authors:

Jon Atle Gulla, Arne Dag Fidjestøl, Xiaomeng Su and Humberto Castejon

Abstract: User profiling is an important part of content-based and hybrid recommender systems. These profiles model users’ interests and preferences and are used to assess an item’s relevance to a particular user. In the news domain it is difficult to extract explicit signals from the users about their interests, and user profiling depends on in-depth analyses of users’ reading habits. This is a challenging task, as news articles have short life spans, are unstructured, and make use of unclear and rapidly changing terminologies. This paper discusses an approach for constructing detailed user profiles on the basis of detailed observations of users’ interaction with a mobile news app. The profiles address both news categories and news entities, distinguish between long-term interests and running context, and are currently used in a real iOS mobile news recommender system that recommends news from 89 Norwegian newspapers.

Paper Nr: 107
Title:

Data Cube Computational Model with Hadoop MapReduce

Authors:

Bo Wang, Hao Gui, Mark Roantree and Martin F. O'Connor

Abstract: XML has become a widely used and well structured data format for digital document handling and message transmission. To find useful knowledge in XML data, data warehouse and OLAP applications aimed at providing supports for decision making should be developed. Apache Hadoop is an open source cloud computing framework that provides a distributed file system for large scale data processing. In this paper, we discuss an XML data cube model which offers us the complete views to observe XML data, and present a basic algorithm to implement its building process on Hadoop. To improve the efficiency, an optimized algorithm more suitable for this kind of XML data is also proposed. The experimental results given in the paper prove the effectiveness of our optimization strategies.

Paper Nr: 128
Title:

Distributed XML Processing over Multicore Servers

Authors:

Yoshiyuki Uratani, Hiroshi Koide and Dirceu Cavendish

Abstract: Nowadays, multicore CPU become popular technology to enhance services quality in Web services. This paper characterizes parallel distributed XML processing which can off-load the amount of processing at their servers to networking nodes with varying number of CPU cores. Our implemented distributed XML processing system sends XML documents from a sender node to a server node through relay nodes, which process the documents before arriving at the server. When the relay nodes are connected in tandem, the XML documents are processed in a pipelining manner. When the relay nodes are connected in parallel, the XML documents are processed in a parallel fashion. For well-formedness and grammar validation tasks, the parallel processing reveals inherent advantages compared with pipeline processing regardless of document type, number of CPU cores and processing environment. Moreover, the number of CPU cores impacts efficiency of distributed XML processing via buffer access contention.

Paper Nr: 145
Title:

Are Model-driven Techniques Used as a Means to Migrate SOA Applications to Cloud Computing?

Authors:

Miguel Botto, Javier González-Huerta and Emilio Insfran

Abstract: In recent years, cloud computing has emerged as an option to use computing resources as a solution for the global economic crisis, namely, a cheaper way to have IT resources. Thus, many companies have started to migrate their systems to cloud infrastructures, without the required support to carry out this process. In this position paper, we provide an overview of the current state of research on cloud computing migration. To understand this subject, we conducted a systematic mapping. The results suggest that research into cloud computing migration is still in its early stages. We identify research gaps and provide general recommendations about how these gaps may be addressed as well as future research directions that may have potential impact on this research field.

Paper Nr: 151
Title:

Mobile Multi-modal Learning Facilitated through Twitter - A Case Study of Twitter Phenomenology with Graduate Level Nurses

Authors:

Elizabeth M. La Rue, Lauren Terhorst and Kathleen G. McCafferty

Abstract: Adapting academic course content to a mobile world continues to evolve as technology changes. A conceptual deployment of Twitter in a graduate level nursing informatics course is discussed along with how tweeting may fit learning styles, human working memory capacity and reduce the reliance on content management systems to facilitate an online course. Mechanisms for using Twitter to deliver course content and pollinate student interaction while maintaining private individual Twitter accounts for the faculty member and students are discussed.

Posters
Paper Nr: 20
Title:

Delay-based Congestion Control Mechanism for Video Services - Mechanism including Backward Loading and Real-time Modes

Authors:

Juha Vihervaara and Pekka Loula

Abstract: Currently the congestion control of the Internet is implemented through overprovisioning and TCP protocol. Unfortunately, TCP is not completely suitable for the use of video services. However, recent studies have shown that these video services represent over half of Internet traffic, with a growing trend. There are also arguments against massive overprovisioning. Due to these factors, there is a need to equip video services with proper congestion control. Unfortunately, most of the congestion control mechanisms developed for the use of video services can only offer low priority services or TCP-friendly real-time services. This paper provides a study in which a new delay-based congestion control mechanism is presented. This mechanism can offer congestion control services for both of these service types.

Paper Nr: 21
Title:

An Approach to Transform Public Administration into SOA-based Organizations

Authors:

J. Sedeño, C. J. Torrecilla-Salinas, M. J. Escalona and M. Mejías

Abstract: Nowadays, Service-Oriented Architectures (SOA) is widely spread in private organizations. However, when transferring this knowledge to Public Administration, it is realized that it has not been transformed in terms of its legal nature into organizations capable to operate under the SOA paradigm. This fact prevents public administration bodies from offering the efficient services they have been provided by different boards of governments. A high-level framework to perform this transformation is proposed. Taking it as starting point, an instance of a SOA Target Meta-Model can be obtained by means of an iterative and incremental process based on the analysis of imperatives and focused on the particular business context of each local public administration. This paper briefly presents a practical experience consisting in applying this process to a Spanish regional public administration.

Paper Nr: 39
Title:

Implementing a Semantic Catalogue of Geospatial Data

Authors:

Helbert Arenas, Benjamin Harbelot and Christophe Cruz

Abstract: Complex spatial analysis requires the combination of heterogeneous datasets. However the identification of a dataset of interest is not a trivial task. Users need to review metadata records in order to select the most suitable datasets. We propose the implementation of a system for metadata management based on semantic web technologies. Our implementation helps the user with the selection task. In this paper, we present a CSW that uses a triplestore as its metadata repository. We implement a translator between Filter Encoding and SPARQL/GeoSPARQL in order to comply to basic OGC standards. Our results are promising however, this is a novel field with room for improvement.

Paper Nr: 42
Title:

Testing Discovered Web Services Automatically

Authors:

Pinar Karagoz and Selma Utku

Abstract: The reliability of web services is important for both users and software developers. In order to guarantee the reliability of the web services that are invoked and integrated at runtime, mechanisms for automatic testing of web services are needed. A basic issue in web service testing is to be able to generate appropriate input values for web services and to estimate whether the output obtained is proper for the functionality. In this work, we propose a method for automatic web service testing that uses semantics dependency-based and data mutation-based techniques to generate different test cases and to analyze web services. We check whether the services function properly under the input values generated and enriched from various data sources and we check robustness of web services by generating random and erronous data inputs. Experimental evaluation with real web services show that proposed mechanisms provide promising results for automatic testing of web services.

Paper Nr: 77
Title:

Evaluating Web Service QoE by Learning Logic Networks

Authors:

Natalia Kushik, Nina Yevtushenko, Ana Cavalli, Wissam Mallouli and Jeevan Pokhrel

Abstract: This paper is devoted to the problem of evaluating the quality of experience (QoE) for a given web service based on the values of service parameters (for instance, QoS indicators). Different self-learning algorithms can be used to reach this purpose. In this paper, we propose to use self-learning logic networks, called also circuits, for evaluating the QoE of web services, since modern software tools can efficiently deal with very large logic networks. As usual, for machine learning techniques, statistics are used to design the initial circuit that accepts service parameter values as inputs and produces the QoE value as an output. The circuit is self-adaptive, i.e., when a new end-user provides a feedback of the service satisfaction the circuit is resynthesized in order to behave properly (if needed). Such resynthesis (circuit learning) can be efficiently performed using a number of tools for logic synthesis and verification.

Paper Nr: 147
Title:

Designing a General Architecture for Data Interchange

Authors:

Alina Andreica, Josef Küng, Gabriela Şerban Czibulla and Christian Sacarea

Abstract: The paper describes principles for designing a general framework for automatic data interchange that scopes all three levels, data, semantic and knowledge. In spite of the huge amount of research already performed and existing standards and products, there is room to enhance information and knowledge integration. Consequent to defining the data interchange framework, we are going to apply these principles in developing and implementing a solution for academic data interchange. Such a solution has the potentiality for important advantages in academic cooperation and societal benefits.

Paper Nr: 148
Title:

Multimedia Retrieval based on Geometric Distance in Semi-structured Document

Authors:

Sana Fakhfakh, Mohamed Tmar and Walid Mahdi

Abstract: This paper is included on multimedia retrieval in XML document, whose goal is to find relevant multimedia element. In this article, we are particularly interested by studing the impact of various structural factors for image retrieval by our proposed method using a new evidence of source applied to media "image". This method consist to define a geometric distance between XML nodes. Experiments are undertaken into two data sets "INEX 2007" and "ImageCLEF 2010". The obtained results showed effectiveness of our approach.

Paper Nr: 150
Title:

Extensible Data Management Architecture for Smart Campus Applications - A Crowdsourcing based Solution

Authors:

Attila Adamkó and Lajos Kollár

Abstract: The technological advancements that have occurred during the past decade in various domains, including sensors, wireless communications, location positioning technologies and the web, allow the collection of a wide range of data. Possible sources of that data include intelligent devices (smartphones, tablets, etc.) containing various sensors, Web pages, and social networking sites. Collected data are subject to analysis (using data mining or pattern recognition approaches, for instance) and after processing new content might be inferred. This is a value-added service that can itself be used as a data source. In this paper, we use our University Campus as an example for establishing a data management architecture that integrates into a more general, extensible publish/subscribe based model of crowdsourced applications.

Paper Nr: 156
Title:

FastTriaje: A Mobile System for Victim Classification in Emergency Situations

Authors:

Alexandra Rivero-García, Candelaria Hernández-Goya, Iván Santos-González and Pino Caballero-Gil

Abstract: The high penetration of communication technologies and smartphones may help in many complex scenarios. This work presents a system to perform victim diagnosis in emergency situation and/or natural disasters. The implementation include a web platform, a web service and a mobile application. The synergy among these three elements and different communication technologies such as NFC and Wi-Fi allows to classify potential casualties in a fast, useful and reliable way. Robust cryptographic methods are used to ensure the access only to legitimate users.

Area 2 - Web Interfaces and Applications

Full Papers
Paper Nr: 17
Title:

The One Hand Wonder - A Framework for Enhancing One-handed Website Operation on Touchscreen Smartphones

Authors:

Karsten Seipp and Kate Devlin

Abstract: Operating a website with one hand on a touchscreen mobile phone remains a challenging task: solutions to adapt websites for mobile users do not address the ergonomic peculiarities of one-handed operation. We present the design and evaluation of the One Hand Wonder (OHW) – an easily-adaptable cross-platform JavaScript framework to support one-handed website navigation on touchscreen smartphones. It enhances usability without the need to redesign the existing website or to overwrite any CSS styles. User testing and quantitative evaluation confirm learnability and efficiency with clear advantages over non-enhanced browsing, and a discussion of the OHW’s versatility is given.

Paper Nr: 94
Title:

Contextinator - Project-based Management of Personal Information on the Web

Authors:

Ankit Ahuja, Ben Hanrahan and Manuel A. Pérez-Quiñones

Abstract: The web browser is a central workspace for knowledge workers, where they use cloud-based applications to access their information. While this solution fits nicely within our diverse ecosystem of devices, it may reintroduce and proliferate faults of the desktop, particularly information fragmentation. Information fragmentation is an increasingly important issue on the cloud as information is typically silo-ed within different applications. This results in users replicating storage and organization due to the lack of a unifying structure. As cloud applications become more rich, the need to investigate whether these faults of the past are still problematic becomes more important. To probe this question we created Contextinator, a tool for the web browser that assists in coordinating data for projects. Contextinator enables knowledge workers to manage cloud-based information and project artifacts in a centralized place, providing a unifying structure. In this paper, we discuss the design of our system, and the results of our mixed-method evaluation. Our findings contribute insight into the need for, and appropriateness of, projects as unifying structures for the web. Our results point to two types of projects we call ‘preparatory’ and ‘opportunistic’ based on when and why users create them.

Short Papers
Paper Nr: 25
Title:

GeoSPARQL Query Tool - A Geospatial Semantic Web Visual Query Tool

Authors:

Ralph Grove, James Wilson, Dave Kolas and Nancy Wiegand

Abstract: As geospatial data are becoming more widely used through mobile devices and location sensitive applications, the potential value of linked open geospatial data in particular has grown, and a foundation is being developed for the Semantic Geospatial Web. Protocols such as GeoSPARQL and stSPARQL extend SPARQL in order to take advantage of spatial relationships inherent in geospatial data. This paper presents GeoQuery, a graphical geospatial query tool that is based on Semantic Web technologies. GeoQuery presents a map-based user interface to geospatial search functions and geospatial operators. Rather than using a proprietary geospatial database, GeoQuery enables queries against any GeoSPARQL endpoint by translating queries expressed via its graphical user interface into GeoSPARQL queries, allowing geographic information scientists and other Web users to query linked data without knowing GeoSPARQL syntax.

Paper Nr: 79
Title:

Hypermodal - Dynamic Media Synchronization and Coordination between WebRTC Browsers

Authors:

Li Li, Wen Chen, Zhe Wang and Wu Chou

Abstract: This paper describes a Web based real-time collaboration system, Hypermodal, based on the concept of temporal linkage between resources. The system allows the users to construct, manipulate and exchange temporal linkages organized as synchronization trees. The temporal linkage is defined by RDF <sync> predicate based on a novel use of Media Fragments URI and permits on-the-fly tree updates while the resources in the tree are playing. We propose RDF <mirror> predicate and a new protocol to correlate and initialize distributed synchronization trees without requiring clock synchronization. Moreover, we develop a new REST API optimized for efficient tree updates and navigations based on super nodes. The preliminary test results on a prototype system show the approach is feasible and promising.

Paper Nr: 88
Title:

On Metrics for Measuring Fragmentation of Federation over SPARQL Endpoints

Authors:

Nur Aini Rakhmawati, Marcel Karnstedt, Michael Hausenblas and Stefan Decker

Abstract: Processing a federated query in Linked Data is challenging because it needs to consider the number of sources, the source locations as well as heterogeneous system such as hardware, software and data structure and distri- bution. In this work, we investigate the relationship between the data distribution and the communication cost in a federated SPARQL query framework. We introduce the spreading factor as a dataset metric for computing the distribution of classes and properties throughout a set of data sources. To observe the relationship between the spreading factor and the communication cost, we generate 9 datasets by using several data fragmentation and allocation strategies. Our experimental results showed that the spreading factor is correlated with the com- munication cost between a federated engine and the SPARQL endpoints . In terms of partitioning strategies, partitioning triples based on the properties and classes can minimize the communication cost. However, such partitioning can also reduce the performance of SPARQL endpoint within the federation framework.

Paper Nr: 127
Title:

A Semantic-based Data Service for Oil and Gas Engineering

Authors:

Lina Jia, Changjun Hu, Yang Li, Xin Liu, Xin Cheng, Jianjun Zhang and Junfeng Shi

Abstract: For complex data sources of oil and gas engineering, this paper summarizes characteristics and semantic relationships of oil data, and presents a semantic-based data service for oil and gas engineering (SDSOge). The domain semantic data model is constructed using ontology technology, and semantic-based data integration is achieved by ontology extraction, ontology mapping, query translation, and data cleaning. With the semantic-based data query and sharing service, users can directly access distributed and heterogeneous data sources through the global semantic data model. SDSOge has been used by upper applications, and the results show that SDSOge is efficient in providing a comprehensive and real-time data service, saving energy, and improving production.

Paper Nr: 141
Title:

Cloud Space - Web-based Smart Space with Management UI

Authors:

Anna-Liisa Mattila, Kari Systä, Jari-Pekka Voutilainen and Tommi Mikkonen

Abstract: The emergence of HTML5 allows more complex applications to be run in browsers. However, these applications need not run inside the browser only. In our previous work we have shown that it is feasible to implement mobile agents with Web technologies, such as HTML5 and JavaScript. These mobile agents can be used to control systems like home automation. In this paper we show how this execution environment can be described as a Cloud Space that provides the users with a new type of multi-device experience to the content and the environment the users need to access and control. Furthermore, we present a new way to control and monitor the Cloud Space through a web application with a 3D UI based on direct manipulation.

Paper Nr: 146
Title:

Sequential Model of User Browsing on Websites - Three Activities Defined: Scanning, Interaction and Reading

Authors:

Aneta Bartuskova and Ondrej Krejcar

Abstract: This paper presents a model of user browsing behaviour on websites. Main user activities on websites are suggested, discussed and supported by previous research. Proposed activities are then associated with three main aspects of the website - usability, aesthetics and information quality. Their role in each phase of user browsing on the website is discussed. Basic browsing model is then constructed on the basis of previous research´s conclusions, accompanied by new considerations. Model variations are taken into consideration and discussed in relevance to the mode of use.

Paper Nr: 154
Title:

Automated Usability Testing for Mobile Applications

Authors:

Wolfgang Kluth, Karl-Heinz Krempels and Christian Samsel

Abstract: In this paper we discuss the design and implementation of an automated usability evaluation method for iOS applications. In contrast to common usability testing methods, it is not explicitly necessary to involve an expert or subjects. These circumstances reduce costs, time and personnel expenditures. Professionals are replaced by the automation tool while test participants are exchanged with consumers of the launched application. Interactions of users are captured via a fully automated capturing framework which creates a record of user interactions for each session and sends them to a central server. A usability problem is defined as a sequence of interactions and pattern recognition specified by interaction design patterns is applied to find these problems. Nevertheless, it falls back to the user input for accurate results. Similar to the problem, the solution of the problem is based on the HCI design pattern. An evaluation shows the functionality of our approach compared to a traditional usability evaluation method.

Posters
Paper Nr: 36
Title:

SIWAM: Using Social Data to Semantically Assess the Difficulties in Mountain Activities

Authors:

Javier Rincón Borobia, Carlos Bobed, Angel Luis Garrido and Eduardo Mena

Abstract: In the last few years, the amount of people moving to the mountains to do several activities such as hiking, climbing or mountaineering, is steadily increasing. Not surprisingly, this has come along with a raise in the amount of accidents, which are mainly due to the inexperience of the people, and the lack of information and proper planning. Although one could expect to find appropriate updated information about this issue on the Internet, most of the information related to mountain activities is stored in personal blogs, or in Web sites that are not exploiting the possibilities that the Semantic Web and the Social Web offer regarding content generation and information processing. In this paper, we present SIWAM, a semantic framework oriented to share and evaluate the difficulties of mountain activities. It provides a thematic social network front-end to enable users to share their descriptions about their own experiences. Using text mining techniques on these descriptions, it extracts relevant facts about these experiences, which are used to evaluate the difficulty of the particular activity. The evaluation is done according to a well-established standard for evaluating the difficulty of mountain activities (MIDE), which is modeled in the system using ontologies.

Paper Nr: 59
Title:

Using Healthcare Planning Features to Drive Scientific Workflows on the Web

Authors:

Bruno S. C. M. Vilar, André Santanchè and Claudia Bauzer Medeiros

Abstract: Automated healthcare planning (care-flow) systems are usually designed to afford the dynamicity of health environments, in which changes occur constantly as a patient’s treatment progresses. This dynamic adaptation mechanism is based on blocks of activities, triggered and combined according to contextual data, producing a plan, which emerges from the interaction between these blocks and the context. However, tools that implement care-flow systems are still incipient, missing support for features like extensibility, collaboration and traceability of procedures. On the other hand, these features can be found in workflow systems that are widely used in a variety of environments (in business and scientific domains), with consolidated standards and technologies. However, workflow systems are not well suited to address the dynamicity of healthcare environments. In this paper we argue that care-flow and workflow systems have complementary characteristics and we present a software architecture that incorporates the emergent and context-driven approach of care-flow systems into workflow systems. We present a prototypical implementation validating the key concepts of our proposal, which uses an ontology representation of workflows combined with an ontology and SWRL rules.

Paper Nr: 70
Title:

Linked Data Strategy to Achieve Interoperability in Higher Education

Authors:

Guillermo García Juanes, Alioth Rodríguez Barrios, José Luis Roda García, Laura Gutiérrez Medina, Rita Díaz Adán and Pedro González Yanes

Abstract: An important challenge in centres of higher education is the use of Linked Data strategy to connect currently existing multiple information systems. These information systems are usually independent from one another, and the ability to obtain information by connecting different sources of data involves, in most cases, unacceptable costs and effort. In this work, we have developed a platform based on Linked Data that permits the interoperability of different sources of data, both internal as well as external. This interoperability is achieved by 1) the use of higher education ontologies, and 2) the use of a process that begins with the analysis of the data sources to be connected, followed by mapping of the closest ontologies, and ends with the generation and publication of data in valid formats for Linked Data. The final product permits stakeholders inside and outside the university to be able to make queries of two or more datasets in different information systems at the same time.

Paper Nr: 74
Title:

Interdependent Components for the Development of Accessible XUL Applications for Screen Reader Users

Authors:

Xabier Valencia, Myriam Arrue, Halena Rojas-Valduciel and Lourdes Moreno

Abstract: Web applications based on XUL technology have reached great development. This technology enables developers to easily create extensions and add-ons of Mozilla Firefox browser. It is essential to keep in mind accessibility in the development of such applications in order to not discriminate user groups. In this sense, standards and good practices have to be considered. Furthermore, User-Centred Design and Inclusive Design approaches should be followed as they involve users with disabilities in the development process. This paper presents an analysis of XUL accessibility guidelines created by Mozilla Foundation. An accessible XUL application has been designed and developed based on the guidelines. User testing has been conducted by two blind users revealing several important accessibility barriers. In addition, an expert review process was carried on by a blind accessibility consultant. They all used JAWS screen reader. The results obtained show that the existing guidelines conformance is not enough for ensuring accessibility of the application. There are other factors dependent on assistive technologies and user agent that have to be considered in the development of accessible XUL applications.

Paper Nr: 85
Title:

Integrating Adaptation and HCI Concepts to Support Usability in User Interfaces - A Rule-based Approach

Authors:

Luisa Fernanda Barrera, Angela Carrillo-Ramos, Leonardo Florez-Valencia, Jaime Pavlich-Mariscal and Nadia Alejandra Mejia-Molina

Abstract: A common problem in information systems development is to provide support for adaptation, to automatically adjust their services to different users and contexts. User Interfaces (UI) are required to adapt to those contexts and to satisfy specific criteria and standards to guarantee usability. Several methods have been created to ensure a degree of usability in UI. However, these methods focus mainly in the design stage of the development process. The benefits of these methods may be lost during execution time, since they do not address the necessity to dynamically adapt the interfaces both to context and users. To address this issue it is necessary to integrate User Interface Design with Adaptation, to ensure that UI usability is preserved at the execution time, for different users and contexts. This paper proposes the framework Tukuchiy, a rule-based system that dynamically generates Adaptative User Interfaces, based in HCI precepts. This guarantees their usability during execution time, while taking into account user preferences and context. This paper focused in the rule-based system of Tukuchiy. That rule system includes usability criteria commonly used for web pages, which were mapped to a desktop application.

Paper Nr: 87
Title:

Tactive, a Framework for Cross Platform Development of Tabletop Applications

Authors:

Ombretta Gaggi and Marco Regazzo

Abstract: The number and types of applications developed for multi-touch tabletops are dramatically increased in the last years, mainly due to the fact that interactive tabletops allow a more natural interaction with the user through their multi-touch interfaces. Despite many applications share a big set of common features, e.g., gestures recognition, interface orientation, etc., almost all applications implement their home made software solutions. In this paper we present Tactive, a software layer for fast development of portable applications for multi-touch interactive tabletops. Tactive allows to abstract from hardware and software equipment and to embed a web application into a application for multi-touch surfaces. Our framework supports up to five fingers gestures recognition and communication between different windows, and allows to save more than 60% of developing time.

Paper Nr: 91
Title:

Development Process and Evaluation Methods for Adaptive Hypermedia

Authors:

Martin Balík and Ivan Jelínek

Abstract: Adaptive Hypermedia address the fact that each individual user has different preferences and expectations. Hypermedia need adaptive features to provide an improved user experience. This requirement results in an increased complexity of the development process and evaluation methodology. In this article, we first discuss development methodologies used for hypermedia development in general and especially for user-adaptive hypermedia development. Second, we discuss evaluation methodologies that constitute a very important part of the development process. Finally, we propose a customized development process supported by ASF, a special framework designed to build Adaptive Hypermedia Systems.

Paper Nr: 93
Title:

CAPTCHA and Accessibility - Is This the Best We Can Do?

Authors:

Lourdes Moreno, María González and Paloma Martínez

Abstract: Web access is affected by a great amount of accessibility issues that do not allow some users to access all information presented. Therefore, Web accessibility is an important issue because everybody should access Web content independently of their access features. Among these accessibility issues, a Web content element that interferes with Web accessibility is a CAPTCHA. A CAPTCHA is a challenge-response test used to determine whether or not the user is a human instead of a computer or a robot. This type of element causes accessibility barriers especially to users with disabilities. This paper presents an overview about Web accessibility and CAPTCHA. Besides, an analysis of the accessibility barriers and a solution proposal depending on the type of disability is provided. Moreover, a survey of CAPTCHA approaches is introduced and its results are shown. With the knowledge gathered, a data discussion is provided. The lesson learned is that the CAPTCHA objective must be that security checks should be responsibility of websites or servers, that is, they cannot be delegated to the user.

Paper Nr: 100
Title:

Fuzzy-Ontology-Enrichment-based Framework for Semantic Search

Authors:

Hajer Baazaoui-Zghal and Henda Ben Ghezala

Abstract: The dominance of information retrieval on the Web makes integrating and designing ontologies for the on-line Information Retrieval Systems (IRS) an attractive research area. In addition to domain ontology, some attempts have been recently made to integrate fuzzy set theory with ontology, to provide a solution to vague and uncertain information. This paper presents a framework for semantic search based on ontology enrichment and fuzziness (FuzzOntoEnrichIR). FuzzOntoEnrichIR main components are: (1) a fuzzy information retrieval component, (2) an incremental ontology enrichment component and (3) an ontology repository component. The framework aims on the one hand to capitalize and formulate extraction-ontology rules based on a meta-ontology. On the other hand, it aims to integrate the domain ontology enrichment and the fuzzy ontology building in the IR process. The framework has been implemented and experimented to demonstrate the effectiveness and validity of the proposal.

Area 3 - Society, e-Business and e-Government

Full Papers
Paper Nr: 33
Title:

I Am off Then: Drivers of Travellers’ Intentions to Book Trips Online - An Integrated Study on Technology Acceptance and Satisfaction

Authors:

Maria Madlberger

Abstract: The tourism industry has undergone a substantial transformation since the emergence of electronic commerce. Especially travel agencies that are faced with growing online competition are increasingly dependent on achieving online sales. This study investigates antecedents of consumers’ intention to book a trip online at a travel agency’s website. The research draws on an integrated research model based on the technology acceptance model and customer satisfaction as introduced in the DeLone and McNeal model on information system success. An online survey among 292 consumers largely supports the hypothesized impact factors. Information quality serves as a significant object-based belief that influences satisfaction as an object-based attitude. In contrast, system quality has no impact on satisfaction. Satisfaction influences perceived usefulness, a key driver of online booking intention and perceived ease of use. The study provides several scholarly and managerial implications for the online distribution of tourism services.

Paper Nr: 120
Title:

TweetPos: A Tool to Study the Geographic Evolution of Twitter Topics

Authors:

Maarten Wijnants, Adam Blazejczak, Peter Quax and Wim Lamotte

Abstract: Popular Social Networking Sites (SNSs) like Twitter and Facebook are evolving into crowd-sourced, interdisciplinary sensor systems that “monitor” a wide spectrum of (physical) properties and topics. This paper introduces TweetPos, a web service that is intended to facilitate the analytical study of geographic tendencies in Twitter data feeds. To oblige the human cognitive features, the TweetPos tool maximally relies on visual data structures like heatmaps and charts to represent the geo-spatial sources of tweets. The tool compiles data bodies that grant insight in both past and present tweet posting behavior, incorporates an animation engine to highlight temporal trends, and leverages layered visualization techniques so that multiple topics can be offset against each other, all from a geographic perspective. Via the presentation of two representative use cases, we comprehensively demonstrate TweetPos’ data mining and analytical features and we illustrate the (geo-spatial) intelligence they can amount to. Thanks to a generic implementation, the TweetPos service is not geared towards a specific target audience but instead is sufficiently versatile to be valuable for a vast and varied collection of consumer profiles like social scientists and market analysts.

Short Papers
Paper Nr: 14
Title:

The Co-retweeted Network and Its Applications for Measuring the Perceived Political Polarization

Authors:

Samantha Finn, Eni Mustafaraj and Panagiotis T. Metaxas

Abstract: This paper introduces a novel network, the co-retweeted network, that is constructed as the undirected weighted graph that connects highly visible accounts who have been retweeted by members of the audience during some real-time event. Like bibliographics co-citation used to indicate that two papers treat a related subject matter, co-retweeting is used to indicate that two accounts present similar opinions in an online discussion. Thus, the co-retweeted network can be seen as a form of consulting the opinion of the crowd that is following the discussion about the similarity (or difference) of positions expressed by the highly visible accounts. When applied on political conversations related to some event, the co-retweeted network enables the measurement of the polarity of political orientation of major players (including news organizations) based on the views of the audience. It can also measure the degree of polarization of the event itself.

Paper Nr: 50
Title:

Business Process Improvement Methodology Based on Business Process Modelling - Case Study from Healthcare Sector

Authors:

Faisal Aburub

Abstract: Business process can be considered as one of the key pillars of successful e-business. One of the most important objectives of e-business is to support an organization by automating its business processes using information systems. It is important to make sure that the anticipated information systems is economically feasible and support business process of such organization effectively and sufficiently otherwise, the information systems will fail. One reason of information systems and e-business fail is because there are deficiencies in business processes of an organization. Therefore, a new methodology has been developed for business process improvement based on business process modelling particularly, Role Activity Diagramming (RAD). The methodology has been validated and tested using cancer registration process in Jordan as a case study. The key features of the methodology are that it is grounded in process modelling and brings together functional, non-functional and structural process analyses. Further research work is being undertaken to replicate this study on other application domains and thus generalize the adopted approach.

Paper Nr: 51
Title:

A Model to Specify Inter-company Cooperation

Authors:

Meriem Kermani, Mahmoud Boufaida and Emmanuel Paviot-Adet

Abstract: The impact of the evolution of the global economic ecosystem, has forced and still force again organizations to either adopt new behaviour schemas and to deeply change their structure, also to be more open to their environment as well. These impacts require a new cooperation philosophy from organizations side. Several studies have focused on the problem of the intercompany cooperation, proposing approaches that provide interoperability mechanisms. But this remains an open research domain. In this work we will propose a mediation architecture between different companies. A mediator allows us to create an intercompany cooperative process, the purpose of this solution is to keep the company architecture and ask the mediator which is a software-based agent to play an intermediary role between companies, and involve it in making the transformation between companies. We define a dynamic and cooperative inter-companies model that combines the agent technology and the decision trees paradigm. This last facilitates making decision by selecting the services that best meet customer needs, in order to create a composite service.

Paper Nr: 64
Title:

The Effectiveness of Electronic Word of Mouth on Consumers’ Perceptions of Adopting Products/Services - A Literature Review

Authors:

Ali Reza Montazemi and Hamed Qahri Saremi

Abstract: The objective of this paper is to identify factors that influence the effectiveness of electronic word of mouth (eWOM) message on the consumers’ perception of adopting products/services offered through self-service technology and brick-and-mortar channels. We draw on “social communication framework” and “Grounded Theory Literature Review Method” to systematically review the factors investigated in the extant eWOM literature to synthesize the effectiveness of eWOM message on consumers' perception of adopting products/services. This enabled us to identify several gaps in the body of knowledge in the eWOM literature that provide direction for future research to ameliorate the gaps.

Paper Nr: 95
Title:

UnderstandIT: A Community of Practice of Teachers for VET Education

Authors:

Maria De Marsico, Carla Limongelli, Filippo Sciarrone, Andrea Sterbini and Marco Temperini

Abstract: Social Networks are among the most popular achievements of Web 2.0. As a matter of fact, the Internet is full of social communities like Facebook or Twitter or others, which are used not only by teenagers but also by older users to share experiences, information and opinions. The expression Social Network allows for a wide interpretation, coming from the research in social and behavioral sciences. It also lends itself to encompass a kind of more thematically vertical communities, namely Communities of Practice, where groups of people interested to learn a particular practice are supported in achieving this goal thanks to the professional relationships with expert peers. The community network evolves with time, as novice users become experts in turn, and new communication protocols arise. We start from the consideration that most high school students nowadays use Web 2.0 instruments and tools for chatting, streaming or posting messages. An interesting challenge is the possibility for teachers the use of the same instruments and tools to enhance their daily work. To this aim, we present a first study of a Community of Practice of teachers, involved in Vocational Education and Training (VET), aiming at learning the use of Web 2.0 tools and applications to support the learning process of their pupils. In practice, we propose to use Web 2.0 to learn to teach through Web 2.0. As a platform we used the web application ELGG, that is an open source social network engine thanks to which we built a social environment where a sample of VET teachers participated to several social activities. A first evaluation of effectiveness of such an approach is encouraging.

Paper Nr: 98
Title:

Change Management in Service Oriented Virtual Organizations - A Structural Framework to Identify the Impacts and Triggers of Changes

Authors:

Waeal J. Obidallah and Bijan Raahemi

Abstract: Business processes and services of Service Oriented Virtual Organizations are subject to change to meet the internal and external requirements of the competitive, complex and rapidly changing collaborative environment they operate in. Efficient and practical change management solutions are needed to enable the partners to gain insight on the various triggers of change, and their impacts on the Virtual Organization’s operations and collaboration. This paper presents a structural change management framework to facilitate the process of change by allowing the participating partners in a Virtual Organization to identify and understand the levels and the triggers of changes in their environment. Our proposed solution consists of the structural framework including two layers, namely, the levels of changes, and triggers of changes. We present an example scenario to demonstrate how employing the proposed framework to identify, recognize and categorize various changes in the Service Oriented Virtual Organization improves the flexibility, and agility of the change management in the organizations.

Paper Nr: 105
Title:

An Investigation of Factors and Measurements for Successful e-Commerce Websites

Authors:

Xiaosong Li, Wei Lian and Abdolhossein Sarrafzadeh

Abstract: Business-to-Consumer (B2C) e-commerce is popular because of its convenience, speed and price. Although there has been intense debate about quality dimensions of e-commerce websites, more research is needed to find a well-established measurement. This empirical study identifies a set of measurements with 10 factors and their corresponding dimensions, including software development attributes based on the literature and the qualitative and quantitative data gathered from four different stakeholders. The survey results suggest that security, smooth transaction processes and smooth shopping processes are the most important concerns for online shoppers. The IS success model checking suggests that the proposed measurements are comprehensive. This work is compared with the customized ISO 9126 quality model.

Paper Nr: 108
Title:

Who Runs the Internet? - Classifying Autonomous Systems into Industries

Authors:

Annika Baumann and Benjamin Fabian

Abstract: The Internet consists of a network of Autonomous Systems (ASs). To understand which kind of organizations control those ASs can help to better assess the Internet structure in terms of economic interests and reliability. The current paper proposes a novel classification approach by combining ASspecific data with business data from the United States Securities and Exchange Commission. Furthermore, more detailed industry classes than in previous works are considered, inspired by the North American Industry Classification System (NAICS). Using our methodology on a recent data set, we were able to classify 56.69 % of the considered ASs into industries. This lays a foundation for our future work on investigating the important players of the Internet backbone as well as their economic interests and risks.

Paper Nr: 111
Title:

Organizational Maturity and Projects Performance - The Mediation of Benefits Management

Authors:

Jorge Gomes, Mário Romão, Helena Carvalho and Mário Caldeira

Abstract: Nowadays the investments made by organizations to get better business performances are crucial and difficult to obtain. So, it´s more and more important that the business-cases underline not only the short term objectives, but also the long-term benefits that promotes a sustainable future for organizations and investors. The main objective of our research is to study how investments on IS/IT influence projects performance in organizations. The rational of our model is that organizational maturity has a positive effect on projects performance with the mediation of benefits management. We emphasize that the integration between a maturity model and a benefits management approach can increase the effectiveness of strategic projects. Besides, it can also improve business confidence that the investments done match the desired maturity stages and will then collect benefits perceived as business value.

Paper Nr: 121
Title:

Technology-enabled Bullying & Adolescent Non-reporting - Breaking the Silence

Authors:

Justin Connolly and Regina Connolly

Abstract: Although early research has pointed to the fact that the successful intervention and resolution of cyberbullying incidents is to a large degree dependent on such incidents being reported to an adult caregiver, the literature consistently shows that adolescents who have been bullied tend not to inform others of their experiences. However, the reasons underlying reluctance to seek adult intervention remain undetermined. Understanding the factors that influence adolescent resistance will assist caregivers, teachers and those involved in the formulation of school anti-bullying policies in their attempts to counter the cyberbullying phenomenonre should be a space before of 12-point and after of 30-point.

Paper Nr: 133
Title:

Internet Science for Strategic Planning

Authors:

Žiga Turk

Abstract: The rapid development of information and communication technology (ICT) is perhaps the most influential driver that is fundamentally changing the world and the societies we live in. ICT is (a) changing the communication fabric that is linking the elements of societies together and is (b) automating human routine work. The latter is enabling automation and creating an abundance of food, industrial products and information. This abundance is pushing the value creation towards the creation of new knowledge and meaningful (rather than only functional) products and services. Information, knowledge and meaning are the three key commodities of the modern economy. Innovation and creativity are key processes creating these commodities. The two activities are very significantly supported by information and communication technologies. ICTs are politically acknowledged on several levels of future planning: in R&D programs, development strategies, future studies and visions. However, a scientific base for all this is lacking. The interaction among ICT, innovation and society at large is a subject of research projects such as EINS and spawning the birth of a new interdisciplinary science, Internet Science - that is studying the interaction between information technology and society.

Paper Nr: 152
Title:

Utilising Social Media Technology to Raise Brand Awareness in Higher Education

Authors:

Baomin Qi and Lindsey Mackie

Abstract: Marketing and more specifically raising brand awareness is vital for any successful organisation and its importance is demonstrated across all industry sectors. The widely adopted social media (SM) technology is believed to bring opportunities for brand awareness; and many universities worldwide have already broached the concept of using SM to communicate with potential students. However, the social media arena is deemed by many HEIs as frivolous, and the significance of SM is yet widely realised. This research will look at the power behind social media technology; the main focus is to explore whether the utilisation of SM technology could increase the brand awareness. In order to achieve this research aim exhaustive literature search was conducted and the available literature is systematically reviewed. The research found that although SM platforms have been exploited by most of institutions, there is little evidence to suggest that higher education is currently using social media effectively to engage with students, increase brand awareness and enhance image.

Posters
Paper Nr: 8
Title:

Experiences from and Attitudes towards Applying User Participation in Public e-Service Development

Authors:

Jesper Holgersson and Eva Söderström

Abstract: It is evident that user participation is a vital component for successful public e-service development. However, it is also apparent that there is little guidance in e-government research about how user participation should be implemented in practice. Some high level guidelines can be found regarding user participation design schools but there is very little guidance in existing research regarding how these design schools can be implemented in practice. In this paper we have explored public administrations’ experience of user participation, both in general systems development and in development of public e-services, in order to identify basic requirements that have to be fulfilled when implementing user participation in public e-service development. Thereafter we have applied these requirements on commonly used techniques to implement user participation in the light of three design schools: Participatory Design, User Centered Design, and User Innovation. Our results show that techniques to implement user participation in public e-service development must be adjusted to limited resources in terms of time and money as well as short development projects.

Paper Nr: 23
Title:

Analysis of LinkedIn Privacy Settings - Are they Sufficient, Insufficient or Just Unknown?

Authors:

Pilar Manzanares-Lopez, Juan Pedro Muñoz-Gea and Josemaria Malgosa-sanahuja

Abstract: Internet-based applications give users an easy way to communicate with each other on a scale and rate unseen in traditional media. Among them, the professional social networking sites (with LinkedIn as one the most widespread platforms) offer a useful way to create and maintain a professional contact network. LinkedIn is also a self-promotion tool, where employees, industries and communities get in touch. In this scenario, it seems logical to consider privacy as a fundamental subject. Controlling who can see our data may avoid that our information reaches our boss when we are looking for a job, a competitor, or even former or present work colleagues with whom we have had some conflict. This work analyzes deeply the privacy settings offered by LinkedIn, and also analyzes the privacy concerns among the users, examining how these concerns correlate to the knowledge of the privacy settings and the adequacy of their use.

Paper Nr: 27
Title:

Using PageRank for Detecting the Attraction between Participants and Topics in a Conversation

Authors:

Costin Chiru, Traian Rebedea and Adriana Erbaru

Abstract: In this paper we present a novel approach that uses the well-known PageRank algorithm for assessing multi-threaded chat conversations. As online conversations can be modelled as directed graphs, we have investigated a method for allowing a real-time analysis of the conversation using PageRank by computing the ranks of the utterances based on the explicit and implicit links available in the discussion. This model has been also extended to offer a method for computing connections between the debated topics and the chat participants and between each of the debated topics in the conversation, called the participant-topic and the topic-topic attraction. The results presented in this paper are promising, but also reflect several important differences between the existent offline analysis tools for chats and the PageRank method.

Paper Nr: 48
Title:

An Implementation-independent Evaluation Model for Server-based Signature Solutions

Authors:

Thomas Zefferer and Bernd Zwattendorfer

Abstract: During the past years, a general trend towards server-based signature solutions can be observed. Server-based signature solutions rely on a secure central server component that is able to securely store cryptographic keys and to create electronic signatures on behalf of users. Due to their various advantages compared to client-based solutions, it must be expected that server-based signature solutions will be increasingly deployed in security-critical fields of application in future. This raises the need for appropriate means to systematically evaluate the security of such solutions. Unfortunately, existing evaluation methods (e.g. Protection Profiles according to Common Criteria) are only partly applicable for evaluating server-based signature solutions. To overcome this issue, we propose a new implementation-independent evaluation model for server-based signature solutions. The proposed evaluation model is based on an abstract architectural model for server-based signature solutions and can hence be applied to arbitrary implementations. This way, we provide a powerful instrument to assess the security of future server-based signature solutions and pave the way for their adoption in security-critical fields of application.

Paper Nr: 90
Title:

Communities of Practice Supporting Research and Technological Development - A Practical Case at Electrical Sector in Mexico

Authors:

Yasmín Hernández, Salvador González-Castro and G. Arroyo-Figueroa

Abstract: The IIE is devoted to innovation, technological development and applied scientific research. In order to increase productivity and to anticipate the needs of electrical sector, IIE is promoting collaboration and sharing of knowledge, ideas and experience among researchers through communities of practice. Communities of practice enable creating and sharing of knowledge and its application in the development of research projects, realize and anticipate problems, and devised new research lines to be undertaken. To impulse communities of practice, several efforts have been conducted, such as funding, talks by experts, defining success indexes and regulations and the development of an information system based on Web technologies. The Web system supports distance collaboration and sharing of documents. This paper presents an approach to implement communities of practice.

Paper Nr: 109
Title:

Exploring the Bitcoin Network

Authors:

Annika Baumann, Benjamin Fabian and Matthias Lischke

Abstract: This explorative paper focuses on descriptive statistics and network analysis of the Bitcoin transaction graph based on recent data using graph mining algorithms. The analysis is carried out on different aggregations and subgraphs of the network. One important result concerns the relationship of network usage and exchange rate, where a strong connection could be confirmed. Moreover, there are indicators that the Bitcoin system is a “small world” network and follows a scale-free degree distribution. Furthermore, an example of how important network entities could be deanonymized is presented. Our study can serve as a starting point in investigating anonymity and economic relationships in Bitcoin on a new structural level.

Paper Nr: 125
Title:

Electronic Health Record Portal Adoption by Health Care Consumers - Proposal of a New Adoption Model

Authors:

Jorge Tavares and Tiago Oliveira

Abstract: The aim of this study is to identify a set of determinants of adoption of electronic health records (EHR) portals by health care consumers. Based on extensive literature review we suggest a new research model based on the unified theory of acceptance and use of technology in a consumer context (UTAUT2) by integrating a second order construct, Concern for Information Privacy (CFIP) framework and a moderator, chronic disability. A set of propositions is also included to test the new conceptual model. We also present a plan to validate the proposed model through empirical testing. The EHR portals are a part of the e-government strategy currently unfolding in Portugal. Understanding the acceptance and use of EHR portals by health care consumers should benefit the future sustainability of the Heath Care System, which will gain a more efficient use of resources.

Area 4 - Web Intelligence

Full Papers
Paper Nr: 6
Title:

The GENIE Project - A Semantic Pipeline for Automatic Document Categorisation

Authors:

Angel L. Garrido, Maria G. Buey, Sandra Escudero, Alvaro Peiro, Sergio Ilarri and Eduardo Mena

Abstract: Automatic text categorisation systems is a type of software that every day it is receiving more interest, due not only to its use in documentaries environments but also to its possible application to tag properly documents on the Web. Many options have been proposed to face this subject using statistical approaches, natural language processing tools, ontologies and lexical databases. Nevertheless, there have been no too many empirical evaluations comparing the influence of the different tools used to solve these problems, particularly in a multilingual environment. In this paper we propose a multi-language rule-based pipeline system for automatic document categorisation and we compare empirically the results of applying techniques that rely on statistics and supervised learning with the results of applying the same techniques but with the support of smarter tools based on language semantics and ontologies, using for this purpose several corpora of documents. GENIE is being applied to real environments, which shows the potential of the proposal.

Paper Nr: 43
Title:

Comparing Topic Models for a Movie Recommendation System

Authors:

Sonia Bergamaschi, Laura Po and Serena Sorrentino

Abstract: Recommendation systems have become successful at suggesting content that are likely to be of interest to the user, however their performance greatly suffers when little information about the users preferences are given. In this paper we propose an automated movie recommendation system based on the similarity of movie: given a target movie selected by the user, the goal of the system is to provide a list of those movies that are most similar to the target one, without knowing any user preferences. The Topic Models of Latent Semantic Allocation (LSA) and Latent Dirichlet Allocation (LDA) have been applied and extensively compared on a movie database of two hundred thousand plots. Experiments are an important part of the paper; we examined the topic models behaviour based on standard metrics and on user evaluations, we have conducted performance assessments with 30 users to compare our approach with a commercial system. The outcome was that the performance of LSA was superior to that of LDA in supporting the selection of similar plots. Even if our system does not outperform commercial systems, it does not rely on human effort, thus it can be ported to any domain where natural language descriptions exist. Since it is independent from the number of user ratings, it is able to suggest famous movies as well as old or unheard movies that are still strongly related to the content of the video the user has watched.

Paper Nr: 63
Title:

Product Feature Taxonomy Learning based on User Reviews

Authors:

Nan Tian, Yue Xu, Yuefeng Li, Ahmad Abdel-Hafez and Audun Josang

Abstract: In recent years, the Web 2.0 has provided considerable facilities for people to create, share and exchange information and ideas. Upon this, the user generated content, such as reviews, has exploded. Such data provide a rich source to exploit in order to identify the information associated with specific reviewed items. Opinion mining has been widely used to identify the significant features of items (e.g., cameras) based upon user reviews. Feature extraction is the most critical step to identify useful information from texts. Most existing approaches only find individual features about a product without revealing the structural relationships between the features which usually exist. In this paper, we propose an approach to extract features and feature relationships, represented as a tree structure called feature taxonomy, based on frequent patterns and associations between patterns derived from user reviews. The generated feature taxonomy profiles the product at multiple levels and provides more detailed information about the product. Our experiment results based on some popularly used review datasets show that our proposed approach is able to capture the product features and relations effectively.

Paper Nr: 80
Title:

Automatic Web Page Classification Using Visual Content

Authors:

António Videira and Nuno Goncalves

Abstract: There is a constantly increasing requirement for automatic classification techniques with greater classification accuracy. To automatically classify and process web pages, the current systems use the text content of those pages. However, little work has been done on using the visual content of a web page. On this account, our work is focused on performing web page classification using only their visual content. First a descriptor is constructed, by extracting different features from each page. The features used are the simple color and edge histograms, Gabor and Tamura features. Then two methods of feature selection, one based on the Chi-Square criterion, the other on the Principal Components Analysis are applied to that descriptor, to select the top discriminative attributes. Another approach involves using the Bag of Words (BoW) model to treat the SIFT local features extracted from each image as words, allowing to construct a dictionary. Then we classify web pages based on their aesthetic value, their recency and type of content. The machine learning methods used in this work are the Naive Bayes, Support Vector Machine, Decision Tree and AdaBoost. Different tests are performed to evaluate the performance of each classifier. Finally, we thus prove that the visual appearance of a web page has rich content not explored by current web crawlers based only on text content.

Paper Nr: 134
Title:

User Semantic Model for Dependent Attributes to Enhance Collaborative Filtering

Authors:

Sonia Ben Ticha, Azim Roussanaly, Anne Boyer and Khaled Bsaïes

Abstract: Recommender system provides relevant items to users from huge catalogue. Collaborative filtering and content-based filtering are the most widely used techniques in personalized recommender systems. Collaborative filtering uses only the user-ratings data to make predictions, while content-based filtering relies on semantic information of items for recommendation. Hybrid recommendation system combines the two techniques. The aim of this work is to introduce a new approach for semantically enhanced collaborative filtering. Many works have addressed this problem by proposing hybrid solutions. In this paper, we present another hybridization technique that predicts users preferences for items based on their inferred preferences for semantic information of items. For this, we design a new user semantic model by using Rocchio algorithm and we apply a latent semantic analysis to reduce the dimension of data. Applying our approach to real data, the MoviesLens 1M dataset, significant improvement can be noticed compared to usage only approach, and hybrid algorithm.

Short Papers
Paper Nr: 11
Title:

Extracting Multi-item Sequential Patterns by Wap-tree Based Approach

Authors:

Kezban Dilek Onal and Pinar Karagoz

Abstract: Sequential pattern mining constitutes a basis for solution of problems in web mining, especially in web usage mining. Research on sequence mining continues seeking faster algorithms. WAP-Tree based algorithms that emerged from the web usage mining literature have shown a remarkable performance on single-item sequence databases. In this study, we investigate the application of WAP-Tree based mining to multi-item sequential pattern mining and we present MULTI-WAP-Tree, which extends WAP-Tree for multi-item sequence databases. In addition, we propose a new algorithm MULTI-FOF-SP (MULTI-FOF-Sibling Principle) that extracts patterns on MULTI-WAP-Tree. MULTI-FOF-SP is based on the previous WAP-Tree based algorithm FOF (First Occurrence Forest) and an early pruning strategy called ”Sibling Principle” from the literature. Experimental results reveal that MULTI-FOF-SP finds patterns faster than PrefixSpan on dense multi-item sequence databases with small alphabets.

Paper Nr: 13
Title:

Handling Weighted Sequences Employing Inverted Files and Suffix Trees

Authors:

Klev Diamanti, Andreas Kanavos, Christos Makris and Thodoris Tokis

Abstract: In this paper, we address the problem of handling weighted sequences. This is by taking advantage of the inverted files machinery and targeting text processing applications, where the involved documents cannot be separated into words (such as texts representing biological sequences) or word separation is difficult and involves extra linguistic knowledge (texts in Asian languages). Besides providing a handling of weighted sequences using n-grams, we also provide a study of constructing space efficient n-gram inverted indexes. The proposed techniques combine classic straightforward n-gram indexing, with the recently proposed two-level n-gram inverted file technique. The final outcomes are new data structures for n-gram indexing, which perform better in terms of space consumption than the existing ones. Our experimental results are encouraging and depict that these techniques can surely handle n-gram indexes more space efficiently than already existing methods.

Paper Nr: 58
Title:

A Survey on Challenges and Methods in News Recommendation

Authors:

Özlem Özgöbek, Jon Atle Gulla and R. Cenk Erdur

Abstract: Recommender systems are built to provide the most proper item or information within the huge amount of data on the internet without the manual effort of the users. As a specific application domain, news recommender systems aim to give the most relevant news article recommendations to users according to their personal interests and preferences. News recommendation have specific challenges when compared to the other domains. From the technical point of view there are many different methods to build a recommender system. Thus, while general methods are used in news recommendation, researchers also need some new methods to make proper news recommendations. In this paper we present the different approaches to news recommender systems and the challenges of news recommendation.

Paper Nr: 60
Title:

Combining Learning-to-Rank with Clustering

Authors:

Efstathios Lempesis and Christos Makris

Abstract: This paper aims to combine learning-to-rank methods with an existing clustering underlying the entities to be ranked. In recent years, learning-to-rank has attracted the interest of many researchers and a large number of algorithmic approaches and methods have been published. Existing learning-to-rank methods have as goal to automatically construct a ranking model from training data. Usually, all these methods don't take into consideration the data's structure. Although there is a novel task named “Relational Ranking” which tries to make allowances for the inter-relationship between documents, it has restrictions and it is difficult to be applied in a lot of real applications. To address this problem, we create a per query clustering using state of the art algorithms from our training data. Then, we experimentally verify the effect of clustering on them.

Paper Nr: 76
Title:

A Methodology to Measure the Semantic Similarity between Words based on the Formal Concept Analysis

Authors:

Yewon Jeong, Yiyeon Yoon, Dongkyu Jeon, Youngsang Cho and Wooju Kim

Abstract: Recently, web users feel difficult to find the desired information on the internet despite a lot of useful information since it takes more time and effort to find it. In order to solve this problem, the query expansion is considered as a new alternative. It is the process of reformulating a query to improve retrieval performance in information retrieval operations. Although there are a few techniques of query expansion, synonym identification is one of them. Therefore, this paper proposes the method to measure the semantic similarity between two words by using the keyword-based web documents. The formal concept analysis and our proposed expansion algorithm are used to estimate the similarity between two words. To evaluate the performance of our method, we conducted two experiments. As the results, the average of similarity between synonym pairs is much higher than random pairs. Also, our method shows the remarkable performance in comparison with other method. Therefore, the suggested method in this paper has the contribution to find the synonym among a lot of candidate words.

Paper Nr: 106
Title:

A Recommendation System for Specifying and Achieving S.M.A.R.T. Goals

Authors:

Romain Bardiau, Magali Seguran, Aline Senart and Ana Maria Tuta Osman

Abstract: Businesses and public organizations are typically goal-oriented trying to maximize their performance. Goals are today set arbitrarily and are given without hints on how to achieve them. There are many applications that allow setting up goals and sub-goals but the process is still manual. In this paper, we present a recommendation system that helps the user specify S.M.A.R.T. goals and monitor the progress towards these goals. Given a main goal on a metric, the system recommends specific sub-goals or indicators based on the forecast of historical data. These recommended indicators are the most probable to have a higher contribution in helping the user to reach his main goal. The user can additionally monitor its progress with a visualization over time. We show how this system can be used in a business scenario for sales.

Paper Nr: 122
Title:

An Self-configuration Architecture for Web-API of Internet of Things

Authors:

Eric Bernardes Chagas Barros and Admilson de Ribamar L. Ribeiro

Abstract: The internet of things (IoT) is the paradigm that will dominate the computing world in the coming years. In this way, studies should be conducted in such way to ensure its enhancement and in the quest for that improvement is necessary to use the already existing technologies that apply to IoT. This paper's purpose is to unite different technologies like REST, cloud computing and embedded operating system in order to obtain mechanisms capable of self-configuration. Thus, it was possible to conclude that the architecture proposed would increase useful techniques for the implementation of systems that want to run the self-configuration as well as assist in setting up networks of computers that work with wireless sensors and IoT.

Paper Nr: 140
Title:

Detection of Semantic Relationships between Terms with a New Statistical Method

Authors:

Nesrine Ksentini, Mohamed Tmar and Faïez Gargouri

Abstract: Semantic relatedness between terms plays an important role in many applications, such as information retrieval, in order to disambiguate document content. This latter is generally studied among pairs of terms and is usually presented in a non-linear way. This paper presents a new statistical method for detecting relationships between terms called Least Square Mehod which defines these relations linear and between a set of terms. The evaluation of the proposed method has led to optimal results with low error rate and meaningful relationships. Experimental results show that the use of these relationships in query expansion process improves the retrieval results.

Paper Nr: 143
Title:

An Approach to Detect Polarity Variation Rules for Sentiment Analysis

Authors:

Pierluca Sangiorgi, Agnese Augello and Giovanni Pilato

Abstract: Sentiment Analysis is a discipline that aims at identifying and extract the subjectivity expressed by authors of information sources. Sentiment Analysis can be applied at different level of granularity and each of them still has open issues. In this paper we propose a completely unsupervised approach aimed at inducing a set of words patterns that change the polarity of subjective terms. This is a very important task because, while sentiment lexicons are valid tools that can be used to identify the polarity at word level, working at different level of granularity they are no longer sufficient, because of the various aspects to consider like the context, the use of negations and so on that can change the polarity of subjective terms.

Paper Nr: 153
Title:

A General Evaluation Framework for Adaptive Focused Crawlers

Authors:

Fabio Gasparetti, Alessandro Micarelli and Giuseppe Sansonetti

Abstract: Focused crawling is increasingly seen as a solution to increase the freshness and coverage of local repository of documents related to specific topics by selectively traversing paths on the web. The adaptation is a peculiar feature that makes it possible to modify the search strategies according to the particular environment, its alterations and its relationships with the given input parameters during the search. This paper introduces a general evaluation framework for adaptive focused crawlers.

Paper Nr: 155
Title:

A Domotic Ecosystem Driven by a Networked Intelligence

Authors:

Luca Ferrari, Matteo Gioia, Gian Luca Galliani and Bruno Apolloni

Abstract: We describe a diffuse control system for household appliances rooted in an Internet of Thing network empowered by a cognitive system. The key idea is that these appliances constitute an ecosystem populated by a plenty of devices with common features, yet called to satisfy in an almost repetitive way needs that may be very diversified, depending on the user preferences. This calls for a network putting them in connection and a cognitive system that is capable to interpret the user requests and translate them into instructions to be transmitted to the appliances. This in turn requires a proper architecture and efficient protocols for connecting the appliances to the network, as well as robust algorithms that concretely challenge cognitive and connectionist theories to produce the instructions ruling the appliances. We discuss both aspects from a design perspective and exhibit a mockup where connections and algorithms are implemented.

Posters
Paper Nr: 12
Title:

Improving Opinion-based Entity Ranking

Authors:

Christos Makris and Panagiotis Panagopoulos

Abstract: We examine the problem of entity ranking using opinions expressed in users' reviews. There is a massive development of opinions and reviews on the web, which includes reviews of products and services, and opinions about events and persons. For products especially, there are thousands of users' reviews, that consumers usually consult before proceeding in a purchase. In this study we are following the idea of turning the entity ranking problem into a matching preferences problem. This allows us to approach its solution using any standard information retrieval model. Building on this framework, we examine techniques which use sentiment and clustering information, and we suggest the naive consumer model. We describe the results of two sets of experiments and we show that the proposed techniques deliver interesting results.

Paper Nr: 16
Title:

XML Approximate Semantic Query based on Ontology

Authors:

Yunkai Zhu, Chunhong Zhang and Yang Ji

Abstract: More and more data is generated in XML format. How to effectively retrieve information from these data has attracted much research interest. Users have been used to keyword query without knowledge of data in advance. But XML data has additional structure than keywords. Almost all previous XML keyword queries require that user should be fully familiar with the XML structures and query syntax, which is not user-friendly and is seriously impediment to the prevalence of XML. In this paper, we propose to use natural sentence as query input because it can contain both keywords and their structure information. Query processing engine depends on NLP (Natural Language Process) technology and predefined templates to catch the query goal of user, optimally expressed as RDF (Resource Description Framework) triples. We exploit hierarchical structure relaxation based on query tree variation and vocabulary relaxation based on WordNet to relax input query. To better reflect the semantics of the query, we also use a certain domain OWL ontology constructed from XML schemas for reasoning and searching. Ontology gives us a reliable group of concepts and relations between the concepts. Ontology accurately transfers semantic information between human users and the computers. Finally we translate the RDF triples to SPARQL query sentences to retrieve RDF data.

Paper Nr: 18
Title:

Finding Domain Experts in Microblogs

Authors:

Shao Xianlei, Zhang Chunhong and Ji Yang

Abstract: As users and contents of microblogging services gain a sharp increase, it presents the challenge of finding domain experts who are of high profession but generally don’t have followers widely. To address this, we propose a domain experts finding system, which consists of three modules: data preprocessing module, user features extracting engine, experts identifying and ranking module. Firstly, we extract three kinds of features for characterizing social media authors, including user profile features, tweeting behavior features and linguistic content features which are generated by our Microblog Latent Dirichlet Allocation(Microblog Lda) model. Secondly, by casting the problem of finding domain experts as a 0-1 classification problem, we use the Gradient Boosted Decision Trees (GBDT) framework to do probabilistic classification over these features, execute a ranking procedure and yield a list of top N users for a given domain. Experimental results on actual datasets show our Microblog Lda outperforms LDA(Latent Dirichlet Allocation) and our system has a high accuracy in the task of finding domain experts in Microblogs.

Paper Nr: 26
Title:

Comparison between LSA-LDA-Lexical Chains

Authors:

Costin Chiru, Traian Rebedea and Silvia Ciotec

Abstract: This paper presents an analysis of three techniques used for similar tasks, especially related to semantics, in Natural Language Processing (NLP): Latent Semantic Analysis (LSA), Latent Dirichlet Allocation (LDA) and lexical chains. These techniques were evaluated and compared on two different corpora in order to highlight the similarities and differences between them from a semantic analysis viewpoint. The first corpus consisted of four Wikipedia articles on different topics, while the second one consisted of 35 online chat conversations between 4-12 participants debating four imposed topics (forum, chat, blog and wikis). The study focuses on finding similarities and differences between the outcomes of the three methods from a semantic analysis point of view, by computing quantitative factors such as correlations, degree of coverage of the resulting topics, etc. Using corpora from different types of discourse and quantitative factors that are task-independent allows us to prove that although LSA and LDA provide similar results, the results of lexical chaining are not very correlated with neither the ones of LSA or LDA, therefore lexical chains might be used complementary to LSA or LDA when performing semantic analysis for various NLP applications.

Paper Nr: 31
Title:

Prediction of Human Personality Traits From Annotation Activities

Authors:

Nizar Omheni, Omar Mazhoud, Anis Kalboussi and Ahmed HadjKacem

Abstract: We show how reader’s annotation activity captured during an active reading session relates to their personality, as measured by the standard Five Factor Model. For 120 volunteers having usually the habit of reading, we gather personality data and annotation practices. We examine correlations between readers personality and such features of their annotative activities such as the total number of annotation acts, average number of annotation acts, number of textual annotation acts, number of graphical annotation acts, number of referential annotation acts and number of compounding annotation acts. Our results show significant relationships between personality traits and such features of annotation practices. Then we show how multivariate regression allows prediction of the readers personalities traits given their annotation activities.

Paper Nr: 46
Title:

Towards Automatic Building of Learning Pathways

Authors:

Patrick Siehndel, Ricardo Kawase, Bernardo Pereira Nunes and Eelco Herder

Abstract: Learning material usually has a logical structure, with a beginning and an end, and lectures or sections that build upon one another. However, in informal Web-based learning this may not be the case. In this paper, we present a method for automatically calculating a tentative order in which objects should be learned based on the estimated complexity of their contents. Thus, the proposed method is based on a process that enriches textual objects with links to Wikipedia articles, which are used to calculate a complexity score for each object. We evaluated our method with two different datasets: Wikipedia articles and online learning courses. For Wikipedia data we achieved correlations between the ground truth and the predicted order of up to 0.57 while for subtopics inside the online learning courses we achieved correlations of 0.793.

Paper Nr: 61
Title:

Automated Identification of Web Queries using Search Type Patterns

Authors:

Alaa Mohasseb, Maged El-Sayed and Khaled Mahar

Abstract: The process of searching and obtaining information relevant to the information needed have become increasingly challenging. A broad range of web queries classification techniques have been proposed to help in understanding the actual intent behind a web search. In this research, we are introducing a new solution to automatically identify and classify the user's queries intent by using Search Type Patterns. Our solution takes into consideration query structure along with query terms. Experiments show that our approach has a high level of accuracy in identifying different search types.

Paper Nr: 75
Title:

A Domain Independent Double Layered Approach to Keyphrase Generation

Authors:

Dario De Nart and Carlo Tasso

Abstract: The annotation of documents and web pages with semantic metatdata is an activity that can greatly increase the accuracy of Information Retrieval and Personalization systems, but the growing amount of text data available is too large for an extensive manual process. On the other hand, automatic keyphrase generation, a complex task involving Natural Language Processing and Knowledge Engineering, can significantly support this activity. Several different strategies have been proposed over the years, but most of them require extensive training data, which are not always available, suffer high ambiguity and differences in writing style, are highly domain-specific, and often rely on a well-structured knowledge that is very hard to acquire and encode. In order to overcome these limitations, we propose in this paper an innovative domain-independent approach that consists of an unsupervised keyphrase extraction phase and a subsequent keyphrase inference phase based on loosely structured, collaborative knowledge such as Wikipedia, Wordnik, and Urban Dictionary. This double layered approach allows us to generate keyphrases that both describe and classify the text.

Paper Nr: 126
Title:

A Comparison of Three Pre-processing Methods for Improving Main Content Extraction from Hyperlink Rich Web Documents

Authors:

Moheb Ghorbani, Hadi Mohammadzadeh and Abdolreza Nazemi

Abstract: Most HTML web documents on the World Wide Web contain a lot of hyperlinks in the body of main content area and additional areas. As extraction of the main content of such hyperlink rich web documents is rather complicated, three simple and language-independent pre-processing main content extraction methods are addressed in this paper to deal with the hyperlinks for identifying the main content accurately. To evaluate and compare the presented methods, each of these three methods is combined with a prominent main content extraction method, called DANAg. The obtained results show that one of the methods delivers a higher performance in term of effectiveness in comparison with the other two suggested methods.

Area 5 - Mobile Information Systems

Full Papers
Paper Nr: 19
Title:

Cascading Information for Public Transport Assistance

Authors:

Christian Samsel, Shirley Beul-Leusmann, Maximilian Wiederhold, Karl-Heinz Krempels, Martina Ziefle and Eva-Maria Jakobs

Abstract: Over the last years, public transport has become both more prominent and more diverse. Because of the complex structure of today’s public transport networks, an electronic guidance is effectively required. Usually different transport modalities and service providers offer their own application to which the traveler has to adapt after changing between services. Additionally a current trend in mobile applications is the customization of GUI elements which leads to appealing looks but usually also to cluttered presentation of information. Both these problems cause a high cognitive stress on the traveler using the mobile application, especially while conducting other activities at the same time. Our approach to mitigate these issues is to create a mobile application applying the Gamification principle Cascading Information Theory to simplify the usage and additionally to use a back-end which allows to integrate data from various services hereby unifiying the presentation. A prototype of the application was evaluated in an initial user test for comparing our approach to the most popular mobile travel application in Germany.

Paper Nr: 86
Title:

Evaluating Impact of Cross-platform Frameworks in Energy Consumption of Mobile Applications

Authors:

Matteo Ciman and Ombretta Gaggi

Abstract: In this paper we analyze energy consumption of mobile applications using different smartphones sensors, e.g., GPS, accelerometer, etc., and features, e.g., acquiring video or audio from the environment. In particular, we have studied how the use of frameworks for mobile cross-platform development may influence the amount of required energy for the same operation. We use an hardware and software tool to measure energy consumption of the same application, using different sensors, when developed natively or using two frameworks, Titanium and PhoneGap. Our experiments have shown that frameworks have a significant impact on energy consumption which greatly increases compared to an equal native application. Moreover, the amount of consumed energy is not the same for all frameworks.

Short Papers
Paper Nr: 131
Title:

Provision of Personalized Data via Mobile Web Services in eHealth Scenarios

Authors:

Marc Jansen, Abbas Siddiqui and Oliver Koch

Abstract: In societies where the demographic change leads to a more and more unbalanced state between the elderly and all other different age groups, there, health management becomes one of the most significant problems, e.g., in order to allow what is often called successful aging. Ubiquitous use of smart mobile devices enables gathering of personalized data enriched with contextual information. This information can ideally be used in modern eHealth scenarios, resulting in mobile health (mHealth) scenarios. This paper describes how highly personalized and sensitive information, such as vital signs enriched with contextual information of a patient, can be stored at mobile devices and provided via modern web technologies for later-on analysis and health monitoring. Therefore, this paper presents a number of different scenarios in which such mobile technology provides certain benefits, discusses benefits, drawbacks and challenges of such an approach and describes an example implementation of an mHealth scenario.

Posters
Paper Nr: 116
Title:

Robots Collaboration based on Cloud Robotics System for Daily - Emergency Life

Authors:

Buribayeva Gulban and Taizo Miyachi

Abstract: A person should solve unpredictable problems and evacuate from difficult emergency dangers by himself/herself. These days many different tasks can be solved by robots. However difficult tasks need more information in serious changing environments and human minds which are shared by robots and cloud servers can achieve this greater information. We propose a robot collaboration methodology in a cloud robotics computing system that can enable a robot with few resources to solve the difficult problems in both human daily life and emergency situations and provide users with useful awareness. We discuss how robots collaborate utilizing cloud robotics capabilities in order to exchange data/contexts and to find candidate solutions and create maps for emergency evacuation. We also discuss experimental results by a prototype of robot system for daily/emergency life.