Académique Documents
Professionnel Documents
Culture Documents
Abstract—We propose a new framework for social-sensor some situations or facts can be regarded as social-sensors[5].
cloud services selection based on spatio-textual correlation Therefore, social-sensors are considered as a promising and
between user’s query and service. The proposed research emerging domain for the application of crowd-sourced sen-
defines a formal social-sensor cloud service model that abstracts
the functional and non-functional aspects of social-sensor data sors. Similarly, social-sensor clouds are cloud-based social
on the cloud in terms of spatio-temporal, textual and quality of networks (e.g., facebook, twitter, google+ etc.) also known
service parameters. Proposed framework is a 4-stage filtering as social-clouds[3], hosting and processing data from the
algorithm, to select social-sensor cloud services based on user social-sensors.
query and quality of service demands. 4-stage filtering is based The challenge here is the efficient and real-time delivery
on spatial correlation, textual correlation, visual features and
quality of service parameters. Analytical results are presented of social-sensor data, i.e., data shared by social-sensors,
to show the performance of the proposed approach. to the end users as requested, without exposing the com-
plexities of data collection and management. To overcome
Keywords-Social-Sensor; Social-Sensor Cloud; Social-Sensor
Cloud Service this challenge, the proposed research uses Service Oriented
Architecture (SOA), i.e., sensor-data-as-a-service[6] for dif-
ferent clients over a Social-Sensor Cloud. Transforming
I. I NTRODUCTION
the social-sensor data as a function of services provides
Rapid advancement of online social media, such as Twit- improved information flow and flexible organization of data.
ter, Instagram, Flicker and Facebook etc., in conjunction Social-sensor data is generated from multiple sources in
with devices like smartphones or tablet, is providing unique multiple formats. SOA abstracts Social-sensor data seam-
opportunities for addressing major economic, scientific and lessly as a service without exposing internal data structure
social challenges. The user-base of multiple social networks and how data can be accessed. This results in a uniform
is getting wider and more active in producing content about and ubiquitous delivery of social-sensor data as a service,
real world events, almost in real-time [1]. Posts related making it easy to access and reuse in multiple applications
to public events, especially multimedia content along with over different platforms, thus reducing the complexity of
quality and coverage parameters, may contain critical in- social-sensor data collection. The functionality of social-
formation that describes a situation [2]. This dramatically sensor data (e.g., time and location) is abstracted as a
increases the value of the information generated by social services and qualitative features (e.g., price, trust, coverage)
networks.Thus, social media provides a convenient platform are abstracted as non-functional properties of service. So
to collect various types of context sensitive information if users require a slightly different data structure or have
related to almost real-time events. Monitoring such events or specific functional or non-functional requirements, the im-
scenarios over social streams can assists concerned officials plementation and access is easy because the changes are
in analyzing an unfolding situation, such as in crisis man- minimal. SOA breaks down the service into independent
agement, urban management and scene analysis. Utilizing service(s), namely atomic services, making integration with
these streams available over social networks (also called other services easy. Therefore, social-sensor data abstracted
social clouds[3]) can significantly facilitate the task of scene as a service is easy to manage because of ease in integration
detection and aid in comprehending evolving situations as and collaboration of atomic services. Other benefits include
per the user requests[4]. These requests have both func- higher availability, better scalability, dynamic deployment
tional and non-functional constraints. Functional constraints and greater testability. Therefore, social-sensor cloud service
include (but not limited to) time, location, mentions etc., and is the solution for efficient transformation of social-sensors
non-functional constraints refer to the desired quality, i.e., data stored over social-sensor clouds into actionable infor-
data quality, resolution, availability etc. mation.
The users contributing their individual ’data’ that express As depicted in Fig.1, the Social-Sensor Cloud Service
509
[14]. This framework fully exploits the information of social a similar kind. It is assumed that the surveillance of the
data over multiple dimensions and the similarity between road segment through the conventional sensors is minimal.
two messages is measured by the distance between their In such case scene reconstruction may be performed in
distributions. traditional way by a range of experts, including forensic
Most of these approaches are data centric, built upon data scientists, forensic engineers or health and safety advisers,
mining and analysis techniques. This require considerable etc. However, evidence or witnesses available can be quite
amount of expertise and time. Moreover, transition of tradi- limited in their value and reliability [1]. The urban man-
tional cloud systems to the SOA-based sensor-cloud raises agement (i.e., the user) might require specific direction and
need to consider spatio-temporal aspects of sensor data with information context that is not available through conven-
better performance and faster access to new services. Thus, tional means. Moreover, the user might have various quality
using SOA and social-sensors for scene analysis is far better requirements too (e.g., resolution, colour etc). The advent of
than using image processing over the batch of images or smart-phones provides unique opportunities to address this
traditional cloud computing to build the scene. problem.
The wide deployment and availability of smart-phones
B. Crowd-sourcing and Sensing-as-a-Service users and their connectivity with social networks might pro-
Crowd-sourcing and Sensing-as-a-Service is a large-scale vide extra visual coverage by either sharing images or posts.
sensing paradigm based on the power of IoT devices, includ- The idea is to leverage this freely available information
ing smart phones, smart vehicles and wearable devices etc. over social network clouds to help investigators to analyse
[5][13]. This allows the increasing number of mobile phone the accidents scene. The process of providing investigators
users to share local knowledge (e.g., local information, extra visual coverage can be considered as service selection
event coverage, and traffic conditions) acquired by their problem. The commuters on the road can be regarded as
sensor-enhanced devices and the information can be further social-sensors sharing their image data over social-sensor
aggregated in the cloud for large-scale sensing [16]. The clouds, i.e., social networks. Using social media image
mobility of large-scale mobile users makes sensing-as-a- (social-sensor data) as-a-service, i.e., social-sensor cloud
service a versatile platform that can often replace static service, can help to fulfil the urban management’s need
sensing infrastructures [18]. A broad range of applications of required coverage and direction. Social-sensor cloud
are thus enabled, including traffic planning [5], environment services have multiple functional and non-functional prop-
monitoring [20], mobile social recommendation [25], public erties involved, adding value to unstructured social-sensors
safety [26], and so on. data. The functional properties of such services include
(but not limited to) visual, spatio-temporal, textual and
C. Service selection and composition context information of the image. If the temporal, spatial
Service Selection and Composition is one of the and context information of the accident event is provided,
hottest research problems in service-oriented computing it will be very helpful for detecting and building the scene
[6][17][19][15]. The service composition has been applied using images or videos available through social-sensor cloud
in a number of domains including scene analysis and visual services. Moreover, users might demand different sets of
surveillance [19]. The service composition problem can be non-functional properties for different query sets (e.g., image
categorized into two areas. The first area focuses on the quality, trust in service, price of service etc). The challenge
functional composability among component services. The is to make sure these services are in the same information
second area aims to do optimal service composition based context, i.e., covering same event or segment of area required
on non-functional properties (QoS). In [17], service compo- by the user. Visual information context is essential to make
sition from media service perspective has been discussed. sure that images belong to same scene and determining the
Authors of [6] and [15] propose a composition approach for information and location context of a social-sensor cloud
Sensor-Cloud and crowd sourced services based on dynamic service is a task of particular interest.
features such as spatio-temporal aspects. Algorithms are The aim is to select the social-sensor cloud services
presented in both papers to support the proposed approaches. which are in the same spatial and visual context based on
Analytical and simulation results are presented to show the the functional properties of the service as required by the
performance of the proposed. However, Social-sensor cloud user and users quality of service requirements (i.e., non-
service selection using functional/non-functional attributes functional requirements). As shown in Fig. 2, proposed
through social-sensor is yet to be explored. solution is to filter and select the best available Social-Sensor
Cloud Service, according to the users requirement, from the
III. M OTIVATING S CENARIO pool of Social-Sensor Cloud Services available over Social-
Lets assume that scene reconstruction of the accident is Sensor Cloud. This is done on the basis of spatial correlation,
required by urban management to determine the cause(s) or textual correlation, visual features of image and quality of
aftermath(s) of the accident to prevent further incidents of service parameters specified by user.
510
filtered services in S and S”. Finally, user defined quality
parameters (e.g., direction reference DRef = 60o from True
North) are considered in selection of best suited services.
The threshold values of QoS parameters are adjusted by
user of the service at time of query generation. The selected
services can assist in building a visual summary of a required
scene in given space and time.
511
from an Atomic Service. The following propose the minimal along with the quality measures of service are used to
functional requirements associated with an atomic service: select the best available service(s). The selection approach
• T is time of the service at which the image is taken is implemented in following steps:
• M is special mentions and keywords, providing addi-
tional information regarding image.
• L x,y is service location where x,y is longitude
and latitude position of the service
• VisF is a set of key points and associated visual features
of image extracted through image processing service.
C. Quality Model of an Atomic Social-sensor Cloud Service
Discovering and selecting the best available service sat-
isfying user’s requirements is an important challenge. For
this some baseline QoS attributes for Social-sensor Cloud
services are introduced. The proposed QoS criteria of an
atomic services includes:
• Qserv is a tuple Q1 , Q2 ... Qn , where each Qi denotes
a Quality of service (QoS) of Serv. The QoS criteria
include:
– Pricing is the price of the service, i.e., does the
service need any sort of financial incentive for Fig. 3: Social-Sensor Cloud Service Selection Framework
service provider or not
– Direction Reference is the direction of the service A. Service filtering with respect to spatial correlation
coverage, e.g., 60o from True North. The first step is to reduce the search space (Algorithm 1).
– Resolution is the minimum requirement of image It is assumed that all available services are associated to a
resolution to be provided by service two-dimensional geo-tagged location. For the effective area
– Colour Quality is image’s definition, i.e., grey of the query q, we define a circular event area AREAq of
scale or high definition radius r that encloses a set of services relevant to q. The ser-
vices outside this area are assumed to have little probability
V. Q O S- AWARE S OCIAL - SENSOR C LOUD S ERVICE
of being relevant to the query. Haversine formula [22] is used
S ELECTION
to calculate the distance between service location and query
We propose a framework to filter and localise the best location. The Haversine formula is an equation in navigation,
available Social-sensor Cloud Service according to the user’s giving great-circle distances between two points on a sphere
query. The query q can be defined as a tuple q l,d,Q (i.e., earth) from their longitudes and latitudes. For a service
, giving the location, description and quality parameters Serv < lon1 , lat1 > and query q.l < lon2 , lat2 >, formulas
of the required service(s). The entry lx,y is the two- used for distance calculation are [22]:
dimensional geographical space, i.e., decimal longitude and
latitude position, while d is a phrase describing the query a = (sin(Δlat /2))2 + cos(lat1 ) ∗ cos(lat2 ) ∗ (sin(Δlon /2))2
(e.g., Melbourne Central). Q is a set of non-functional √ √ (1)
attributes direction reference DRef, resolution R, price P and c = 2 ∗ arctan 2( a, 1 − a) (2)
colour quality CQ.
dist = Radiusearth ∗ c
Fig. 3 shows the proposed selection framework for social-
sensor cloud services. The aim of our approach is to find Where, Δlon = lon2 − lon1 , Δlat = lat2 − lat1 and dist
the available services that match with users’ requirements is the distance between service and query. If the distance is
by constructing the information and location context of the less than r it is deduced that the service and the query are
service with the functional and non-functional properties spatially correlated.
of the service. Initially we reduce the search space using
the user’s query location and textual description. For any B. Service filtering with respect to textual correlation
location of query q, all services available in the area of radius To improve the efficiency of the proposed approach,
r are selected. Further, textual correlation, i.e., similarity textual correlation is also considered. It might happen that
between d and M, of nearby services outside radius r is the service does not lie spatially in the query area AREAq ,
considered. Next, the location context of all filtered services but has textual relation with query q. For example, the
is extracted using their VisF, by mapping on the visual service is located outside the radius r of the query area but
features of available map(s). Finally, this extracted context special mention M of the service is related to description
512
Algorithm 1 Spatial Correlation Filtering Algorithm 2 Textual Correlation Filtering
Input: S is the collection of services - each service is Input: S is the collection of services - each service is
associated with its geo-tagged location < x, y >; a query q associated with its geo-tagged location < x, y > and
with location l < x, y >; a query area’s radius threshold r mention M ; a query q with location l < x, y > and textual
Output: The set S of all services which lie in query area description d; a query area’s radius threshold r ; textual
such that distance (Serv < x, y > to q.l < x, y >) ≤ r similarity threshold θ
1: for Each Serv in S do Output: The set S” of all spatio-textually correlated services
2: Calculate the dist between serv and q.l using which lie in query area such that distance disg, q.l < x, y >
Haversine formula to Serv < x, y >, is r ≤ dist ≤ r and d is similar to M
3: if dist ≤ r then 1: for Each Serv in S do
4: Serv ∈ S 2: Calculate the dist between Serv < x, y > and q.l <
x, y > using Haversine formula
3: if r ≤ dist ≤ r then
d of the query. In such cases textual correlation in terms 4: Serv ∈ S”temp S”temp is temporary set to
of spatio-textual similarity is used for service filtration. Fig. hold filtered services
4. illustrates the spatio-textual correlation between query q 5: for Each Serv in S”temp do
and services. Although services Serv5 , Serv7 , and Serv8 do 6: Calculate θ between Serv.M and q.d using LIN
not lie in the query area but are spatially close to query q. 7: if θ ≥ θ then
Thus these services are further filtered for textual correlation. 8: Serv ∈ S”
Service filtering with respect to textual correlation shows
that services Serv7 and Serv8 are textually relevant to query
q. of the intersection of the two concepts. The relatedness
score is between 0.0 (the lowest) to 1.0 (the highest).
For implementation Java based library, WS4J (WordNet
Similarity for Java) is used. The use of this library is
defined in an on-line documentation 1 . We have used θ to
define relatedlin (q.d, Serv.M ). Higher value of θ shows
higher textual correlation. Algorithm 2 shows the textual
correlation filtering.
513
Algorithm 3 Service Localisation using VisF and quality features, i.e., colour quality and resolution,
Input: S and S” is the set of spatially and textually and SIFT features as QoS property values. For the image
correlated services - each service is associated with its VisF; features similarity, we found that using a minimum similarity
SIFT features I.F of structured 360o image dataset I, of event threshold of 50% usually gives best results, i.e., VisF and
area AREAq ; Visual dissimilarity threshold α I.F need to be at least 50% match. For the simplicity of
Output: The set SetServ of localised services related to our experiments we decided to keep this threshold fixed. In
query.
Algorithm 4 QoS-Aware Service Selection
1: for Each Serv in S ∪ S” do
N o.of mKP Input: Q , i.e., quality requirements of user and SetServ
2: SimI = ( T otalN o.of KP ) ∗ 100 is the set of localised services related to query
3: if SimI ≥ α then
Output: The set SelServ of selected services related to
4: Serv ∈ SetServ
query.
1: for Each Serv in SetServ do
2: if Serv.Q ≥ Q for each parameter of quality then
and location for each feature, thus providing invariance
P,CQ,R are separately
to these transformations.
3: Serv ∈ SelServ compared. If all true then
4) Keypoint descriptors generation, i.e., descriptors com-
condition satisfies
prising of the local image gradients are generated.
These gradients are measured at the selected scale in
the region around each keypoint. addition, QoS parameter of price is manually assigned to
all services. A set of experiments are conducted to evaluate
To localise the filtered service set S and S” we have used
the significance of our proposed filtering steps and selection
a 360o structured image dataset, I, of event area AREAq .
approach in terms of precision and recall. The proposed
The image set extracted from Google Map Street View is an
algorithms are implemented in Java and Matlab. All the
example of such dataset. Localisation is done by similarity
experiments are conducted on a Windows 7 desktop with
analysis between SIFT features of image set I and VisF of
a 3.60 GHZ Intel Core i7 processor and 8 GB RAM.
services in S’ and S”, as shown in Algorithm 3. This is
We have conducted two set of experiments to evaluate
done by individually comparing each feature from VisF of
the effectiveness of the proposed approach. To evaluate the
services in S’ and S” to features of image set I and finding
service filtering w.r.t. spatial correlation, various values of
candidate matching features based on Euclidean distance of
r are experimented. The minimum unit of r is 7 m., i.e.,
their feature vectors. To condense the matching keypoints
average least measurable increment in decimal longitude and
into a scalar quantity the percentage of keypoints that
latitude values. Units of r are further increased in multiple
match are calculated, i.e., the number of matching keypoints
of 7m. For textual correlation, the value of r’ is set with
(Number of mKP ) divided by the total number of keypoints
7m increment for respective r. In QoS-aware selection, for
(Total number of KP ) for each image. Further, services are
simplicity purpose, in this study we are using only basic QoS
localised and selected if the percentage of similarity, i.e.,
parameters, i.e., price P, color quality CQ and resolution R.
SimI is above the set threshold α.
The threshold values of these parameters, i.e., Q’(P’,CQ’,R’)
D. QoS-Aware Service Selection are adjusted by user of the service. For the experiments
In the final stage of service selection (Algorithm 4), user we generated 7 different queries based on locations in our
defined quality parameters are considered in selection of best dataset. In these experiments we evaluate spatial correlation,
suited services. The threshold values of these parameters are i.e., single filtering and proposed approach of spatio-textual
adjusted by user of the service at time of query generation. correlation. Precision, recall and f-measure matrices are used
for evaluating the experiments. They are computed by:
VI. E XPERIMENTS AND R ESULTS N o.of releventselectedservices
A set of experiments is conducted for evaluation and to P recisionqueryresult =
N o.of selectedservices
analyse and investigate the contribution of our proposed (4)
N o.of releventselectedservices
framework . To the best of our knowledge, there is no real Recallqueryresult =
spatio-temporal service test case to evaluate our approach. N o.of releventservices
(5)
Therefore, we focus on evaluating the proposed approach 2 · precision · recall
real dataset. The set is a collection of 200 GPS tagged F − score = (6)
(precision + recall)
user uploaded images downloaded from social networks
(flicker, twitter, google images, etc.). To create the services We have investigated that how precision and recall of the
based on images, we have extracted its geo-tagged location, query result varies by applying the textual and spatial filters.
For spatial correlation, we have conducted the experi-
special mentions and tags as its textual description and time
ments with the values of r respectively in 7m, 14m and
when image was captured as the functional property values,
514
21m. In this part of experiment we have ignored textual [2] B. Resch, ”People as sensors and collective sensing-contextual
correlation. For each r, we have obtained the total numbers observations complementing geo-sensor network measure-
of services, relevant services and relevant selected services. ments.” In PLBS 2013, pp. 391-406.
[3] Chard, Kyle, S. Caton, O. Rana, and K. Bubendorfer. ”Social
Similarly, for spatio-textual correlation, we conducted the cloud: Cloud computing in social networks.” in IEEE 3rd ICCC
experiments with r’ in conjunction with spatial correlation. 2010, pp. 99−106
For textual similarity, using previous research as reference [4] M. Schinas, S. Papadopoulos, Y. Kompatsiaris, and P. Mitkas,
”Visual event summarization on social media using topic
we have set the value of threshold θ = 0.5 [24]. We con-
modelling and graph-based ranking algorithms”, In Proc. of
ducted these experiments with three pairs of r and r’. Values 5th ACM on ICMR 2015
used for (r,r’) are (7,14), (14,21) and (21,28). For each pair [5] C. Aggarwal and T. Abdelzaher, ”Social sensing”, In Managing
of (r,r’), we have obtained the total number of services and and mining sensor data, Springer US, 2013, pp. 237−297
[6] A. G. Neiat, A. Bouguettaya, T. Sellis and Z. Ye, ”Spatio-
relevant selected services. The results show that for the both temporal composition of sensor cloud services”, in ICWS 2014
approaches r=7 gives gives optimal performance. Moreover, [7] W. Wang, S. De, G. Cassar and K. Moessner, ”An experimental
the proposed approach of spatio-textual correlation shows study on geospatial indexing for sensor service discovery.” In
best performance with the highest f-score for all 3 pairs of Expert Systems with Applications, 2015, pp. 3528−3538
[8] Socialsensors.com.sg, ”Social Sensors - Sensing real-
(r,r’). Therefore, overall the proposed approach of spatio- world activities from Social Media” online available at:
textual correlation show better performance than the single http://socialsensors.com.sg/ [Accessed 10 Dec. 2015]
filter, i.e., spatial correlation. The results are depicted in Fig. [9] J. Chae, D. Thom, H. Bosch, Y. Jang, R. Maciejewski, D. S.
5(a),(b) and (c). Ebert and T. Ertl, ”Spatiotemporal social media analytics for
abnormal event detection and examination using seasonal-trend
decomposition”, in VAST 2012
[10] X. Dong, D. Mavroeidis, F. Calabrese and P. Frossard, ”Mul-
tiscale Event Detection in Social Media”, in DMKD 2014
[11] X. Dong, D. Mavroeidis, F. Calabrese and P. Frossard, ”Effi-
cient Location-Based Event Detection in Social Text Streams”
in ISaBDE, Springer 2015, pp 213−222
[12] X. Zhou and L. Chen, ”Event detection over twitter social
media streams”, in The VLDB Journal 2014, pp 381−400
[13] S. Elers ”Online investigation: Using the internet for inves-
tigative policing practice.” in Australasian Policing 6,(2014)
[14] G. Schindler, M. Brown and R. Szeliski, ”City-scale location
recognition”, In CVPR 2007, pp. 17
[15] A. G. Neiat, A. Bouguettaya, T. Sellis. Spatio-Temporal
Composition of Crowdsourced Services. In Service-Oriented
Computing, Springer Berlin Heidelberg, 2015, pp 373−382.
Fig. 5 - Spatial Correlation Vs. Spatio-Textual correlation [16] B. Guo, et al. ”From participatory sensing to mobile crowd
sensing.” In PERCOM Workshops, IEEE, 2014.
VII. C ONCLUSION [17] J. Fernndez, et al. ”An intelligent surveillance platform for
large metropolitan areas with dense sensor deployment.” In
In conclusion, this paper proposes a social-sensor cloud Sensors 13.6 (2013), pp 7414−7442.
service selection framework based on spatio-textual corre- [18] B. Guo, et al. ”Building human-machine intelligence in
lation, visual features and QoS parameters. The proposed mobile crowd sensing.” In IT Professional (2015), pp 46−52.
[19] Balke, Wolf-Tilo, J. Diederich, ”A quality-and cost-based
framework evaluates service selection in terms of spatial and selection model for multimedia service composition in mobile
spatio-textual correlation. Experimental results show that environments.” In ICWS06
our approach is better than the single filtering approach. [20] C. Perera, Z. Arkady, C. Peter, and G. Dimitrios, ”Sensing
In future, we plan to focus on social-sensor clouds services as a service model for smart cities supported by internet of
things.” In Tran.on ETT 25, no. 1 (2014), pp 81−93.
indexing for fast service discovery and composition for scene [21] D.G. Lowe, ”Distinctive image features from scale-invariant
building and event analysis. key points”, Inter. Jour. of Computer Vision 2004, pp 91-110
[22] G. Van Brummelen, ”Heavenly Mathematics: The Forgotten
ACKNOWLEDGEMENT Art of Spherical Trigonometry”, Princeton Uni. Press. 2013
[23] S. Patwardhan, S. Banerjee and T. Pedersen, ”Using measures
This publication was made possible by NPRP grant 9-224- of semantic relatedness for word sense disambiguation”, In
1-049 from the Qatar National Research Fund (a member of Computational Linguistics & Intelligent Text Processing 2003
Qatar Foundation) and DP160103595 from the Australian [24] R. Mihalcea, C. Corley and C. Strapparava, ”Corpus-based
Research Council. The findings achieved herein are solely and knowledge-based measures of text semantic similarity”,
In AAAI 2006, Vol. 6
the responsibility of the authors. [25] U. Gretzel, S. Marianna, X. Zheng, and K. Chulmo, ”Smart
tourism: foundations and developments.” In Electronic Markets
R EFERENCES 25, no. 3 (2015), pp 179−188.
[26] Kantarci, Burak, and M. Hussein, ”Trustworthy crowdsourc-
[1] A. Rosi, M. Mamei, F. Zambonelli, S Dobson, G. Stevenson
ing via mobile social networks.” In GLOBECOM, 2014 IEEE,
and J. Ye, ”Social sensors and pervasive services: Approaches
pp 2905−2910
and perspectives”, in PERCOM 2011
515