Académique Documents
Professionnel Documents
Culture Documents
Abstract Recommender systems mine the data on the web and are
helpful in identifying and recommending user information they might
be interested in which gives the user more personal web experience.
Social web has become the buzzword today. And even
recommender systems havent remain untouched by this social
effect. As the title suggests in this paper we propose a way to use web
browser to create a tag based profile of a user learning about his
interests which then can be used for personalised recommendations.
We also propose an architecture using web browser to collect the user
information from social networks & how this data is processed by
recommender & provide an API to use this and access this processed
data in your own recommender applications. We also explain how
this approach is advantageous over the normal recommenders
available today.
Keywords Knowledge Base, Social Tagging, Folksonomy,
Collaborative filtering, Ontology.
I. INTRODUCTION
During the last decade we have seen rise of some social sites
like 1Facebook, 2Twitter, 3Flickr, 4LinkedIn, 5Delicious which
have had a great impact on the web today. The sole reason for
this success has been the way users allow people to interact
with other people. They may know share their content,
express themselves the way they want, And while doing this
people knowingly or unknowingly leave behind the traces of
their areas of interests, likes, preferences, etc. Thus social
networks are a gold mine of information we need for
recommending things to the user.
Now traditionally most recommenders are concerned only
with information specific to a particular domain. But the
information obtained from the social profiles of a user are
multi-faced & can be used in multi-domain recommendations.
Recommendation systems used today can be broadly
classified into content- based, item based collaborative
filtering & user based collaborative filtering & hybrid
recommendations systems. Each of them is suitable in
different scenarios.
1
www.facebook.com
www.twitter.com
3
www.flickr.com
4
www.linkedin.com
5
www.delicious.com
2
ii)
iii)
In the first step the tags that are too large or too
small (depending on threshold values) are
removed. Then the tags with special characters are
converted to normal ASCII form. The stop words
like articles(a, an , the), conjunctions (like and,
but, or, yet, for, nor, so), pronouns (like he, she, it,
they) are removed from the tags.
Then the spelling mistakes in the tags are
corrected by using the Google Did you mean
Feature inspired by the approach taken in [2].
Since Google corrects the words and finds results
relevant to a word which is very close to the
misspelled word. This can be done by passing the
misspelled word to Google as a search query and
in the result Google returns with the word which is
close to the misspelled word. Generally the words
that have a very small Levenstein distance.
Many a times sites that allow users to add tags
combine the separate words into a single word by
removing spaces for ease of handling data. Like
for examle:
artificialintelligence artificial intelligence
newyork new york
sanfransisco san fransisco
iv)
Once the words are filtered the next task is to derive semantic
of the filtered information keywords to derive the concepts
and then assign the concepts to suitable ontologies.
Freebase is a collaborative KB which is in the form of
collection of collection of structured data generated from
many sources. Its data is accessible as open API, database
dump to programmers who can use it in their applications.
The main feature of Freebase that makes it useful for our
system is the fact that it uses a graph model instead of tables.
It arranges the data in the form of set of nodes or set of edges
that establish relationships between the nodes. Freebase can
be queried through Metaweb Query Language.
Our approach is to use Freebase for categorization of tags.
After the keywords have been filtered, we try to define the
category of tag by searching for an entry in Freebase. It
returns with the IDs of most relevant topics from knowledge
base. Another query to Freebase returns the category of the
item of interest. We do not have to worry about the ambiguity
of meanings because Freebase already has a field score that
gives the similarity score between the tag and topic from
Freebase. Once we have categorized all tags graph of user
interests is derived from the categorized objects.
A.
RECOMMENDATION ALGORITHM
ii)
Random Walk:
In this approach the similarity between the 2 nodes
is defined as a function of the probability of
reaching a node from other node by walking
randomly.
IV.
OVERVIEW OF ARCHITECTURE
V.
CONCLUSION
REFERENCES
[1]
[2]
[3]
[4]
[5]
[6]
[7]
[8]
[9]
[10]