Identifying and Characterizing Communities in Social Networks

Pool, S.H.

dc.rights.license	CC-BY-NC-ND
dc.contributor.advisor	van Leeuwen, Matthijs
dc.contributor.advisor	Siebes, Arno
dc.contributor.advisor	Bonchi, Francesco
dc.contributor.author	Pool, S.H.
dc.date.accessioned	2018-07-19T17:04:30Z
dc.date.available	2018-07-19T17:04:30Z
dc.identifier.uri	https://studenttheses.uu.nl/handle/20.500.12932/29554
dc.description.abstract	Methods for detecting community structures in graphs already exist for many years. This subject is studied by physicists, sociologists and also computer scientists. Traditional methods consider only the vertices and edges, we call this graph data. In social networks much more information, in addition to the graph data is available. This can be demographical information, hobbies or any other interest people put online; we refer to this kind of data as description data. Traditional methods try to partition the vertices in groups according to a quality measure like modularity. These methods do not allow overlap; all vertices are member of exactly one group. In real social networks, communities have overlap, for example your friends and your family. Thus, a good method for finding communities in social networks should allow communities to overlap. Other interesting information can be obtained by exploiting the description data. A very useful application is identifying which elements of the description data characterize a community. In this thesis we study this problem, with the goal of finding the top-$k$ communities in a certain data set. We introduce an algorithm which alternates between two steps. The first is finding closely linked vertices on the graph side with a fast and effective hill climbing algorithm. The other is reducing the description complexity of this community. The algorithm starts with a candidate set, and the algorithm is applied on each community one by one. This allows communities to overlap with the communities found before. To evaluate our methodology, we performed experiments on real world data obtained from a number of online social networks, i.e. LastFM, Delicious, and Flickr. The results show that the proposed method identifies interesting and overlapping communities, characterized by detailed descriptions. Visualizations of both the subgraphs and the descriptions contribute to an easier interpretation and thus better understanding of the communities. At the end we are able to find cohesive communities with concise descriptions, in large data sets, within a relatively short amount of time.
dc.description.sponsorship	Utrecht University
dc.format.extent	3319883
dc.format.mimetype	application/pdf
dc.language.iso	en_US
dc.title	Identifying and Characterizing Communities in Social Networks
dc.type.content	Master Thesis
dc.rights.accessrights	Open Access
dc.subject.keywords	community detection, description, behavioral and demographic information.
dc.subject.courseuu	Applied Computing Science

Files in this item

Name:: thesis.pdf
Size:: 3.166Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Theses

Show simple item record