As the first step of our Social Network Discovery project, we wanted to pull down a bunch of data from some data sources where we could massage it and filter it before it went on it’s way to the next step in the process. We chose to store our data in a MySQL database and [...]
Bayesian classi?cation algorithms, the multi-class multi-kernel Relevance Vector Ma- chines (mRVMs) that have been recently proposed. Since this is a multi-author blog and this is my first contribution, I will start by introducing myself. My name is Jason Marcell. I am a CS Masters student working under the supervision of Dr. Theodoros Damoulas. My primary [...]
We have used two approaches for applying LDA model to our dataset. LingPipe: LingPipe, is a Java library which provides many of the functions required in NLP. We have used LingPipe in the following manner. We have used symmetric KL Divergence to calculate the similarity of a paper with respect to each of the seed [...]
I am using the write up done by Kiyan from here. The Problem: We possess a set of seed papers written on topics in computational sustainability, and a corpus of papers in Computer Science (derived from the DBLP database) that contains the seed papers as a subset. The problem is to develop a similarity measure between papers that [...]
This post focuses on the paper – “Non-Redundant Multi-View lustering Via Orthogonalization”, Cui, Fern and Dy. This paper focuses on getting multiple non redundant multi view clusters to effectively get potentially new views on a same dataset. Their approach is to produce views which are NOT independent from each other, rather it is to get [...]
An overview of the project and the related work in economics.