<?xml version="1.0" encoding="UTF-8"?>

<rdf:RDF
   xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
   xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
   xmlns="http://purl.org/rss/1.0/"
   xmlns:dc="http://purl.org/dc/elements/1.1/"
   xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/"
   xmlns:dcterms="http://purl.org/dc/terms/"

>
<channel rdf:about="http://www.citeulike.org/about">
<pubDate>Sat, 26 Jul 2008 04:21:39 BST</pubDate>


	<title>CiteULike: Group: Adaptive-Web - with tag classification</title>
	<description>CiteULike: Group: Adaptive-Web - with tag classification</description>


	<link>http://www.citeulike.org/group/2118/tag/classification</link>
	<dc:publisher>CiteULike.org</dc:publisher>
	<dc:language>en-gb</dc:language>
	<dc:rights>Copyright &#169; 2004-2008 citeulike.org</dc:rights>
	<items>
    <rdf:Seq>
        <rdf:li rdf:resource="http://www.citeulike.org/group/2118/article/1137196"/>
        <rdf:li rdf:resource="http://www.citeulike.org/group/2118/article/777419"/>
        <rdf:li rdf:resource="http://www.citeulike.org/group/2118/article/691964"/>

	</rdf:Seq>
	</items>
	</channel>


<item rdf:about="http://www.citeulike.org/group/2118/article/1137196">
    <title>PEBL: Web Page Classification without Negative Examples</title>
    <link>http://www.citeulike.org/group/2118/article/1137196</link>
    <description>&lt;i&gt;IEEE Transactions on Knowledge and Data Mining, Vol. 16, No. 1. (January 2004), pp. 70-81.&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;Web page classification is one of the essential techniques for Web mining because classifying Web pages of an interesting class is often the first step of mining the Web. However, constructing a classifier for an interesting class requires laborious preprocessing such as collecting positive and negative training examples. For instance, in order to construct a &#34;homepage&#34; classifier, one needs to collect a sample of homepages (positive examples) and a sample of nonhomepages (negative examples). In particular, collecting negative training examples requires arduous work and caution to avoid bias. The paper presents a framework, called positive example based learning (PEBL), for Web page classification which eliminates the need for manually collecting negative training examples in preprocessing. The PEBL framework applies an algorithm, called mapping-convergence (M-C), to achieve high classification accuracy (with positive and unlabeled data) as high as that of a traditional SVM (with positive and negative data). M-C runs in two stages: the mapping stage and convergence stage. In the mapping stage, the algorithm uses a weak classifier that draws an initial approximation of &#34;strong&#34; negative data. Based on the initial approximation, the convergence stage iteratively runs an internal classifier (e.g., SVM) which maximizes margins to progressively improve the approximation of negative data. Thus, the class boundary eventually converges to the true boundary of the positive class in the feature space. We present the M-C algorithm with supporting theoretical and experimental justifications. Our experiments show that, given the same set of positive examples; the M-C algorithm outperforms one-class SVMs, and it is almost as accurate as the traditional SVMs.</description>
    <dc:title>PEBL: Web Page Classification without Negative Examples</dc:title>

    <dc:creator>Hwanjo Yu</dc:creator>
    <dc:creator>Jiawei Han</dc:creator>
    <dc:creator>Kevin Chen-Chuan</dc:creator>
    <dc:source>IEEE Transactions on Knowledge and Data Mining, Vol. 16, No. 1. (January 2004), pp. 70-81.</dc:source>
    <dc:date>2007-03-02T20:28:36-00:00</dc:date>
    <prism:publicationYear>2004</prism:publicationYear>
    <prism:publicationName>IEEE Transactions on Knowledge and Data Mining</prism:publicationName>
    <prism:volume>16</prism:volume>
    <prism:number>1</prism:number>
    <prism:startingPage>70</prism:startingPage>
    <prism:endingPage>81</prism:endingPage>
    <prism:category>classification</prism:category>
    <prism:category>mining</prism:category>
    <prism:category>pebl</prism:category>
    <prism:category>web</prism:category>
</item>



<item rdf:about="http://www.citeulike.org/group/2118/article/777419">
    <title>Automated subject classification of textual Web pages, based on a controlled vocabulary: Challenges and recommendations</title>
    <link>http://www.citeulike.org/group/2118/article/777419</link>
    <description>&lt;i&gt;New Review in Hypermedia and Multimedia, Vol. 12, No. 1. (June 2006), pp. 11-27.&lt;/i&gt;</description>
    <dc:title>Automated subject classification of textual Web pages, based on a controlled vocabulary: Challenges and recommendations</dc:title>

    <dc:creator>Golub</dc:creator>
    <dc:creator>Koraljka</dc:creator>
    <dc:identifier>doi:10.1080/13614560600774313</dc:identifier>
    <dc:source>New Review in Hypermedia and Multimedia, Vol. 12, No. 1. (June 2006), pp. 11-27.</dc:source>
    <dc:date>2006-07-28T08:31:19-00:00</dc:date>
    <prism:publicationYear>2006</prism:publicationYear>
    <prism:publicationName>New Review in Hypermedia and Multimedia</prism:publicationName>
    <prism:issn>1361-4568</prism:issn>
    <prism:volume>12</prism:volume>
    <prism:number>1</prism:number>
    <prism:startingPage>11</prism:startingPage>
    <prism:endingPage>27</prism:endingPage>
    <prism:publisher>Taylor and Francis Ltd</prism:publisher>
    <prism:category>classification</prism:category>
    <prism:category>ontology</prism:category>
</item>



<item rdf:about="http://www.citeulike.org/group/2118/article/691964">
    <title>A Hybrid User Model for News Story Classification</title>
    <link>http://www.citeulike.org/group/2118/article/691964</link>
    <description>&lt;i&gt;(1999)&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;. We present an intelligent agent designed to compile a daily news program for individual users. Based on feedback from the user, the system automatically adapts to the user's preferences and interests. In this paper we focus on the system's user modeling component. First, we motivate the use of a multi-strategy machine learning approach that allows for the induction of user models that consist of separate models for long-term and short-term interests. Second, we investigate the utility of...</description>
    <dc:title>A Hybrid User Model for News Story Classification</dc:title>

    <dc:creator>D Billsus</dc:creator>
    <dc:creator>M Pazzani</dc:creator>
    <dc:source>(1999)</dc:source>
    <dc:date>2006-06-10T20:56:51-00:00</dc:date>
    <prism:publicationYear>1999</prism:publicationYear>
    <prism:category>classification</prism:category>
    <prism:category>news</prism:category>
    <prism:category>recommender</prism:category>
    <prism:category>user-profile</prism:category>
</item>



</rdf:RDF>

