
ChaTo's text [42 articles]

Recent papers added to ChaTo's library under the tag "text".
  • Floatcascade learning for fast imbalanced web mining
    (2008), pp. 71-80.
    by Xiaoxun Zhang, Xueying Wang, Honglei Guo, Zhili Guo, Xian Wu, Zhong Su
    posted to text machine-learning classification by ChaTo on 2008-06-25 11:17:58 as read
  • Yago: a core of semantic knowledge
    WWW (2007), pp. 697-706.
    by Fabian M Suchanek, Gjergji Kasneci, Gerhard Weikum
  • Flickr tag recommendation based on collective knowledge
    (2008), pp. 327-336.
    by Börkur Sigurbjörnsson, Roelof van Zwol
    posted to text tagging social by ChaTo on 2008-06-25 11:13:49 as read along with 2 people dlaguardia intrect
  • Genealogical trees on the web: a search engine user perspective
    (2008), pp. 367-376.
    by Ricardo Baeza-Yates, Álvaro Pereira, Nivio Ziviani
    posted to web-characterization time text corpus by ChaTo on 2008-06-25 11:12:58 as read
  • Syntactic clustering of the Web
    Computer Networks and ISDN Systems, Vol. 29, No. 8-13. (September 1997), pp. 1157-1166.
    by Andrei Z Broder, Steven C Glassman, Mark S Manasse, Geoffrey Zweig
  • Thumbs up?: sentiment classification using machine learning techniques
    (July 2002), pp. 79-86.
    by Bo Pang, Lillian Lee, Shivakumar Vaithyanathan
    posted to text classification by ChaTo on 2008-06-12 11:06:54 as read along with 2 people koles eegilbert
  • Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies
    The VLDB Journal, Vol. 7, No. 3. (August 1998), pp. 163-178.
    by Soumen Chakrabarti, Byron Dom, Rakesh Agrawal, Prabhakar Raghavan
    posted to text classification by ChaTo on 2008-06-12 11:05:41 as read
  • Hierarchical classification of Web content
    (July 2000), pp. 256-263.
    by Susan Dumais, Hao Chen
    posted to text classification by ChaTo on 2008-06-12 11:03:55 as ** along with 3 people gyuli adamsi agaelebe
  • Topical locality in the Web
    (July 2000), pp. 272-279.
    by Brian D Davison
    posted to text similarity link-analysis by ChaTo on 2008-06-12 10:51:34 as read along with 1 person jsenn
  • Managing Gigabytes: Compressing and Indexing Documents and Images
    (15 May 1999)
    by Ian H Witten, Alistair Moffat, Timothy C Bell
  • Introduction to Modern Information Retrieval (McGraw-Hill Computer Science Series)
    (01 September 1983)
    by Gerard Salton
    posted to text by ChaTo on 2008-06-12 10:42:03 as * along with 4 people jelsas Jaykul rayzhang Tronhus
  • Set-based model: a new approach for information retrieval
    (2002), pp. 230-237.
    by Bruno Pôssas, Nivio Ziviani, Wagner Meira, Berthier Ribeiro-Neto
    posted to clustering text by ChaTo on 2008-02-29 19:19:18 as read
  • Opinion spam and analysis
    (2008), pp. 219-230.
    by Nitin Jindal, Bing Liu
    posted to adversarial-ir blogs text by ChaTo on 2008-02-20 11:26:18 as read along with 1 person jliegl
  • Understanding temporal aspects in document classification
    (2008), pp. 159-170.
    by Fernando, Leonardo Rocha, Renata Araújo, Thierson Couto, Marcos Gonçalves, Wagner Meira
    posted to classification text time web-characterization by ChaTo on 2008-02-20 11:17:15 as read
  • A divisive information theoretic feature clustering algorithm for text classification
    J. Mach. Learn. Res., Vol. 3 (2003), pp. 1265-1287.
    by Inderjit S Dhillon, Subramanyam Mallela, Rahul Kumar
    posted to clustering text by ChaTo on 2007-11-21 09:44:01 as **** along with 1 person ciga
  • A link-based ranking scheme for focused search
    (2007), pp. 1125-1126.
    by Philip O'Brien, Tony Abou-Assaleh, Tapajyoti Das, Weizheng Gao, Yingbo Miao, Zhen Zhen
    posted to link-analysis ranking text by ChaTo on 2007-09-26 15:39:19 as read
  • Serial Sharers: Detecting Split Identities of Web Authors
    (July 2007)
    by Einat Amitay, Sivan Yogev, Elad Yom-Tov
    posted to clustering social text by ChaTo on 2007-09-21 17:45:48 as read
  • A Survey of Web Information Extraction Systems
    IEEE Transactions on Knowledge and Data Engineering, Vol. 18, No. 10. (October 2006), pp. 1411-1428.
    by Mohammed Kayed, Khaled F Shaalan
  • Beyond PageRank: machine learning for static ranking
    (May 2006), pp. 707-715.
    by Matthew Richardson, Amit Prakash, Eric Brill
    posted to machine-learning quality ranking text by ChaTo on 2007-06-07 12:59:01 as read along with 1 person pprett
  • Indexing by latent semantic analysis
    Journal of the American Society for Information Science, Vol. 41, No. 6. (September 1990), pp. 391-407.
    by Scott Deerwester, Susan T Dumais, George W Furnas, Thomas K Landauer, Richard Harshman
  • A content-driven reputation system for the Wikipedia
    (2007), pp. 261-270.
    by Thomas B Adler, Luca de Alfaro
  • Review spam detection
    (2007), pp. 1189-1190.
    by Nitin Jindal, Bing Liu
    posted to adversarial-ir text by ChaTo on 2007-05-10 23:28:53 as read
  • Detecting near-duplicates for web crawling
    (2007), pp. 141-150.
    by Gurmeet S Manku, Arvind Jain, Anish D Sarma
  • On-line Supervised Spam Filter Evaluation
    ACM Transactions on Information Systems, Vol. 25, No. 3. (July 2007)
    by Gordon V Cormack, Thomas R Lynam
    posted to adversarial-ir classification text by ChaTo on 2007-05-10 21:29:22 as read
  • Page-level template detection via isotonic smoothing
    (2007), pp. 61-70.
    by Deepayan Chakrabarti, Ravi Kumar, Kunal Punera
  • Internet-scale collection of human-reviewed data
    (2007), pp. 231-240.
    by Qi Su, Dmitry Pavlov, Jyh-Herng Chow, Wendell C Baker
  • Dynamic personalized pagerank in entity-relation graphs
    (2007), pp. 571-580.
    by Soumen Chakrabarti
    posted to link-analysis text by ChaTo on 2007-05-09 20:59:36 as **** along with 2 people donade pprett
  • Efficient search in large textual collections with redundancy
    (2007), pp. 411-420.
    by Jiangong Zhang, Torsten Suel
    posted to indexing similarity text by ChaTo on 2007-05-09 20:42:32 as read along with 2 people kaineci AlisonBabeu
  • Respect my authority!: HITS without hyperlinks, utilizing cluster-based language models
    (2006), pp. 83-90.
    by Oren Kurland, Lillian Lee
    posted to link-analysis ranking text by ChaTo on 2007-05-03 14:33:43 as **
  • Blocking Blog Spam with Language Model Disagreement
    (May 2005)
    by Gilad Mishne, David Carmel, Ronny Lempel
    posted to adversarial-ir text by ChaTo on 2006-10-17 18:23:50 as read
  • Detecting nepotistic links by language model disagreement
    (2006), pp. 939-940.
    by András A Benczúr, István Bíró, Károly Csalogány, Máté Uher
    posted to adversarial-ir text by ChaTo on 2006-10-11 11:09:51 as read
  • Tracking Web Spam with Hidden Style Similarity
    (10 August 2006)
    by Tanguy Urvoy, Thomas Lavergne, P Filoche
    posted to adversarial-ir text by ChaTo on 2006-08-21 21:41:18 as read
  • A comparison of implicit and explicit links for web page classification
    (2006), pp. 643-650.
    by Dou Shen, Jian-Tao Sun, Qiang Yang, Zheng Chen
  • Retroactive answering of search queries
    (2006), pp. 457-466.
    by Beverly Yang, Glen Jeh
  • Generating query substitutions
    (2006), pp. 387-396.
    by Rosie Jones, Benjamin Rey, Omid Madani, Wiley Greiner
  • N-Gram-Based Text Categorization
    (1994), pp. 161-175.
    by William B Cavnar, John M Trenkle
  • On The Evolution of Clusters of Near-Duplicate Web Pages
    Journal of Web Engineering, Vol. 2, No. 4. (2004), pp. 228-246.
    by Dennis Fetterly, Mark Manasse, Marc Najork
    posted to clustering text by ChaTo on 2006-04-11 13:14:12 as read
  • Information retrieval as statistical translation
    (1999), pp. 222-229.
    by Adam Berger, John Lafferty
  • Detecting phrase-level duplication on the world wide web
    (2005), pp. 170-177.
    by Dennis Fetterly, Mark Manasse, Marc Najork
    posted to clustering text by ChaTo on 2005-09-14 15:07:38 as read along with 1 person MaineC
  • Frequency of occurrence of numbers in the World Wide Web
    (26 April 2005)
  • The Geometry of Information Retrieval
    (12 August 2004)
  • Modeling text collections and its application to the Web
    Applied Probability: Recent Advances (2004)
    by Ricardo Baeza-Yates, Gonzalo Navarro
    edited by Ricardo Baeza-Yates, Joe Glaz, Henryk Gzyl, Juerg Huesler, Jose L Palacios
    posted to text web-characterization by ChaTo on 2005-07-07 10:53:54 as read
Note: You may cite this page as: http://www.citeulike.org/user/ChaTo/tag/text
