| |
In SIGIR '04: Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval (2004), pp. 494-495.
Abstract
We describe an evaluation of result set filtering techniques for providing ultra-high precision in the task of presenting related news for general web queries. In this task, the negative user experience generated by retrieving non-relevant documents has a much worse impact than not retrieving relevant ones. We adapt cost-based metrics from the document filtering domain to this result filtering problem in order to explicitly examine the tradeoff between missing relevant documents and retrieving non-relevant ones. A large manual evaluation of three ...
|
| |
Computer Networks and ISDN Systems, Vol. 30, No. 1-7. (April 1998), pp. 107-117.
Abstract
In this paper, we present Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext. Google is designed to crawl and index the Web efficiently and produce much more satisfying search results than existing systems. The prototype with a full text and hyperlink database of at least 24 million pages is available at http://google.stanford.edu/ To engineer a search engine is a challenging task. Search engines index tens to hundreds of millions of Web ...
|
| |
Advanced Information Systems Engineering (2004), pp. 398-413.
Abstract
We describe a systematic methodology for extending the audiovisual content description standards (MPEG-7 and TV-Anytime) with domain-specific knowledge descriptions expressed in OWL. The domain-specific descriptions of the audiovisual content metadata are completely transparent to applications and tools that use MPEG-7 and TV-Anytime, allowing them to use the domain- specific ontologies without any software changes. We also present an interoperability mechanism between OWL and the audiovisual content description standards, which allows MPEG-7 and TV-Anytime descriptions and their domain-specific extensions to be described ...
|
| |
Data & Knowledge Engineering, Vol. 35, No. 3. (December 2000), pp. 259-298.
Abstract
Since media-based evaluation yields similarity values, results to a multimedia database query, Q ( Y 1 ,…, Y n ), is defined as an ordered list S Q of n -tuples of the form X 1 ,…, X n . The query Q itself is composed of a set of fuzzy and crisp predicates, constants, variables, and conjunction, disjunction, and negation operators. Since many multimedia applications require partial matches, S Q includes ...
|
| |
Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on, Vol. 2 (2004), pp. 823-826 Vol.2.
Abstract
Digital television is a reality, providing new facilities for multimedia content delivery. Due to the large amount of available content from multiple broadcasters, the use of personalized recommendation techniques can help the user in the process of consuming A/V content. The viewer, who is actually a passive subject, is changed into an active one, who can organize the TV program guide of interest whenever and however he wants, achieving a real "a/spl grave/ la carte TV". We present an end-to-end system ...
|
| |
Software: Practice and Experience, Vol. 38, No. 9. (2008), pp. 925-960.
Abstract
Digital Television will bring a significant increase in the amount of channels and programs available to end users, with many more difficulties to find contents appealing to them among a myriad of irrelevant information. Thus, automatic content recommenders should receive special attention in the following years to improve their assistance to users. The current content recommenders have important deficiencies that hamper their wide acceptance. In this paper, we present a new approach for automatic content recommendation that significantly reduces those deficiencies. ...
|
| |
ACM Trans. Inf. Syst., Vol. 22, No. 1. (January 2004), pp. 5-53.
Abstract
Recommender systems have been evaluated in many, often incomparable, ways. In this article, we review the key decisions in evaluating collaborative filtering recommender systems: the user tasks being evaluated, the types of analysis and datasets being used, the ways in which prediction quality is measured, the evaluation of prediction attributes other than quality, and the user-based evaluation of the system as a whole. In addition to reviewing the evaluation strategies used by prior researchers, we present empirical results from the analysis ...
|
| |
Trust Management In Trust Management (2006), pp. 93-104.
Abstract
Social networks are growing in number and size, with hundreds of millions of user accounts among them. One added benefit of these networks is that they allow users to encode more information about their relationships than just stating who they know. In this work, we are particularly interested in trust relationships, and how they can be used in designing interfaces. In this paper, we present FilmTrust, a website that uses trust in web-based social networks to create predictive movie recommendations. Using ...
|
| |
In SIGIR (2008), pp. 531-538.
Abstract
In this paper, we look at the "social tag prediction" problem. Given a set of objects, and a set of tags applied to those objects by users, can we predict whether a given tag could/should be applied to a particular object? We investigated this question using one of the largest crawls of the social bookmarking system del.icio.us gathered to date. For URLs in del.icio.us, we predicted tags based on page text, anchor text, surrounding hosts, and other tags applied to the ...
|
| |
Web Semantics: Science, Services and Agents on the World Wide Web, Vol. 6, No. 2. (April 2008), pp. 139-150.
Abstract
In this paper we describe a model for automatically generating video documentaries. This allows viewers to specify the subject and the point of view of the documentary to be generated. The domain is matter-of-opinion documentaries based on interviews. The model combines rhetorical presentation patterns used by documentary makers with a data-driven approach. Rhetorical presentation patterns provide the viewer with an engaging viewing experience, while a data-driven approach can be applied to growing media repositories. To date, the modeling of rhetoric has ...
|
| |
Web Semantics: Science, Services and Agents on the World Wide Web, Vol. 6, No. 2. (April 2008), pp. 151-161.
Abstract
Newspapers are evolving and this causes great changes in how newspapers reach their consumers, but also in how newspapers work internally. Advanced computerised support is needed in order to cope with the new needs, which require that machines are aware of a greater part of the underlying semantics. Ontologies and Semantic Web technologies are clear candidates for web-wide semantics. However, newspapers have made great investments in their current news management systems and their wish is to undertake a smooth transition. Our ...
|
| |
Web Semant., Vol. 6, No. 2. (April 2008), pp. 162-169.
|
| |
Web Semantics: Science, Services and Agents on the World Wide Web, Vol. 6, No. 4. (November 2008), pp. 243-249.
Abstract
In this article we describe a Semantic Web application for semantic annotation and search in large virtual collections of cultural-heritage objects, indexed with multiple vocabularies. During the annotation phase we harvest, enrich and align collection metadata and vocabularies. The semantic-search facilities support keyword-based queries of the graph (currently 20 M triples), resulting in semantically grouped result clusters, all representing potential semantic matches of the original query. We show two sample search scenario’s. The annotation and search software is open source and is ...
|
| |
The Semantic Web (2008), pp. 552-565.
Abstract
Developers of Semantic Web applications face a challenge with respect to the decentralised publication model: where to find statements about encountered resources. The “linked data” approach, which mandates that resource URIs should be de-referenced and yield metadata about the resource, helps but is only a partial solution. We present Sindice, a lookup index over resources crawled on the Semantic Web. Our index allows applications to automatically retrieve sources with information about a given resource. In addition we allow resource retrieval through ...
|
| |
Intelligent Systems, IEEE In Intelligent Systems, IEEE, Vol. 23, No. 3. (2008), pp. 20-28.
Abstract
Although research on integrating semantics with the Web started almost as soon as the Web was in place, a concrete Semantic Web that is, a large-scale collection of distributed semantic metadata emerged only over the past four to five years. The Semantic Web's embryonic nature is reflected in its existing applications. Most of these applications tend to produce and consume their own data, much like traditional knowledge- based applications, rather than actually exploiting the Semantic Web as a large-scale information source. ...
|
| |
(1977)
Abstract
The knowledge engineer practices the art of bringing the principles and tools of AI research to bear on difficult applications problems requiring experts” knowledge for their solution. The technical issues of acquiring this knowledge, representing it, and using it appropriately to construct and explain lines-of-reasoning, are important problems in the design of knowledge-based systems. Various systems that have achieved expert level performance in scientific and medical inference illuminates the art of knowledge engineering and its parent science, Artificial Intelligence. ...
|
| |
CEUR Workshop Proceedings (15-17 December 2008)
Abstract
Thanks to the huge efforts deployed in the community for creating, building and generating semantic information for the Semantic Web, large amounts of machine processable knowledge are now openly available. Watson is an infrastructure component for the Semantic Web, a gateway that provides the necessary functions to support applications in using the Semantic Web. In this paper, we describe a number of applications relying on Watson, with the purpose of demonstrating what can be achieved with the Semantic Web nowadays and what ...
|
| |
|
| |
The Semantic Web - ISWC 2006 (2006), pp. 242-257.
Abstract
Semantic Web languages are being used to represent, encode and exchange semantic data in many contexts beyond the Web – in databases, multiagent systems, mobile computing, and ad hoc networking environments. The core paradigm, however, remains what we call the Web aspect of the Semantic Web – its use by independent and distributed agents who publish and consume data on the World Wide Web. To better understand this central use case, we have harvested and analyzed a collection of Semantic Web ...
|
| |
In In 3rd Workshop on Scripting For The Semantic Web (SFSW07, Vol. 2 (2007)
Abstract
Abstract. In this paper we describe a distributed architecture consisting of a combination of scripting tools that interact with each other in order to help to find and query decentralized RDF data. Thanks to this architecture, anyone can participate in the collaborative discovery of Semantic Web documents by simply browsing the web. This system is useful for dynamic discovery of RDF content and can provide a useful source of RDF documents for other Semantic Web applications. Key components of this architecture ...
|
| |
Computer, Vol. 38, No. 10. (2005), pp. 62-69.
Abstract
To help human users and software agents find relevant knowledge on the Semantic Web, the Swoogle search engine discovers, indexes, and analyzes the ontologies and facts that are encoded in Semantic Web documents. ...
|
| |
In CIKM '04: Proceedings of the thirteenth ACM conference on Information and knowledge management (2004), pp. 652-659.
Abstract
Swoogle is a crawler-based indexing and retrieval system for the Semantic Web. It extracts metadata for each discovered document, and computes relations between documents. Discovered documents are also indexed by an information retrieval system which can use either character N-Gram or URIrefs as keywords to find relevant documents and to compute the similarity among a set of documents. One of the interesting properties we compute is <i>ontology rank</i>, a measure of the importance of a Semantic Web document. ...
|
| |
In WWW '05: Special interest tracks and posters of the 14th international conference on World Wide Web (2005), pp. 902-903.
Abstract
In this short paper we estimate the size of the public indexable web at 11.5 billion pages. We also estimate the overlap and the index size of Google, MSN, Ask/Teoma and Yahoo! ...
|
| |
Intelligent Systems, IEEE, Vol. 23, No. 3. (May 2008), pp. 29-40.
Abstract
This paper deals with applying semantic Web technologies to the social Web can lead to a social semantic Web, creating a network of interlinked and semantically rich knowledge. One of the most visible trends on the Web is the emergence of social Web sites, which help people create and gather knowledge by simplifying user contributions via blogs, tagging and folksonomies, wikis, podcasts, and online social networks. The social Web has enabled community-based knowledge acquisition. ...
|
| |
Science, Vol. 311, No. 5757. (6 January 2006), pp. 88-90.
Abstract
Social networks evolve over time, driven by the shared activities and affiliations of their members, by similarity of individuals' attributes, and by the closure of short network cycles. We analyzed a dynamic social network comprising 43,553 students, faculty, and staff at a large university, in which interactions between individuals are inferred from time-stamped e-mail headers recorded over one academic year and are matched with affiliations and attributes. We found that network evolution is dominated by a combination of effects arising from ...
|
| |
Journal of Computer-Mediated Communication, Vol. 3, No. 1. (1997), pp. 0-0.
Abstract
Abstract When a computer network connects people or organizations, it is a social network. Yet the study of such computer-supported social networks has not received as much attention as studies of human-computer interaction, online person-to-person interaction, and computer-supported communication within small groups. We argue the usefulness of a social network approach for the study of computer-mediated communication. We review some basic concepts of social network analysis, describe how to collect and analyze social network data, and demonstrate where social network data ...
|
| |
(17 May 2002)
Abstract
Social networks have the surprising property of being "searchable": Ordinary people are capable of directing messages through their network of acquaintances to reach a specific but distant target person in only a few steps. We present a model that offers an explanation of social network searchability in terms of recognizable personal identities: sets of characteristics measured along a number of social dimensions. ...
|
| |
(3 November 2004)
Abstract
We address the question of how participants in a small world experiment are able to find short paths in a social network using only local information about their immediate contacts. We simulate such experiments on a network of actual email contacts within an organization as well as on a student social networking website. On the email network we find that small world search strategies using a contact's position in physical space or in an organizational hierarchy relative to the target can effectively be used to locate most ...
|
| |
(2003)
Abstract
The problem of how to fast and efficiently access the large amount of stored information has been studied for a long time in Information Retrieval. One example is web search engines, which periodically crawl and index the available information on the Web. However, much information might not be online and cannot be accessed by web search engines. Instead, this kind of information exists in people's head and only can be accessed by asking the right people. ...
|
| |
Advances in Information Retrieval (2006), pp. 289-301.
Abstract
We present an analysis of a large blog search engine query log, exploring a number of angles such as query intent, query topics, and user sessions. Our results show that blog searches have different intents than general web searches, suggesting that the primary targets of blog searchers are tracking references to named entities, and locating blogs by theme. In terms of interest areas, blog searchers are, on average, more engaged in technology, entertainment, and politics than web searchers, with a particular ...
|