Mining Context-Specific Web Knowledge : an Experimental Dictionary-based Approach

Di Lecce, V.; Calabrese, M.; Soldo, D.

doi:10.1007/978-3-540-85984-0_108

This work presents an experimental semantic approach to mining knowledge from the World Wide Web (WWW). The main goal is to build a context-specific knowledge base from web documents. The basic idea is to use reference knowledge provided by a dictionary as the indexing structure for domain-specific computed knowledge instances, organised as interlinked text words. The WordNet lexical database is used as the reference knowledge for English web documents. Both the reference and the computed knowledge are modelled as word graphs, since graphs are considered here a powerful way to represent structured knowledge. This choice affects how knowledge can be explored and how similar knowledge patterns can be identified. To identify context-specific elements in knowledge graphs, the novel semantic concept of “minutia” is introduced. A preliminary evaluation of the efficacy of the proposed approach has been carried out. A fair strategy for comparison with competing non-semantic approaches is currently under investigation.
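The idea of indexing a computed word graph against a reference dictionary graph can be illustrated with a minimal Python sketch. It is not the authors' method: the abstract does not say how words are linked, so sentence-level co-occurrence is assumed here, and `REFERENCE` is a hypothetical toy stand-in for WordNet. The function names `build_word_graph` and `shared_edges` are likewise illustrative only.

```python
from collections import defaultdict
from itertools import combinations
import re

# Hypothetical toy stand-in for a reference dictionary such as WordNet:
# each known word maps to the words the dictionary relates it to.
REFERENCE = {
    "bank": {"money", "river"},
    "money": {"bank", "loan"},
    "loan": {"money"},
    "river": {"bank", "water"},
    "water": {"river"},
}

def build_word_graph(documents, reference):
    """Link dictionary words that co-occur in a sentence (an assumed linking rule)."""
    graph = defaultdict(lambda: defaultdict(int))
    for doc in documents:
        for sentence in re.split(r"[.!?]+", doc.lower()):
            # Keep only words indexed by the reference dictionary.
            words = sorted({w for w in re.findall(r"[a-z]+", sentence) if w in reference})
            for a, b in combinations(words, 2):
                graph[a][b] += 1
                graph[b][a] += 1
    return graph

def shared_edges(graph, reference):
    """Return computed edges that also exist in the reference graph."""
    return {
        (a, b)
        for a in graph
        for b in graph[a]
        if a < b and b in reference.get(a, set())
    }

docs = [
    "The bank approved the loan. Money from the bank was wired.",
    "The loan covers the money owed.",
]
graph = build_word_graph(docs, REFERENCE)
common = shared_edges(graph, REFERENCE)
```

In this toy corpus the computed graph links only financial terms, so edges that appear in the computed graph but not in the reference (or the reverse) hint at what is specific to the context. How the paper's "minutia" concept formalises such elements is not stated in the abstract and is not modelled here.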

Mining Context-Specific Web Knowledge : an Experimental Dictionary-based Approach / V. Di Lecce, M. Calabrese, D. Soldo (LECTURE NOTES IN COMPUTER SCIENCE). - In: Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence / [edited by] D.S. Huang, D.C. Wunsch, D.S. Levine, K.H. Jo. - Berlin : Springer, 2008. - ISBN 978-3-540-85983-3. - pp. 896-905 (Paper presented at the 4th International Conference on Intelligent Computing, ICIC, held in Shanghai in 2008) [10.1007/978-3-540-85984-0_108].




	Keywords
	
			Knowledge Discovery; Semantic Web; Web Mining; WordNet
		
	Scientific-disciplinary sectors of the contribution
	
			Sector ING-INF/05 - Information Processing Systems
		
	Publication date
	
			2008
		
	DOI
	
			https://dx.doi.org/10.1007/978-3-540-85984-0_108
		
	Type
	
			Book Part (author)
		
	Appears in types:
	
			03 - Book contribution


Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/214176


IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca