the-data-mining-blog: April 2005

Saturday, April 30, 2005

A Survey of Web Metrics - Web Mining related paper

dhyani02survey.pdf (application/pdf-Objekt)

# posted by Dr. Martin Menzel @ 11:44 PM 0 comments

Similarity Queries - Web Mining related paper

cohen99recognizing.pdf (application/pdf-Objekt)

# posted by Dr. Martin Menzel @ 11:40 PM 0 comments

HTML Similarities - Web Mining related paper

jeh02simrank.pdf (application/pdf-Objekt)

# posted by Dr. Martin Menzel @ 11:37 PM 0 comments

Friday, April 29, 2005

Elsevier.com

# posted by Dr. Martin Menzel @ 4:54 AM 0 comments

Elsevier.com

# posted by Dr. Martin Menzel @ 4:54 AM 0 comments

Author Gateway - Getting Published - LaTeX file guidelines

# posted by Dr. Martin Menzel @ 4:54 AM 0 comments

IEEE Intelligent Systems

# posted by Dr. Martin Menzel @ 4:45 AM 0 comments

AI Reference Shelf

# posted by Dr. Martin Menzel @ 4:40 AM 0 comments

Journal Informations in the Reference List ...

lavrac96intelligent.pdf (application/pdf-Objekt)

Journal Informations in the Reference List ...

# posted by Dr. Martin Menzel @ 4:39 AM 0 comments

Elsevier Author Gateway

# posted by Dr. Martin Menzel @ 4:31 AM 0 comments

Elsevier Author Gateway

# posted by Dr. Martin Menzel @ 4:30 AM 0 comments

Elsevier Author Gateway

# posted by Dr. Martin Menzel @ 4:30 AM 0 comments

Elsevier Author Gateway

# posted by Dr. Martin Menzel @ 4:30 AM 0 comments

Machine Learning and Natural Language Processing Lab

Machine Learning and Natural Language Processing Lab: "Link Statistical Methods in Medical Research"

# posted by Dr. Martin Menzel @ 4:28 AM 0 comments

Potential Utility of Data-Mining Algorithms for Early Detection of Potentially Fatal/Disabling Adverse Drug Reactions: A Retrospective Evaluation -- H

Potential Utility of Data-Mining Algorithms for Early Detection of Potentially Fatal/Disabling Adverse Drug Reactions: A Retrospective Evaluation -- Hauben and Reich 45 (4): 378 -- The Journal of Clinical Pharmacology

# posted by Dr. Martin Menzel @ 4:28 AM 0 comments

Elsevier Author Gateway

# posted by Dr. Martin Menzel @ 4:27 AM 0 comments

ScienceDirect - Artificial Intelligence in Medicine - List of Issues

# posted by Dr. Martin Menzel @ 4:25 AM 0 comments

Tuesday, April 12, 2005

Statistical Data Mining Tutorials

# posted by Dr. Martin Menzel @ 3:33 AM 0 comments

Monday, April 11, 2005

EntropyBasedLinkAnalysis.pdf (application/pdf-Objekt)

# posted by Dr. Martin Menzel @ 1:59 AM 0 comments

Saturday, April 09, 2005

A sequential algorithm for training text classifiers

# posted by Dr. Martin Menzel @ 2:54 AM 0 comments

Discovering informative content blocks from Web documents

# posted by Dr. Martin Menzel @ 2:52 AM 0 comments

Template detection via data mining and its applications

# posted by Dr. Martin Menzel @ 2:50 AM 0 comments

Friday, April 08, 2005

kdd2003-webNoise.pdf (application/pdf-Objekt)

# posted by Dr. Martin Menzel @ 12:49 PM 0 comments

Sunday, April 03, 2005

treeFinderSys.pdf

treeFinderSys.pdf (application/pdf-Objekt)

XML Tree Finder System: a First Step towards XML Data Mining
Final Report
Anguo Dong
Supervisor: Dr.Reda Alhajj
Computer Science Department
University of Calgary
April 5, 2004
Abstract
The problem of searching frequent trees from a
collection of tree-structured XML data modeling is
considered. The aim of this XML Tree Finder system(
XTFS) is to find the tree whose exact or perturbed
copies are frequent in a collection of the labeled
trees. The definition of the labeled tree will be
given later.Frequent here means that the tree we find
is the Maximal Common Tree of the collection of the
labeled tree.

# posted by Dr. Martin Menzel @ 8:14 AM 0 comments

WISDOM: Web Intrapage Informative Structure Mining Based on Document Object Model

WISDOM: Web Intrapage Informative Structure Mining Based on Document Object Model

Hung-Yu Kao, Jan-Ming Ho, Ming-Syan Chen, IEEE
To increase the commercial value and accessibility of pages, most content sites tend to publish their pages with intrasite redundant information, such as navigation panels, advertisements, and copyright announcements. Such redundant information increases the index size of general search engines and causes page topics to drift. In this paper, we study the problem of mining intrapage informative structure in news Web sites in order to find and eliminate redundant information. Note that intrapage informative structure is a subset of the original Web page and is composed of a set of fine-grained and informative blocks. The intrapage informative structures of pages in a news Web site contain only anchors linking to news pages or bodies of news articles. We propose an intrapage informative structure mining system called WISDOM (Web Intrapage Informative Structure Mining based on the Document Object Model) which applies Information Theory to DOM tree knowledge in order to build the structure. WISDOM splits a DOM tree into many small subtrees and applies a top-down informative block searching algorithm to select a set of candidate informative blocks. The structure is built by expanding the set using proposed merging methods. Experiments on several real news Web sites show high precision and recall rates which validates WISDOM's practical applicability.

Index Terms- Index Terms- Intrapage informative structure, DOM, entropy, information extraction.

# posted by Dr. Martin Menzel @ 7:43 AM 0 comments

Advanced Data Mining

Advanced Data Mining: "Lecture Notes on Graphical Modeling: Part 2 Directed Graphs"

# posted by Dr. Martin Menzel @ 7:17 AM 0 comments

Web Mining und Personalisierung in Echtzeit - DynaMine.pdf

DynaMine.pdf (application/pdf-Objekt)

Web Mining und Personalisierung in Echtzeit

# posted by Dr. Martin Menzel @ 5:02 AM 0 comments

Predictive Modeling - HolzSlides.pdf

HolzSlides.pdf (application/pdf-Objekt)

# posted by Dr. Martin Menzel @ 5:00 AM 0 comments

Data Mining mit Neuro-Fuzzy Systemen sor_99.pdf

sor_99.pdf (application/pdf-Objekt)

# posted by Dr. Martin Menzel @ 4:54 AM 0 comments

the-data-mining-blog

Saturday, April 30, 2005

A Survey of Web Metrics - Web Mining related paper

Similarity Queries - Web Mining related paper

HTML Similarities - Web Mining related paper

Friday, April 29, 2005

Elsevier.com

Elsevier.com

Author Gateway - Getting Published - LaTeX file guidelines

IEEE Intelligent Systems

AI Reference Shelf

Journal Informations in the Reference List ...

Elsevier Author Gateway

Elsevier Author Gateway

Elsevier Author Gateway

Elsevier Author Gateway

Machine Learning and Natural Language Processing Lab

Potential Utility of Data-Mining Algorithms for Early Detection of Potentially Fatal/Disabling Adverse Drug Reactions: A Retrospective Evaluation -- H

Elsevier Author Gateway

ScienceDirect - Artificial Intelligence in Medicine - List of Issues

Tuesday, April 12, 2005

Statistical Data Mining Tutorials

Monday, April 11, 2005

EntropyBasedLinkAnalysis.pdf (application/pdf-Objekt)

Saturday, April 09, 2005

A sequential algorithm for training text classifiers

Discovering informative content blocks from Web documents

Template detection via data mining and its applications

Friday, April 08, 2005

kdd2003-webNoise.pdf (application/pdf-Objekt)

Sunday, April 03, 2005

treeFinderSys.pdf

WISDOM: Web Intrapage Informative Structure Mining Based on Document Object Model

Advanced Data Mining

Web Mining und Personalisierung in Echtzeit - DynaMine.pdf

Predictive Modeling - HolzSlides.pdf

Data Mining mit Neuro-Fuzzy Systemen sor_99.pdf

About Me

Links

archives