Menu
Home
Contact us
Stats
Categories
Calendar
Toggle Wiki
Wiki Home
Last Changes
Rankings
List pages
Orphan pages
Sandbox
Print
Toggle Image Galleries
Galleries
Rankings
Toggle Articles
Articles home
List articles
Rankings
Toggle Blogs
List blogs
Rankings
Toggle Forums
List forums
Rankings
Toggle File Galleries
List galleries
Rankings
Toggle Maps
Mapfiles
Toggle Surveys
List surveys
Stats
ITHEA Classification Structure > H. Information Systems  > H.2 DATABASE MANAGEMENT  > H.2.8 Database Applications 
INDIRECT SPATIAL DATA EXTRACTION FROM WEB DOCUMENTS
By: Blagoev et al. (3990 reads)
Rating: (1.00/10)

Abstract: An approach for indirect spatial data extraction by learning restricted finite state automata from web documents created using Bulgarian language are outlined in the paper. It uses heuristics to generalize initial finite-state automata that recognizes only the positive examples and nothing else into automata that recognizes as larger language as possible without extracting any non-positive examples from the training data set. The learning method, program realization and experiments are presented. The investigation is carried out in accordance and following the rules of EU INSPIRE Network.

Keywords: Automatic Data Extraction, Restricted Finite State Automata, Web Documents, Indirect Spatial Data, INSPIRE network.

ACM Classification Keywords: H.2.8 Database Applications - Data mining; F.1.1 Models of Computation – Finite State Automata

Link:

INDIRECT SPATIAL DATA EXTRACTION FROM WEB DOCUMENTS

Dimitar Blagoev, George Totkov, Milena Staneva, Krassimira Ivanova, Krassimir Markov, Peter Stanchev

Print
H.2.8 Database Applications
article: STORING INFORMATION VIA NATURAL LANGUAGE ADDRESSING – A STEP TOWARD MODELING ... · ALGORITHM FOR QUICK NUMBERING OF LARGE VOLUMES OF DATA · RDFARM - A SYSTEM FOR STORING LARGE SETS OF RDF TRIPLES AND QUADRUPLES BY ... · SELF-CITATIONS EFFECT ON SCIENTOMETRIC INDEXES · SHAPING THE CITATION-PAPER RANK DISTRIBUTIONS: BEYOND HIRSCH’S MODEL · ONTOARM - A SYSTEM FOR STORING ONTOLOGIES BY NATURAL LANGUAGE ADDRESSING · METHOD OF DATA ANALYSIS BASED ON CLUSTERING IN “SYNDROMES” INDICATORS SPACE · ANALYZING THE LOCALIZATION OF LANGUAGE FEATURES WITH COMPLEX SYSTEMS TOOLS ... · WORDARM - A SYSTEM FOR STORING DICTIONARIES AND THESAURUSES BY ... · ASSOCIATION RULE MINING WITH N-DIMENSIONAL UNIT CUBE CHAIN SPLIT TECHNIQUE · ON A METHOD OF MULTI-ALGORITHMIC CLASSIFICATION · PROCESSING SETS OF CLASSES’ LOGICAL REGULARITIES · CITATION-PAPER RANK DISTRIBUTIONS AND ASSOCIATED SCIENTOMETRIC INDICATORS ... · MULTI-VARIANT PYRAMIDAL CLUSTERING AND ANALYSIS HIGH-DIMENSIONAL DATA · THEORETICAL ANALYSIS OF EMPIRICAL RELATIONSHIPS FOR PARETODISTRIBUTED... · INTEGRATED ENVIRONMENT FOR STORING AND HANDLING INFORMATION IN TASKS OF ... · ABOUT MULTI-VARIANT CLUSTERING AND ANALYSIS HIGH-DIMENSIONAL DATA · COMPUTATIONAL MODEL FOR SERENDIPITY · METHOD FOR EVALUATING OF DISCREPANCY BETWEEN REGULARITIES SYSTEMS IN ... · ASTRONOMICAL PLATES SPECTRA EXTRACTION OBJECTIVES AND POSSIBLE SOLUTIONS ... · METHODS OF REGULARITIES SEARCHING BASED ON OPTIMAL PARTITIONING · AN APPROACH TO VARIABLE AGGREGATION IN EFFICIENCY ANALYSIS · INDIRECT SPATIAL DATA EXTRACTION FROM WEB DOCUMENTS · METHODS FOR EVALUATING OF REGULARITIES SYSTEMS STRUCTURE · COMPOSITE BLOCK OPTIMIZED CLASSIFICATION DATA STRUCTURES · INDIRECT SPATIAL DATA EXTRACTION FROM WEB DOCUMENTS · INDIRECT SPATIAL DATA EXTRACTION FROM WEB DOCUMENTS · HOW TO USE A DESKTOP VERSION OF A DBMS FOR CLIENT-SERVER APPLICATIONS · DEVELOPMENT OF DATABASE FOR DISTRIBUTED INFORMATION MEASUREMENT ... · THE DEVELOPMENT OF THE GENERALIZATION ALGORITHM BASED ON THE ROUGH SET THEORY · THE ROLE OF DBMS IN ANALYTICAL PROCESSES OF THE LOGISTIC ·
Login
[ register | I forgot my password ]
World Clock
Powered by Tikiwiki Powered by PHP Powered by Smarty Powered by ADOdb Made with CSS Powered by RDF powered by The PHP Layers Menu System
RSS Wiki RSS Blogs rss Articles RSS Image Galleries RSS File Galleries RSS Forums RSS Maps rss Calendars
[ Execution time: 0.08 secs ]   [ Memory usage: 7.57MB ]   [ GZIP Disabled ]   [ Server load: 0.41 ]
Powered by Tikiwiki CMS/Groupware