Menu
Home
Contact us
Stats
Categories
Calendar
Toggle Wiki
Wiki Home
Last Changes
Rankings
List pages
Orphan pages
Sandbox
Print
Toggle Image Galleries
Galleries
Rankings
Toggle Articles
Articles home
List articles
Rankings
Toggle Blogs
List blogs
Rankings
Toggle Forums
List forums
Rankings
Toggle File Galleries
List galleries
Rankings
Toggle Maps
Mapfiles
Toggle Surveys
List surveys
Stats
ITHEA Classification Structure > F. Theory of Computation  > F.2 ANALYSIS OF ALGORITHMS AND PROBLEM COMPLEXITY  > F.2.2 Nonnumerical Algorithms and Problems 
ITHEA Classification Structure > H. Information Systems  > H.3 INFORMATION STORAGE AND RETRIEVAL  > H.3.1 Content Analysis and Indexing 
ITHEA Classification Structure > H. Information Systems  > H.3 INFORMATION STORAGE AND RETRIEVAL  > H.3.3 Information Search and Retrieval 
ITHEA Classification Structure > I. Computing Methodologies  > I.2 ARTIFICIAL INTELLIGENCE  > I.2.7 Natural Language Processing 
ITHEA Classification Structure > I. Computing Methodologies  > I.7 DOCUMENT AND TEXT PROCESSING   > I.7.2 Document Preparation 
IMPLEMENTATION OF DICTIONARY LOOKUP AUTOMATA FOR UNL ANALYSIS AND GENERATION
By: Igor Zaslawskiy, Aram Avetisyan, Vardan Gevorgyan (2880 reads)
Rating: (1.00/10)

Abstract: In this paper we present some research results and propose solutions for natural language string lookup techniques. In particular a fast method is suggested for searching dictionary entries for possible matches of sentence words without using relational databases or full dictionary load into machine random access memory. Such approach is essential for minimizing the speed dependency from dictionary size and available machine resources as well as for the scalability of the analyzer software. The mentioned is based on an implementation of Aho-Corasick? Aho, Corasick, 1977 automata with a number of optimizations in the indexing and lookup algorithm.

Keywords: UNL, natural language processing, dictionary lookup, indexing, search, XML, pattern matching machine, string matching algorithm, information search

ACM Classification Keywords: F.2.2 Non-numerical Algorithms and Problems – Pattern matching, Sorting and searching, I.2.7 Natural Language Processing - Text analysis, Language parsing and understanding, I.7.2 Document Preparation – Index generation, Markup languages, H.3.1 Content Analysis and Indexing – Dictionaries, Indexing Methods, H.3.3 Information Search and Retrieval - Retrieval models, Search process.

Link:

IMPLEMENTATION OF DICTIONARY LOOKUP AUTOMATA FOR UNL ANALYSIS AND GENERATION

Igor Zaslawskiy, Aram Avetisyan, Vardan Gevorgyan

http://www.foibg.com/ijita/vol17/ijita17-2-p03.pdf

Print
F.2.2 Nonnumerical Algorithms and Problems
article: CONVEXITY RELATED ISSUES FOR THE SET OF HYPERGRAPHIC SEQUENCES · LINEAR PROGRAM FORM FOR RAY DIFFERENT DISCRETE TOMOGRAPHY · CONSTRAINT CONVEXITY TOMOGRAPHY AND LAGRANGIAN APPROXIMATIONS · IMPLEMENTATION OF DICTIONARY LOOKUP AUTOMATA FOR UNL ANALYSIS AND GENERATION · APPROXIMATION GREEDY ALGORITHM FOR RECONSTRUCTING OF (0,1)-MATRICES WITH DIFFERE · SELF-MODIFICATED PREDICATE NETWORKS · THE INVERSE MASLOV METHOD AND ANT TACTICS FOR EXHAUSTIVE SEARCH DECREASING · CONSTRAINED OBJECT-CHARACTERIZATION TABLES AND ALGORITHMS1 · CONSTRUCTION OF CLASS LEVEL DESCRIPTION FOR EFFICIENT RECOGNITION OF ... · EFFICIENT SIMULATION FOR PROLOG IMPLEMENTATION OF IMAGE RECOGNITION PROBLEM · THE INVERSE METHOD FOR SOLVING ARTIFICIAL INTELLIGENCE PROBLEMS IN ... · EVALUATION OF GREEDY ALGORITHM OF CONSTRUCTING (0,1)-MATRICES WITH DIFFERENT ... · DISCRETE ARTIFICIAL INTELLIGENCE PROBLEMS AND NUMBER OF STEPS OF THEIR SOLUTION · ON STRUCTURAL RECOGNITION WITH LOGIC AND DISCRETE ANALYSIS · SEQUENCING JOBS WITH UNCERTAIN PROCESSING TIMES ... · REALIZATION OF AN OPTIMAL SCHEDULE FOR THE TWO-MACHINE · RECENT RESULTS ON STABILITY ANALYSIS ... · LEARNING TECHNOLOGY IN SCHEDULING BASED ON THE MIXED GRAPHS ·
H.3.1 Content Analysis and Indexing
article: SOCIAL SEARCH ENGINE AND INTELLECTUAL DATABASE OF PEOPLE · INTELLECTUAL SEARCH ENGINE OF ADEQUATE INFORMATION IN INTERNET FOR CREATING ... · IMPLEMENTATION OF DICTIONARY LOOKUP AUTOMATA FOR UNL ANALYSIS AND GENERATION · THE SYSTEM OF MULTILINGUAL TEXT DATA PROCESSING ON THE BASE OF THE MODIFIED ... · AN APPROACH TO THE MODELING OF THE COGNITIVE ABILITY MANAGING THE FOCUS OF ... · MODEL OF LEXICOGRAPHICAL DATABASE: STRUCTURE, BASIC FUNCTIONALITY... · АСПЕКТЫ НЕКЛАССИЧЕСКОЙ ТЕОРИИ НОМИНАЦИИ И · TERMINOLOGICAL ANNOTATION OF THE DOCUMENT IN A RETRIEVAL CONTEXT ON THE BASIS... · THE DEVELOPMENT SUPPORT SYSTEM "ONTOINTEGRATOR" FOR LINGUISTIC APPLICATIONS · THE COMBINED APPROACH TO PRESENTATION OF MULTIMEDIA CONTENT ... · AUTOMATIC GENERATION OF TITLES FOR A CORPUS OF QUESTIONS · THE DEVELOPMENT SUPPORT SYSTEM "ONTOINTEGRATOR" ... · APPLIED PROBLEMS OF FUNCTIONAL HOMONYMY RESOLUTION FOR RUSSIAN LANGUAGE ·
H.3.3 Information Search and Retrieval
article: Facts extraction from the semi-structured text information · NEAREST NEIGHBOR SEARCH AND SOME APPLICATIONS · SOCIAL SEARCH ENGINE AND INTELLECTUAL DATABASE OF PEOPLE · INTELLECTUAL SEARCH ENGINE OF ADEQUATE INFORMATION IN INTERNET FOR CREATING ... · ИНФОРМАЦИОННАЯ ТЕХНОЛОГИЯ ПРИМЕНЕНИЯ СЕМАНТИЧЕСКИ ОРИЕНТИРОВАННЫХ МЕТОДОВ ... · SYSTEM OF INTELLIGENT SEARCH, CLASSIFICATION AND DOCUMENT SUMMARISATION FOR INTE · IMPLEMENTATION OF DICTIONARY LOOKUP AUTOMATA FOR UNL ANALYSIS AND GENERATION · THE SYSTEM OF MULTILINGUAL TEXT DATA PROCESSING ON THE BASE OF THE MODIFIED ... · LOGIC-LINGUISTIC MODEL OF FACT GENERATION FROM TEXT STREAMS OF CORPORATE... · ЛОГИКО-ЛИНГВИСТИЧЕСКАЯ МОДЕЛЬ ИЗВЛЕЧЕНИЯ ФАКТОВ ИЗ СЛАБОСТРУКТУРИРОВАННОЙ ... · Solution of the Problem of Formal Evaluation of Effectiveness of ... · МЕТОДЫ АВТОМАТИЗИРОВАННОГО ДИСКУРСИВНОГО � · BUILDING THE LIBRARY CATALOG SEARCH MODEL BASED ON THE FUZZY SIMILARITY ... · DATABASE SERVER USAGE IN THE SOCIAL NETWORKS ANALYSIS · REGIONS OF SUFFICIENCY FOR METRICAL DATA RETRIEVAL · DATA AND METADATA EXCHANGE REPOSITORY USING AGENTS IMPLEMENTATION · DISTANCE MATRIX APPROACH TO CONTENT IMAGE RETRIEVAL · DISTANCE MATRIX APPROACH TO CONTENT IMAGE RETRIEVAL · BRIDGING THE GAP BETWEEN HUMAN LANGUAGE AND COMPUTER-ORIENTED REPRESENTATIONS ·
I.2.7 Natural Language Processing
article: SYNTACTIC OPERATIONS – MODELING LANGUAGE FACULTY · ON MENTAL REPRESENTATIONS: LANGUAGE STRUCTURE AND MEANING REVISED · IMPROVING AUTOMATIC SPEECH RECOGNITION ACCURACY BY MEANS OF PRONUNCIATION VARIAT · УНИВЕРСАЛЬНАЯ СИСТЕМА ПРОГРАММ МОРФОЛОГИЧЕСКОГО АНАЛИЗА НАУЧНО-ТЕХНИЧЕСКИХ ... · SPAM AND PHISHING DETECTION IN VARIOUS LANGUAGES · GRAMMATICAL PRIMING DOES FACILITATE VISUAL WORD NAMING, AT LEAST IN SERBIAN · MULTILINGUAL REDUCED N-GRAM MODELS · COGNITIVE MODEL OF TIME AND ANALYSIS OF NATURAL LANGUAGE TEXTS · IMPLEMENTATION OF DICTIONARY LOOKUP AUTOMATA FOR UNL ANALYSIS AND GENERATION · О МОДЕЛИРОВАНИИ ПОНИМАНИЯ · ФОРМАЛЬНОЕ ОПРЕДЕЛЕНИЕ СИТУАЦИИ ДЛЯ СЕМАНТ · THE EDUCATIONAL TECHNOLOGY FOR LEARNING FOREIGN WORDS · PARAMETERIZATION OF COMMENTS FROM PERUVIAN FACEBOOK AND TWITTER... · THE STUDY OF FACTORS RELADED WITH SINGLE-DOCUMENT KEYWORD EXTRACTION · AUTOMATED TAG EXTRACTION & CLUSTERING IN DOCUMENTS CONTAINING COMPOSITIONAL ... · STUDYING SPECIAL TEXT RUSSIAN CORPORA BY THE LEXICO-SYNTACTIC MODELS · STUDYING SPECIAL TEXT RUSSIAN CORPORA BY THE LEXICO-SYNTACTIC MODELS · CLASSIFICATION OF PRIMARY MEDICAL RECORDS WITH RUBRYX-2: FIRST EXPERIENCE · MACHINE TRANSLATION IN THE COURSE “COMPUTER TECHNOLOGIES IN LINGUISTICS” .. · CLASSIFICATION OF FREE TEXT CLINICAL NARRATIVES (SHORT REVIEW) · METHODS AND TOOLS OF COMPUTATIONAL LINGUISTICS FOR THE CLASSIFICATION ... · LEXISTERM – THE PROGRAM FOR TERM SELECTION BY THE CRITERION OF SPECIFICITY · ELECTION DATA VISUALIZATION · COMPUTER SUPPORT OF SEMANTIC TEXT ANALYSIS OF A TECHNICAL SPECIFICATION ON ... · MOBILE ELECTION · MOBILE SEARCH AND ADVERTISING · ALGEBRA LOGIC APPROACH TO PERSON’S THINKING MECHANISMS FORMALIZATION · COMPUTER SUPPORT OF SEMANTIC TEXT ANALYSIS OF A TECHNICAL SPECIFICATION ON DESIG · LSPL-PATTERNS AS A TOOL FOR INFORMATION EXTRACTION FROM NATURAL LANGUAGE TEXTS · NUMERIC-LINGUAL DISTINGUISHING FEATURES OF SCIENTIFIC DOCUMENTS · HIERARCHICAL THREE-LEVEL ONTOLOGY FOR TEXT PROCESSING · HIERARCHICAL THREE-LEVEL ONTOLOGY FOR TEXT PROCESSING · COMPUTER-AIDED SYSTEM OF SEMANTIC TEXT ANALYSIS ... · METHODOLOGY FOR LANGUAGE ANALYSIS AND GENERATION ... · ANALYSIS AND COORDINATION OF EXPERT STATEMENTS IN THE PROBLEMS ... · SEMANTIC SEARCH OF INTERNET INFORMATION RESOURCES ON BASE OF ONTOLOGIES ... · INTELLIGENT SEARCH AND AUTOMATIC DOCUMENT CLASSIFICATION AND CATALOGING ... · VERBAL DIALOGUE VERSUS WRITTEN DIALOGUE · INFORMATION PROCESSING IN A COGNITIVE MODEL OF NLP · EXPERIMENTS IN DETECTION AND CORRECTION OF RUSSIAN MALAPROPISMS BY MEANS ... · COMMON SCIENTIFIC LEXICON FOR AUTOMATIC DISCOURSE ANALYSIS OF SCIENTIFIC ... ·
I.7.2 Document Preparation
article: INTEGRATION OF ONTOLOGY RESOURCES INTO OPEN FORMAT DOCUMENTS FOR ... · MULTIDIMENSIONAL ONTOLOGY OF ELECTRONIC DOCUMENT AS A BASE OF INFORMATION SYSTEM · ТЕХНОЛОГИЯ СОЗДАНИЯ ДОКУМЕНТ-ОРИЕНТИРОВАННЫХ СИСТЕМ, ОСНОВАННЫХ ... · IMPLEMENTATION OF DICTIONARY LOOKUP AUTOMATA FOR UNL ANALYSIS AND GENERATION · AGENT-BASED DOCUMENT MANAGEMENT WITHIN THE WHOLE LIFECYCLE OF... ·
Login
[ register | I forgot my password ]
World Clock
Powered by Tikiwiki Powered by PHP Powered by Smarty Powered by ADOdb Made with CSS Powered by RDF powered by The PHP Layers Menu System
RSS Wiki RSS Blogs rss Articles RSS Image Galleries RSS File Galleries RSS Forums RSS Maps rss Calendars
[ Execution time: 0.09 secs ]   [ Memory usage: 7.54MB ]   [ GZIP Disabled ]   [ Server load: 0.28 ]
Powered by Tikiwiki CMS/Groupware