Menu
Home
Contact us
Stats
Categories
Calendar
Toggle Wiki
Wiki Home
Last Changes
Rankings
List pages
Orphan pages
Sandbox
Print
Toggle Image Galleries
Galleries
Rankings
Toggle Articles
Articles home
List articles
Rankings
Toggle Blogs
List blogs
Rankings
Toggle Forums
List forums
Rankings
Toggle File Galleries
List galleries
Rankings
Toggle Maps
Mapfiles
Toggle Surveys
List surveys
Stats
ITHEA Classification Structure > G. Mathematics of Computing  > G.3 PROBABILITY AND STATISTICS 
ITHEA Classification Structure > H. Information Systems  > H.3 INFORMATION STORAGE AND RETRIEVAL  > H.3.3 Information Search and Retrieval 
ITHEA Classification Structure > I. Computing Methodologies  > I.2 ARTIFICIAL INTELLIGENCE  > I.2.4 Knowledge Representation Formalisms and Methods 
Solution of the Problem of Formal Evaluation of Effectiveness of ...
By: Nina Khairova, Nataliya Sharonova, Dmytro Uzlov (4190 reads)
Rating: (1.00/10)

Abstract: The traditional approach (the comparison with a "reference" result) for evaluating quality of the technology to identify knowledge extracted from text arrays is badly applicable out of a need to create the reference answer for each specific set of electronic documents. In this paper we show that integral quantitative coefficients of recall, precision and F-measure can be used to assess effectiveness of linguistic technologies of knowledge identification in texts. Justifying the possibility of using the test collections method for the experimental validation of obtained efficiency coefficients, we propose the use of the approach based on mathematical statistics methods. The procedures of using sampling fraction of the indicator as a characteristic of evaluating the proportion of relevant documents in the general population are reviewed. The paper shows the argumentation to the fact that, in important practical cases of text collection samples, asymmetry of a confidence interval at the binomial distribution can be overcome by approximated transition to the normal distribution. We also propose the methods of determining the confidence interval for the indicator fraction that are based on Wilson approach, and the method of determining the required size of the relevant sample depending on the specified error and confidence probability as well.

Key worlds: evaluation of effectiveness, semistructured text information, test collections method, size sample

ACM Classification Keywords: H.3.3 .Information Search and Retrieval, I.2.4. Knowledge Representation Formalisms and Methods, G.3. Probability and statistics – Statistical computing

Link:

Solution of the Problem of Formal Evaluation of Effectiveness of the Technology Knowledge Identification in Semistructured Text Information

Nina Khairova, Nataliya Sharonova, Dmytro Uzlov

http://www.foibg.com/ijicp/vol01/ijicp01-03-p03.pdf

Print
G.3 PROBABILITY AND STATISTICS
article: DECISION-MAKING IN GROUPS OF INTERVAL ALTERNATIVES · RISK BEHAVIOUR IN A SET OF INTERVAL ALTERNATIVES · Peculiarities Analysis of Statistical Information in ICT · About convergence of fuzzy perceptive elements sequences, defined on ... · INFORMATION SYSTEM OF FORECASTING BASED ON COMBINED MODELS WITH TIME SERIES ... · Integrated Approach to the Study of Fractal Time Series · METHOD OF DATA ANALYSIS BASED ON CLUSTERING IN “SYNDROMES” INDICATORS SPACE · MATRIXES LEAST SQUARES METHOD AND EXAMPLES OF ITS APPLICATION · PHYSICAL PHENOMENON OF STATISTICAL STABILITY · ANALYSIS OF FEATURES AND POSSIBILITIES OF BANK FUNCTIONING EFFICIENCY BASED ... · SUB-OPTIMAL NONPARAMETRIC HYPOTHESES DISCRIMINATING WITH GUARANTEED DECISION · Solution of the Problem of Formal Evaluation of Effectiveness of ... · ANALYSIS OF THE PROPERTIES OF ORDINARY LEVY MOTION BASED ON THE ESTIMATION ... · SPREADING THE MOORE - PENROSE PSEUDO INVERSE ON MATRICES EUCLIDEAN SPACES: ... · Evaluating Expected Effectiveness of Interval Alternatives · · EVALUATING EXPECTED EFFECTIVENESS OF INTERVAL ALTERNATIVES · COMPARISON OF DIFFERENT WAVELET BASES IN THE CASE OF WAVELETS EXPANSIONS... · О ПРИМЕНИМОСТИ ОЦЕНКО МАТЕМАТИЧЕСКОГО ОЖИД · VECTORS AND MATRIXES LEAST SQUARES METHOD: FOUNDATION AND APPLICATION ... · VECTORS AND MATRIXES IN GROUPING INFORMATION PROBLEM · ОЦЕНКА ИНТЕРВАЛЬНЫХ АЛЬТЕРНАТИВ:... · ON SOME PROPERTIES OF REGRESSION MODELS BASED ON CORRELATION MAXIMIZATION ... · RECURRENT PROCEDURE IN SOLVING THE GROUPING INFORMATION PROBLEM IN APPLIED... · DIVERGENT AND MULTIPLE-VALUED SEQUENCES AND FUNCTIONS · ‘FEATURE VECTORS’ IN GROUPING INFORMATION PROBLEM IN APPLIED MATHEMATICS: .. · MODELING TELECOMMUNICATIONS TRAFFIC USING THE STOCHASTIC MULTIFRACTAL CASCADE... · INTERVALS AS ULTRAMETRIC APPROXIMATIONS ACCORDING TO THE SUPREMUM NORM · DIFFERENTIAL GEOMETRY DERIVED FROM DIVERGENCE FUNCTIONS... · COMPARATIVE ANALYSIS FOR ESTIMATING OF THE HURST EXPONET FOR STATIONARY AND ... · DISTURBANCE OF STATISTICAL STABILITY (PART II) · FUZZY SETS AS A MEAN FOR UNCERTAINTY HANDLING: MATH, APPLIED MATH, HEURISTICS · FUZZY SETS: MATH, APPLIED MATH, HEURISTICS? PROBLEMS AND INTERPRETATIONS · СИММЕТРИЯ В ЗАПИСИ ГЕНЕТИЧЕСКОЙ ИНФОРМАЦИИ · ЕВКЛИДОВЫ ПРОСТРАНСТВА ЧИСЛОВЫХ ВЕКТОРОВ И · METHOD FOR EVALUATING OF DISCREPANCY BETWEEN REGULARITIES SYSTEMS IN ... · CORRELATION MAXIMIZATION IN REGRESSION MODELS BASED ON CONVEX COMBINATIONS · NEURAL NETWORK SEGMENTATION OF VIDEO VIA TIME SERIES ANALYSIS · GOD-ICS. ON FUNDAMENTAL INFORMATION FIELD QUEST · ОЦЕНИВАНИЕ РИСКА РЕГРЕССИОННОЙ МОДЕЛИ В СЛ� · ПОСТРОЕНИЕ ЛОГИКО-ВЕРОЯТНОСТНЫХ МОДЕЛЕЙ В� · ОПТИМИЗАЦИЯ ОЦЕНКИ ВЕРОЯТНОСТИ ОШИБОЧНОЙ К · ON A PROBLEM OF QOS CHARACTERISTICS INTERPRETATION IN TRANSIT NETWORKS · OPTIMAL FORECASTING BASED ON CONVEXCORRECTING PROCEDURES · COMPARATIVE ANALYSIS OF STATISTICAL PROPERTIES OF THE HURST EXPONENT ... · DISTURBANCE OF STATISTICAL STABILITY · A SURVEY OF NONPARAMETRIC TESTS FOR THE STATISTICAL ANALYSIS OF EVOLUTIONARY ... · COGNITION HORIZON AND THE THEORY OF HYPER-RANDOM PHENOMENA · IMPROVED CRYPTOANALYSIS OF THE SELF-SHRINKING ... · EVALUATION OF PARETO/D/1/K QUEUE BY SIMULATION · N A PROBLEM OF QOS CHARACTERISTICS INTERPRETATION IN TRANSIT NETWORKS · APPLICATION OF THE HETEROGENEOUS SYSTEM PREDICTION METHOD · STUDY OF QUEUEING BEHAVIOUR IN IP BUFFERS · EXTREME SITUATIONS PREDICTION BY MULTIDIMENSIONAL HETEROGENEOUS ... · APPLICATION OF THE MULTIVARIATE PREDICTION METHOD TO TIME SERIES 1 · DETECTION OF LOGICAL-AND-PROBABILISTIC CORRELATION IN TIME SERIES1 ·
H.3.3 Information Search and Retrieval
article: Facts extraction from the semi-structured text information · NEAREST NEIGHBOR SEARCH AND SOME APPLICATIONS · SOCIAL SEARCH ENGINE AND INTELLECTUAL DATABASE OF PEOPLE · INTELLECTUAL SEARCH ENGINE OF ADEQUATE INFORMATION IN INTERNET FOR CREATING ... · ИНФОРМАЦИОННАЯ ТЕХНОЛОГИЯ ПРИМЕНЕНИЯ СЕМАНТИЧЕСКИ ОРИЕНТИРОВАННЫХ МЕТОДОВ ... · SYSTEM OF INTELLIGENT SEARCH, CLASSIFICATION AND DOCUMENT SUMMARISATION FOR INTE · IMPLEMENTATION OF DICTIONARY LOOKUP AUTOMATA FOR UNL ANALYSIS AND GENERATION · THE SYSTEM OF MULTILINGUAL TEXT DATA PROCESSING ON THE BASE OF THE MODIFIED ... · LOGIC-LINGUISTIC MODEL OF FACT GENERATION FROM TEXT STREAMS OF CORPORATE... · ЛОГИКО-ЛИНГВИСТИЧЕСКАЯ МОДЕЛЬ ИЗВЛЕЧЕНИЯ ФАКТОВ ИЗ СЛАБОСТРУКТУРИРОВАННОЙ ... · Solution of the Problem of Formal Evaluation of Effectiveness of ... · МЕТОДЫ АВТОМАТИЗИРОВАННОГО ДИСКУРСИВНОГО � · BUILDING THE LIBRARY CATALOG SEARCH MODEL BASED ON THE FUZZY SIMILARITY ... · DATABASE SERVER USAGE IN THE SOCIAL NETWORKS ANALYSIS · REGIONS OF SUFFICIENCY FOR METRICAL DATA RETRIEVAL · DATA AND METADATA EXCHANGE REPOSITORY USING AGENTS IMPLEMENTATION · DISTANCE MATRIX APPROACH TO CONTENT IMAGE RETRIEVAL · DISTANCE MATRIX APPROACH TO CONTENT IMAGE RETRIEVAL · BRIDGING THE GAP BETWEEN HUMAN LANGUAGE AND COMPUTER-ORIENTED REPRESENTATIONS ·
I.2.4 Knowledge Representation Formalisms and Methods
article: ONTOLOGY OF EDUCATIONAL STANDARDS · MULTI-LAYER KNOWLEDGE REPRESENTATION · Taxonomyzation of Natural Language Texts · Facts extraction from the semi-structured text information · SEMANTIC NET FROM CONCEPTS AS A MODEL OF STUDENT’S KNOWLEDGE: HOW STABLE ARE ... · ТЕМПОРАЛЬНЫЕ СЕТИ ПЕТРИ И ИХ ПРИМЕНЕНИЕ В ИНТЕЛЛЕКТУАЛЬНЫХ СИСТЕМАХ ПОДДЕРЖКИ... · МЕТОДЫ МОДЕЛИРОВАНИЯ ВРЕМЕННЫХ ЗАВИСИМОСТЕЙ В ИНТЕЛЛЕКТУАЛЬНЫХ СИСТЕМАХ С ... · KNOWLEDGE REPRESENTATION IN THE AUTOMATED LEARNING SYSTEMS · ИНВАРИАНТНЫЕ ЗАДАЧИ ОНТОЛОГИЧЕСКИХ СИСТЕМ · МОДЕЛЬ ОНТОЛОГИЧЕСКОГО ИНТЕРФЕЙСА АГРЕГАЦИИ ИНФОРМАЦИОННЫХ РЕСУРСОВ И СРЕДСТВ... · МОДЕЛИРОВАНИЕ ВРЕМЕННЫХ ЗАВИСИМОСТЕЙ В ИНТЕЛЛЕКТУАЛЬНЫХ СИСТЕМАХ ПОДДЕРЖКИ ... · ФОРМАЛИЗАЦИЯ ПРОБЛЕМЫ ИЗВЛЕЧЕНИЯ ЗНАНИЙ ИЗ ЕСТЕСТВЕННО ЯЗЫКОВЫХ ТЕКСТОВ · ЦЕЛОСТНОСТЬ ОБРАЗОВ: О МОДЕЛИРОВАНИИ СМЫСЛА И ПОНИМАНИЯ · К ВОПРОСУ ВИЗУАЛИЗАЦИИ ОНТОГРАФОВ ПРИ РАЗРАБОТКЕ ОНТОЛОГИЙ ПРЕДМЕТНЫХ ДИСЦИПЛИН · СИСТЕМА ПРЕДОСТАВЛЕНИЯ ДИСТАНЦИОННЫХ УСЛУГ В ОБРАЗОВАНИИ ... · ИНСТРУМЕНТЫ ПОДДЕРЖКИ ПРОЦЕССОВ АНАЛИТИЧЕСКОЙ ДЕЯТЕЛЬНОСТИ ЭКСПЕРТА ... · ОБНАРУЖЕНИЕ ЗНАНИЙ НА ОСНОВЕ СЕТЕВЫХ СТРУКТУР · МЕТОДИКА ИСПОЛЬЗОВАНИЯ СРЕДСТВ СТРУКТУРИЗАЦИИ УЧЕБНОГО МАТЕРИАЛА · SELF-MODIFICATED PREDICATE NETWORKS · ON PROBLEM OF ADEQUACY OF MULTISET MATHEMATICAL MODELS · LOGIC-LINGUISTIC MODEL OF FACT GENERATION FROM TEXT STREAMS OF CORPORATE... · ON A METHOD OF MULTI-ALGORITHMIC CLASSIFICATION · DEVELOPMENT, STUDY AND PRESENTATION OF FUNCTIONS AND OPERATIONS ON ONTOLOGIES · PROCESSING SETS OF CLASSES’ LOGICAL REGULARITIES · PECULIARITIES OF LINKED DATA PROCESSING IN SEMANTIC APPLICATIONS · THE INVERSE MASLOV METHOD AND ANT TACTICS FOR EXHAUSTIVE SEARCH DECREASING · THE INTELLIGENT DECISION SUPPORT SYSTEM FOR DIAGNOSTIC OF DIFFICULT DISEASES... · OWL as a Standard Model for Transdisciplinary Knowledje Representation in ... · Solution of the Problem of Formal Evaluation of Effectiveness of ... · TOWARDS A SEMANTIC CATALOG OF SIMILARITY MEASURES · CONSTRUCTION OF CLASS LEVEL DESCRIPTION FOR EFFICIENT RECOGNITION OF ... · ANALYSIS AND PROCESSING OF THE TEXT INFORMATION AIMED AT EXTRACTING BASIC ... · METHODS AND TOOLS OF KNOWLEDGE MANAGEMENT AT THE SEMANTIC WEB ENVIROMENT · CREATING SET OF RELATED CONCEPTS FOR AUTOMATIC ONTOLOGY SYNTHESIS · АРХИТЕКТУРНО-СТРУКТУРНЫЕ ОСОБЕННОСТИ СРЕД� · EFFICIENT SIMULATION FOR PROLOG IMPLEMENTATION OF IMAGE RECOGNITION PROBLEM · SOFTWARE FOR THE RECOGNITION OF POLYHEDRON CONTOUR IMAGES IN THE FRAMEWORK ... · THE INVERSE METHOD FOR SOLVING ARTIFICIAL INTELLIGENCE PROBLEMS IN ... · КОГНИТИВНАЯ СЕМИОТИКА В ПРОЦЕССАХ ... · ON COMBINATION OF DEDUCTION AND ANALYTICAL TRANSFORMATIONS ... · DISTANCE BETWEEN OBJECTS DESCRIBED BY PREDICATE FORMULAS · TACIT KNOWLEDGE AS A RESOURCE FOR ORGANIZATIONS AND ITS INTENSITY IN VARIOUS ... · ФИЗИКО-ОНТОЛОГИЧЕСКИЙ ПОДХОД К ПОСТРОЕНИЮ � · DISCRETE ARTIFICIAL INTELLIGENCE PROBLEMS AND NUMBER OF STEPS OF THEIR SOLUTION · ОТОБРАЖЕНИЕ И ВЫВОД ПО АНАЛОГИИ НА ОСНОВЕ Н · ОПРЕДЕЛЕНИЕ ПОНЯТИЯ «СМЫСЛ» ЧЕРЕЗ ОНТОЛОГИ · ОБРАБОТКА ПРЕДЛОЖЕНИЙ ЕСТЕСТВЕННОГО ЯЗЫКА · К АНАЛИЗУ ЕСТЕСТВЕННО-ЯЗЫКОВЫХ ОБЪЕКТОВ · ИСПОЛЬЗОВАНИЕ ТЕХНОЛОГИИ SEMANTIC WEB ДЛЯ ИНТЕЛЛ · MERGING WIKI AND ONTOLOGICAL APPROACH TO E-LEARNING PORTAL DESIGN · METHODS OF SYNTHESIZING REVERSIBLE SPATIAL MULTIVALUED STRUCTURES OF ... · INTEGRATION OF FINANCIAL DOMAIN KNOWLEDGE ON BASE OF SEMANTIC WEB TECHNOLOGIES · PRESENTATION OF ONTOLOGIES AND OPERATIONS ON ONTOLOGIES IN FINITE-STATE ... · BASIC PRINCIPLES OF ORGANIZATION OF THE MEDIUM AND THINKING ... · METHODS OF SYNTHESIZING REVERSIBLE SPATIAL MULTIVALUED ... · TOWARDS CONTENT-SENSITIVE ACCESS TO THE ARTEFACTS OF THE BULGARIAN ICONOGRAPHY · USE OF KNOWLEDGE TECHNOLOGIES FOR PRESENTATION OF BULGARIAN FOLKLORE ... · DOUBLE-WAVELET NEURON BASED ON ANALYTICAL ACTIVATION FUNCTIONS · MATHEMATICAL MODELS OF DOMAIN ONTOLOGIES1 · FORMING KNOWLEDGE BASES IN THE COMPUTER KNOWLEDGE BANK ON MEDICAL DIAGNOSTICS 1 · ONTOLOGICAL APPROACH TO DOMAIN KNOWLEDGE REPRESENTATION FOR INFORMATION ... · AN ANALYSIS OF SOME RELATIONS AMONG DOMAIN ONTOLOGIES1 · NEW KNOWLEDGE OBTAINING IN STRUCTURAL-PREDICATE MODELS OF KNOWLEDGE · A MATHEMATICAL APPARATUS FOR ONTOLOGY SIMULATION. SPECIALIZED EXTENSIONS OF ... · A MATHEMATICAL APPARATUS FOR DOMAIN ONTOLOGY SIMULATION. AN EXTENDABLE ... · A MATHEMATICAL APPARATUS FOR DOMAIN ONTOLOGY SIMULATION. LOGICAL ... · DOMAINS WITH COMPLICATED STRUCTURES AND THEIR ONTOLOGIES1 · KNOWLEDGE-BASED ROBOT CONTROL · MULTIALGEBRAIC SYSTEMS IN INFORMATION GRANULATION · SELFSTRUCTURIZED SYSTEMS ·
Login
[ register | I forgot my password ]
World Clock
Powered by Tikiwiki Powered by PHP Powered by Smarty Powered by ADOdb Made with CSS Powered by RDF powered by The PHP Layers Menu System
RSS Wiki RSS Blogs rss Articles RSS Image Galleries RSS File Galleries RSS Forums RSS Maps rss Calendars
[ Execution time: 0.08 secs ]   [ Memory usage: 7.69MB ]   [ GZIP Disabled ]   [ Server load: 0.26 ]
Powered by Tikiwiki CMS/Groupware