


卷 26, 编号 4 (2016)
- 年: 2016
- 文章: 21
- URL: https://bakhtiniada.ru/1054-6618/issue/view/12238
Mathematical Method in Pattern Recognition
Tight risk bounds for multi-class margin classifiers
摘要
We consider a problem of risk estimation for large-margin multi-class classifiers. We propose a novel risk bound for the multi-class classification problem. The bound involves the marginal distribution of the classifier and the Rademacher complexity of the hypothesis class. We prove that our bound is tight in the number of classes. Finally, we compare our bound with the related ones and provide a simplified version of the bound for the multi-class classification with kernel based hypotheses.



Semi-supervised classification using multiple clusterings
摘要
Graph determines the performance of graph-based semi-supervised classification. In this paper, we investigate how to construct a graph from multiple clusterings and propose a method called Semi-Supervised Classification using Multiple Clusterings (SSCMC in short). SSCMC firstly projects original samples into different random subspaces and performs clustering on the projected samples. Then, it constructs a graph by setting an edge between two samples if these two samples are clustered in the same cluster for each clustering. Next, it combines these graphs into a composite graph and incorporates the resulting composite graph with a graph-based semi-supervised classifier based on local and global consistency. Our experimental results on two publicly available facial images show that SSCMC not only achieves higher accuracy than other related methods, but also is robust to input parameters.



Representation, Processing, Analysis, and Understanding of Images
Difference intensity distance group pattern for recognizing actions in video using Support Vector Machines
摘要
Recognition of human actions is a very important, task in many applications such as Human Computer Interaction, Content based video retrieval and indexing, Intelligent video surveillance, Gesture Recognition, Robot learning and control, etc. An efficient action recognition system using Difference Intensity Distance Group Pattern (DIDGP) method and recognition using Support Vector Machines (SVM) classifier is presented. Initially, Region of Interest (ROI) is extracted from the difference frame, where it represents the motion information. The extracted ROI is divided into two blocks B1 and B2. The proposed DIDGP feature is applied on the maximum intensity block of the ROI to discriminate the each action from video sequences. The feature vectors obtained from the DIDGP are recognized using SVM with polynomial and RBF kernel. The proposed work has been evaluated on KTH action dataset which consists of actions like walking, running, jogging, hand waving, clapping and boxing. The proposed method has been experimentally tested on KTH dataset and an overall accuracy of 94.67% for RBF kernel.



Comparison of image recognition efficiency of Bayes, correlation, and modified Hopfield network algorithms
摘要
The statistical estimates of the probability of correct recognition of the images, noisy reference by an additive handicap, for Bayes, correlation, and modified Hopfield network algorithms are compared. It is shown that, in the case of complete a priori probability concerning a handicap, the modified Hopfield network algorithm reaches the quality of the Bayes algorithm. At a deviation a priori probability on a handicap, the quality of the Bayes algorithm is worse than that of the modified Hopfield network algorithm. The correlation algorithm is worse than the modified Hopfield network algorithm, in general.



Quaternion angular radial transform and properties transformation for color-based object recognition
摘要
Nowadays, with the increased use of digital images, almost all of which are in color format. Conventional methods process color images by converting them into gray scale, which is definitely not effective in representing and which may lose some significant color information. Recently, a novel method of the Color Angular Radial Transform (CART) is presented. This transform combines the information by considering the shape information inherent in the color. However, ART is adapted on the MPEG-7 standard is only limited to binary images and gray-scale images has many properties: invariant to rotation, Translation and scaling, ability to describe complex objects, so it cannot handle color images directly. To solve this problem we proposed in this article to generalize ART from complex domain to hypercomplex domain using quaternion algebras to achieve the Quaternion Angular Radial Transform (QART) to describe finally two dimensional color images and to insure these properties robustness to all possible rotations and translation and scaling. The performance of QART is then evaluated with large database of color image as an example. We first provide a general formula of ART from which we derive a set of quaternion-valued QART properties transformations by eliminating the influence of transformation parameters. The experimental results show that the QART performs better than the commonly used Quaternion form Zernike Moment (QZM) in terms of image representation capability and numerical stability.



On the probability of the formation of local groups in random point images
摘要
Original programming, combinatorial, and geometric schemes are presented. They have been developed and used by the authors to calculate exact analytical formulas that describe the probability of the formation of local groups of a given size in random point images. Formulas, which will be discussed below, arise in the assessment of the reliability detection of point images, when they are detected by scanning the aperture that has a limited number of thresholds. In this paper, significant attention is paid to the formulation and solution of difficult combinatorial problems that have been encountered in the course of the investigation and that are associated with new generalization of the Catalan numbers.



Detecting computer generated images based on local ternary count
摘要
Local binary patterns was used to distinguish the Photorealistic Computer Graphics and Photographic Images, however the dimension of the extracted features is too high. Accordingly, the Local Ternary Count based on the Local Ternary Patterns and the Local Binary Count was developed in this study. Furthermore, a novel algorithm is presented based on the Local Ternary Count to classify photorealistic Computer Graphics and Photographic images. The experiment results show that the proposed algorithm effectively reduces the dimension of the classification features and maintains a good classification performance.



Effective energy-based multi-view piecewise planar stereo
摘要
For piecewise planar scene modeling, many challenging issues still persist, in particular, how to generate sufficient candidate planes and how to assign an optimal plane for each spatial patch. To address these issues, we present a novel multi-view piecewise planar stereo method for the complete reconstruction. In our method, reconstruction is formulated as an energy-based plane labeling problem, where photo-consistency and geometric constraints are incorporated to a unified superpixel-level MRF (Markov Random Field) framework. To enhance the efficacy of the plane inference and optimization, an effective multi-direction plane sweeping with much restricted search space is carried out to generate sufficient and reliable candidate planes. Experiments show that our method can effectively handle many challenging factors (e.g., slant surfaces, textureless regions) and achieve satisfactory results.



Software and Hardware for Pattern Recognition and Image Analysis
Object-oriented topological management system of spatially-distributed databases
摘要
This paper discusses the problems associated with the development of the specialized database management system (DBMS) GIS Terra Plus for storage, processing, and analysis of spatially distributed data (SDD). Description models and the presentation structure of the SDD, the scheme of databases of the GIS Terra Plus DBMS, the query system, and operating conditions are considered.



Applied Problems
Classification of long-bone fractures based on digital-geometric analysis of X-ray images
摘要
The classification of fractured of a patient plays an important role in orthopaedic evaluation and diagnosis. It not only aids in assessing the severity of the disease or injury but also serves as a basis of treatment or surgical correction. This paper proposes a novel approach to automated classification of long-bone fractures based on the analysis of an input X-ray image. The method consists of four major steps: (i) extraction of the bone-contour from a given X-ray image, (ii) identification of fracture-points or cracks, (iii) determination of an equivalent set of geometric features in tune with the Müller-AO clinical classification of fractures, and (iv) identification and detailed assessment of the fracture-type. The decision procedure makes use of certain geometric properties of digital curves such as relaxed digital straight line segments (RDSS), arcs, discrete curvature, and concavity index. The proposed method for the analysis of fractures is applied on different types of bone-images and is observed to have produced correct classification in most of the test-cases.



Contour analysis of a fine structure in an electroencephalogram
摘要
An electroencephalogram (EEG) is examined as an image limited by a polygonal contour. A sequence of elementary vectors presented in analytical form, which approximates the contour line, is accepted as an electroencephalogram model. A concept on a fine structure in the electroencephalogram as a set of elementary vectors that defines the segmented parts obtained as a result of electroencephalogram decomposition is introduced. The expressions for a measure of similarity of two segments of the fine structure in a form of magnitude of their normalized scalar product are obtained. An example on detecting and analyzing the “Sleep Spindle” pattern, which is observed in the frequency range of 9–13 Hz, by using the contour model, is presented.



Estimation of geomagnetic field disturbance using the wavelet transform
摘要
This paper discusses the main aspects of geomagnetic data processing using the wavelet transform. The wavelet transform is shown to be efficient for automatic extraction of unperturbed level of the horizontal component of the Earth’s magnetic field. As a result, it becomes possible to significantly reduce the errors arising during automatic calculations of the local geomagnetic activity index (local K-index) in comparison with adaptive smoothing (KAsm is Adaptative Smoothing method) recommended by INTERMAGNET. It has been found that prior to magnetic storms, we can observe a weak rise of geomagnetic activity in different frequency bands connected with the development of an approaching storm.



Recognition of biomedical signals based on their spectral description data analysis
摘要
Methods for generating spectral features in biosignal recognition in frequency domain are described. The method of linear decision rules constructing using the Fisher’s criterion is discussed. The efficiency of the method is investigated on the example of complex arrhythmia recognition according to the spectral description of electrocardiosignals.



Magnetometry data processing to detect archaeological sites
摘要
The article presents the algorithm that allows one to carry out automatic detection of local anomalies of magnetometric data based on image analysis principles. The algorithm was tested on the available data of the magnetometric survey in areas of planned archaeological excavations.



Application of texture features for comparing the facture of paintings
摘要
The paper continues investigations on the development of a computer-aided method for the analysis of images of the facture of pictorial artworks. The feature description of the facture of paintings is extended. An earlier developed method of comparing a facture by features extracted from informative fragments of digitized images of portraits is applied. The features describe the direction of a brushstroke, which characterizes the artistic manner specific for specific details of a painting. A numerical experiment has shown that the value of the quantitative similarity index of paintings of the same artist (F.S. Rokotov) is higher than the value of the quantitative similarity index of the works of different artists. A portrait attributed to Rokotov is compared with his authentic works, as well as with portraits painted by different artists of the XVII−XIX centuries. The results of the computer analysis of the facture of paintings do not contradict the results of traditional technico-technological investigations.



Metric classification of early Parkinsonism in the space of electroencephalographic features
摘要
This paper considers the problem of metric classification of early Parkinsonism in the feature space of multi-channel signals of electroencephalography (EEG). The electroencephalography feature space includes both spectral characteristics and features of rhythmic disorganization. A model of logistic regression for the classification of early Parkinsonism is studied. The model was trained on the data obtained from the experimental EEG studies in a group of patients in the 1st stage of Parkinson’s disease and a control group of subjects. Analysis of the classification logistic model was carried out using the data from 38 subjects, including 22 subjects from the control group and 16 patients in the first stage of Parkinson’s disease. Dependencies of the recall on the functional value for the control group and the patients and classification accuracies are calculated.



Implementation of the zero-padding interpolation technique to improve angular resolution of X-ray tomographic acquisition system
摘要
This paper suggests the implementation of an algorithm that improves the X-ray tomographic reconstruction quality of a parallel acquisition geometry by adding virtual projections to the reconstruction process in order to increase the angular resolution of the measuring device, as in the case of CT scanners. The suggested method is based on an estimated calculation of virtual projections using the zero-padding properties. The obtained results show a significant reduction of the noise effect on the reconstructed image and a remarkable quality improvement despite the use of a rudimentary tomographic device.



Iterative conic beam tomography based on Bayesian approach to radiation therapy
摘要
The key problem in increasing efficiency of radiotherapy of malignant tumors in brain and other dangerous neoplasms is the problem of increasing the quality of 3D positioning of a patient before radiotherapy. We consider the principles of development of fast parallel iterative algorithms based on graphic accelerators and the OpenGL library. The proposed approaches provide simultaneous residual minimization for the sought solution and total variation of the reconstructed 3D image. In this case, the number of required initial data, i.e., conic X-ray projections, can be reduced several times, and therefore, the radiation load on the patient can also be accordingly reduced with preservation of the necessary contrast and spatial resolution of the 3D image of the patient. The new heuristic iterative algorithm can be used as an alternative to the known 3D Feldkamp algorithm.



Article
From the National Committee of the Presidium of the Russian Academy of Sciences for Pattern Recognition and Image Analysis



From the National Committee of the Presidium of the Russian Academy of Sciences for Pattern Recognition and Image Analysis



Erratum
Erratum to: “An Algorithm for Recognizing Linear Objects in Aerial Photons Automatically” [Pattern Recognition and Image Analysis 26, 385 (2016)]
摘要
There are 2 errors in the paper of A.E.Levashov “An algorithm for recognizing linear objects in aerial photos of the author automatically” (PRIA, volume 26, Issue 2, page 385). The corrected version are as follows:
(1) The affiliation of the author is “The Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences (Moscow, Russian Federation).
(2) Тhe Acknowledgements
The paper was partly supported by the Russian Foundation for Basic Research (project 1401-00881).


