inner structure and method for pattern recognition



Idea of inner structure to explain diversity

  As forming process, from origin to superficial feature, that is final observed value, by some transformation stages , inner structure of the observed is hidden.
i think that intellectual pattern recognition is to clarify the inner structure.

In early age, in order to explain diversity, that is,  many observed samples are born out of one original image, i thought the idea of multistage structure, which observed final object is made from original one by some transformation stages. showing figure below.



For instance, when observed value is defined as frequency spectrum, like image reflected on a curved distorted mirror, transformation which converts frequency higher or lower must be necessary.

And in actual,  solving multistage structure equation, due to its many degrees of freedom, solution cannot be determined. Any convenient answer value can be required by many degrees of freedom of multistage. So, as it is, multistage structure equation isn't useful for pattern recognition.


(Addition) Characterized  Structure
Let's think about general purpose structure which is able to produce everything. And change general purpose structure to one structure which can produce restricted production  by limited degrees of freedom. Suppose that its name is characterized  structure. Kinds of character will be many.

Using the characterized structure, forced solve how input sample is produced. It will be classified into two, that one is that represents well by the characterized  structure, another is that does not fit to represent it by the characterized  structure.

When solving structure equation, as many degrees of freedom, we cannot get proper answer which is explanatory. Is this matter solved by consisting of characterized  structures ?




Matching and Learning
  In fact, it is not  achievable to perfectly compose the sample of produce by structures. So, matching is judgment of whether, as a part of it,  the sample includes the structure, produce by the structure, the feature.  When the sample includes some kinds of structure, some characterized structures, it will be based on  combination of them.

Learning is to estimate common core, structure , which is included as a common along every sample.




Diversity reduction
  In order to reduce diversity of observed shape, i think of utilizing harmonic elements of speech wave, of which basic cycle is  fundamental frequency of speech signal wave (pitch). As use harmonic elements, diversity caused by transformation which converts frequency higher or lower can be avoided.



Similar to formula of Quantum mechanics, transformation operator F which connects original image x,  that corresponds to  sound source in this content, with observed value y, may make group.  i think its possibility.
Especially,  regarding to human speech,  quantity of kind of the group may be limited number, due to physically allocation and their movement of  human  mouth,  tongue, and  nose. i guess so.



What discriminate-able is
  In this world, there are two kinds.  One is what can be discriminated. Another is what cannot be discriminated because of its chaotic state or undistinguished complex. So, i think that  the necessary condition to be discriminate-able maybe exists. Like Complex Function Theory including useful ways born out of only one simple theorem, i dream of that some useful ways will be developed from the necessary condition to be discriminate-able.  Simultaneous Existence is defined as concept of discriminate-able, that is,  relative ones are co-exist.



Mathematics application of these idea
 The idea, simultaneous existence, inner structure, and so on  should be corresponded to mathematics method for actual solution of pattern recognition problem. Limited value function, for example is n-value (n={2,3, ...}), is one mathematics method of simultaneous existence. Inner structure is represented as structure equation. Also, every production trace, it may be huge, is other mathematics method of inner structure. To estimate and determinate everything of inner structure at once won't go well,  due to its many degrees of freedom.  One better method is that determinate outline and then divide into parts and consider detail, and do that procedure repeatedly until reasonable state, like cell division process in biology.



If it is applied to speech recognition,
from simple discrimination which any sound is or not,  to more mixed discrimination, until the discrimination level of human ability, that may is similar to human evolution process.


Find out key of phoneme in diversity per time dimension
  Speech signal changes per time. It's hard to determine perfectly time division of the phoneme on speech signal wave.
So, suppose that there are two kinds of portion, one is key portion which means phoneme, and another is transitional portion which is secondary effect and is less important.

And then, i think that,

A: Course from one key to next key isn't  single way, but many courses is possible,
because transitional portion can change roughly. So, database made from them will not converge.

B: Analyze inner structure of key portion and its around, whether it hide hint to recognize.



Comparison of speech signal change per time dimension with mountain climbing.
From way in to top of mountain, from top of mountain to the top of next mountain,
from the top of next mountain to way out, there are many possible courses which you
can choose. This is compared to diversity per time dimension.  
However, top of mountain and its around which is compared to key of phoneme is only
one for each mountain.
And also, mountain geographical features like soft slope or cliff, that is structure, may
give hint to recognize.





No.13(E),  13 April  2008




 A proverb: "Science is like that, even though you open one door ahead, one more door will appear."