模式识别科学发展与现状(4.展望)

来源:互联网 发布:提分宝典的真实性 知乎 编辑:程序博客网 时间:2024/04/29 15:22

4 Perspectives

4 看法

 

Pattern recognition deals with discovering, distinguishing, detecting or characterizing patterns present in the surrounding world. It relies on extraction and representation of information from the observed data, such that after integration with background knowledge, it ultimately leads to a formulation of new knowledge and concepts. The result of learning is that the knowledge already captured in some formal terms is used to describe the present interdependencies such that the relations between patterns are better understood (interpreted) or used for generalization. The latter means that a concept,e.g. of a class of objects, is formalized such that it can be applied to unseen examples of the same domain, inducing new information, e.g. the class label of a new object. In this process new examples should obey the same deduction process as applied to the original examples.

模式识别技术应用于发现、区分、检测或提取存在于我们周围世界中的模式,这依赖于怎么从观察数据中进行信息提取和表示,结合背景知识,最终得到新知识和概念的形式化内容。学习的结果是得到一个用于表示模式之间相互依赖的形式化知识,以此更好地理解(或解释)观察数据或进行推广。推广的意思是指一个概念(如对象的一个种类)被形式化后,这个概念可以被应用于相同领域未知的用例,包括新的信息,例如对一个新对象进行标识,且对于新用例的处理应当遵从应用于原来用例的相同的演绎过程。

 

In the next subsections we will first recapitulate the elements of logical reasoning that contribute to learning. Next, this will be related to the Platonic and Aristotelian scientific approaches discussed in Section 2. Finally, two novel pattern recognition paradigms are placed in this view.

在下面几节中我们将先概括应用于学习的逻辑推理的方法要素,接着在第二节中讨论有关柏拉图及亚里士多德科学研究方法,最后根据所讨论内容举两个模式识别的例子。

 

4.1 Learning by Logical Reasoning

4.1 逻辑推理性学习

 

Learning from examples is an active process of concept formation that relies on abstraction (focus on important characteristics or reduction of detail)and analogy (comparison between different entities or relations focussing on some aspect of their similarity). Learning often requires dynamical, multilevel(seeing the details leading to unified concepts, which further build higher level concepts) and possibly multi-strategy actions (e.g. in order to support good predictive power as well as interpretability). A learning task is basically defined by input data (design set), background knowledge or problem context and a learning goal [52]. Many inferential strategies need to be synergetically integrated to be successful in reaching this goal. The most important ones are inductive, deductive and abductive principles, which are briefly presented next. More formal definitions can be sought in the literature on formal logic,philosophy or e.g. in [23, 40, 52, 83].

学习用例是一种对概念信息的处理方法,也是一个很有用的处理方法,它依赖于对信息的抽象(专注于提取重要特征或减少细节描述)和分析(在不同实体之间进行比较或关注相似属性之间的关系)。学习经常需要是动态的、多层次的(如怎么得到一致性的概念以更进一步地建立在更高层次上的概念)和多重目的的(如使之具有较好的预测能力,即可判断能力)。学习任务的确立是基于输入数据(设计样本集),背景知识或问题的上下文,以及学习目标。最重要的是归纳、演译和诱导原理,下面会有简要介绍,更多的形式定义可以查看形式逻辑、哲学文献,在[23,40,52,83]中也有描述。

 

Inductive reasoning is the synthetic inference process of arriving at a conclusion or a general rule from a limited set of observations. This relies on a formation of a concept or a model, given the data. Although such a derived inductive conclusion cannot be proved, its reliability is supported by empirical observations. As along as the related deductions are not in contradiction with experiments, the inductive conclusion remains valid. If, however, future observations lead to contradiction, either an adaption or a new inference is necessary to find a better rule. To make it more formal, induction learns a general rule R (concerning A and B) from numerous examples of A and B. In practice, induction is often realized in a quantitative way. Its strength relies then on probability theory and the law of large numbers, in which given a large number of cases, one can describe their properties in the limit and the corresponding rate of convergence.

归纳性推理是一个综合的推导过程,从有限的观察数据中得到结论或一般性规则。归纳性推理依赖于给定数据情况下概念或模型的信息表示方法。虽然归纳出来的结论没办法被证明,但是它的可靠性是依靠实践经验观察出来的。只要相关的演译推理不会和实验得到的结论相抵触,归纳出来的结论就依然有效。然而,如果将来观察发现出现了冲突,这时就需要去适配或推导来寻找一个更好的规则。更为形式化地表示,即从许多用例A和B进行归纳学习得到一个一般性规则R。在实践中,归纳经常是通过量化的方法来实现。归纲性推理强有力的理论支持是概率理论和大数据量原则,在给定一个大数据量的用例中,才能够描述出它们的一定范围的性质及相应的覆盖率。

 

Deductive reasoning is the analytic inference process in which existing knowledge of known facts or agreed-upon rules is used to derive a conclusion. Such a conclusion does not yield ‘new’ knowledge since it is a logical consequence of what has already been known, but implicitly (it is not of a greater generality than the premises). Deduction, therefore, uses a logical argument to make explicit what has been hidden. It is also a valid form of proof provided that one starts from true premises. It has a predictive power, which makes it complementary to induction. In a pattern recognition system, both evaluation and prediction rely on deductive reasoning. To make it more formal, let us assume that A is a set of observations, B is a conclusion and R is a general rule. Let B be a logical consequence of A and R, i.e. (A ∧ R) |= B, where |= denotes entailment. In a deductive reasoning, given A and using the rule R,the consequence B is derived.

演译推理是分析性推理过程,通过已知事实的现存知识或一致被认可的规则推导出一个结论。既然这样的结论是从已知的知识中进行逻辑推导的结果,所以不能算是“新”知识,但具有隐含性(它比前提条件不更具一般性)。演译,即是运用一套逻辑方法把隐藏在背后的知识清晰起来。它也是一个从真实的前提进行实证的有效形式。它具有预言性功能,能弥补归纳方法的不足。更形式化地表示可以这样:假设A是一组观察数据,B是一个结论,R是一个一般性规则,则B是A和R的逻辑推导结果,如(A ∧ R) |= B |=表示蕴涵关系,演译推理过程中,给定A,运用规则R,结论B就能就此被推论出来。

 

Abductive reasoning is the constructive process of deriving the most likely or best explanations of known facts. This is a creative process, in which possible and feasible hypotheses are generated for a further evaluation. Since both abduction and induction deal with incomplete information, induction may be viewed in some aspects as abduction and vice versa, which leads to some confusion between these two [23, 52]. Here, we assume they are different. Concerning the entailment (A ∧ R) |= B, having observed the consequence B in the context of the rule R, A is derived to explain B.

溯因推理是构建推理过程,从最象或最具有解释性的已知事实中推理出结论。这是一个创造性过程,可能或可行性假设是因进一步需要推定而产生的。既然溯因推理和归纳推理是在不完整的信息中进行推理,从某些方面可能可以把归纳推理看成溯因推理,反之亦然,这样在二者之间会导致些混淆。这里,我们假定他们是不一样的,看这样的蕴涵关系:(A ∧ R) |= B,表示从规则R的上下文中观察出B结论,A被用来解释B

 

In all learning paradigms there is an interplay between inductive, abductive and deductive principles. Both deduction and abduction make possible to conceptually understand a phenomenon, while induction verifies it. More precisely, abduction generates or reformulates new (feasible) ideas or hypotheses,induction justifies the validity of these hypothesis with observed data and deduction evaluates and tests them. Concerning pattern recognition systems,abduction explores data, transforms the representation and suggests feasible classifiers for the given problem. It also generates new classifiers or reformulates the old ones. Abduction is present in an initial exploratory step or in the Adaptation stage; see Fig. 1. Induction trains the classifier in the Generalization stage, while deduction predicts the final outcome (such as label) for the test data by applying the trained classifier in the Evaluation stage.

在所有的学习方法中,归纳、演译和溯因推理法则之间相互影响着。演译和溯因推理都有可能进行概念性地理解事物,归纳推理则用来检验它。更确切地说,溯因方法产生或变革新的(或可行的)思想或假设,归纳方法根据观察数据判断这些假设的合理性,演译方法评估和验证假设。对于模式识别系统,溯因方法探究源数据,转换表示方法,对既定的问题提出可行的分类方法,也可能产生新的分类方法或变革旧方法。溯因方法一般用于初始时的探究阶段或适配阶段(见图1),在推广阶段归纳方法被用来训练分类器,在评估阶段演译方法用来预测经过训练后的分类器在测试数据下的最后输出(如标识名)。

 

Since abduction is hardly emphasized in learning, we will give some more insights. In abduction, a peculiarity or an artifact is observed and a hypothesis is then created to explain it. Such a hypothesis is suggested based on existing knowledge or may extend it, e.g. by using analogy. So, the abductive process is creative and works towards new discovery. In data analysis,visualization facilitates the abductive process. In response to visual observations of irregularities or bizarre patterns, a researcher is inspired to look for clues that can be used to explain such an unexpected behavior. Mistakes and errors can therefore serve the purpose of discovery when strange results are inquired with a critical mind. Note, however, that this process is very hard to implement into automatic recognition systems as it would require to encode not only the detailed domain knowledge, but also techniques that are able to detect ‘surprises’ as well as strategies for their possible use. In fact,this requires a conscious interaction. Ultimately, only a human analyst can interactively respond in such cases, so abduction can be incorporated into semi-automatic systems well. In traditional pattern recognition systems, abduction is usually defined in the terms of data and works over pre-specified set of transformations, models or classifiers.

因为溯因推理在学习过程中较难被重视,我们就多做这方面的论述。在溯因推理中,一旦一个特性或一个典型结果被发现到,于是就会创建一个假设来解释它。这种假设是基于已存在的知识或知识的延伸来提出来的,例如运用类推方法。所以,溯因推理是创造性的过程,以新发现为目的。在数据分析中,可视化工具较便利于进行溯因推理。利用了对不规则或奇异的模式进行可视化观察,研究者得到灵感,找到用来解释意外现象的线索。当新的结果被批判性地检查时就会发现有错误或误差。然而,要注意的是溯因方法很难实现自动识别系统,因为这不仅需要消化运用相关细节知识,而且还要有在各种运用方法中检测产生异常的技术。实际上,这需要一个意识交互作用。总之,只有人类分析才能够交互地对这种事做出反应,所以溯因方法可以被很好地应用到半自动化系统中。在传统的模式识别系统中,溯因方法通常是用来定义有关数据并是在转换、建模和设计分类器之前进行。

 

4.2 Logical Reasoning Related to Scientific Approaches

4.2 与科学研究方法有关的逻辑推理

 

If pattern recognition (learning from examples) is merely understood as a process of concept formation from a set of observations, the inductive principle is the most appealing for this task. Indeed, it is the most widely emphasized in the literature, in which ‘learning’ is implicitly understood as ‘inductive learning’. Such a reasoning leads to inferring new knowledge (rule or model)which is hopefully valid not only for the known examples, but also for novel,unseen objects. Various validation measures or adaptation steps are taken to support the applicability of the determined model. Additionally, care has to be taken that the unseen objects obey the same assumptions as the original objects used in training. If this does not hold, such an empirical generalization becomes invalid. One should therefore exercise in critical thinking while designing a complete learning system. It means that one has to be conscious which assumptions are made and be able to quantify their sensibility, usability and validity with the learning goal.

如果模式识别(从用例中学习)只是被理解成一个从一组观察中得到概念形成的过程,那么归纳方法是这个任务中最需要的,这在文献中最被广为强调:“学习”意味着“归纳学习”。这样的推理导致推论出新的知识(法则或模型),这个新知识不仅对已知的用例非常有效,而且对新出现的、未知的对象也有效。各种验证方法或适配步骤被用于支持决策模型的适用性。另外,要注意的是未知对象遵从同样的假设,即原来的对象被用在了训练中,如果不是这样,那么这样得到经验上的推广就变得无效。要设计一个完整的学习系统,就应当批判地去实践检验它,意思是必须意识到要做出什么样的假设和什么假设能够量化敏感性、可用性和有效性以达到学习目标。

 

On the other hand, deductive reasoning plays a significant role in the Platonic approach. This top-down scenario starts from a set of rules derived from expert knowledge on problem domain or from a degree of belief in a hypothesis. The existing prior knowledge is first formulated in appropriate terms.These are further used to generate inductive inferences regarding the validity of the hypotheses in the presence of observed examples. So, deductive formalism(description of the object’s structure) or deductive predictions (based on the Bayes rule) precede inductive principles. A simple example in the Bayesian inference is the well-known Expectation-Maximization (EM) algorithm used in problems with incomplete data [13]. The EM algorithm iterates between the E-step and M-step until convergence. In the E-step, given a current (or initial)estimate of the unknown variable, a conditional expectation is found, which is maximized in the M-step and derives a new estimate. The E-step is based on deduction, while the M-step relies on induction. In the case of Bayesian nets, which model a set of concepts (provided by an expert) through a network of conditional dependencies, predictions (deductions) are made from the (initial) hypotheses (beliefs over conditional dependencies) using the Bayes theorem. Then, inductive inferences regarding the hypotheses are drawn from the data. Note also that if the existing prior knowledge is captured in some rules, learning may become a simplification of these rules such that their logical combinations describe the problem.

在另一方面,演译推理在柏拉图式科学研究中起着重要的角色。这是自顶向下的过程,起始于一组规则,这些规则从某个领域的专家知识中得到,或从假设的可信度中得到。首先先验知识被以某种表示方法形式化,形式化后的知识就可以被用来在现有的观察数据中运用归纳推导法检验假设的有效性。所以演译形式(描述对象结构)或演译预测(基于贝叶斯法则)是在归纳法则之前的过程。在贝叶斯推导中一个简单的例子是大家都知道的最大期望算法(Expectation-Maximization (EM)),这种算法用在数据不完整的问题中。EM算法在E步骤和M步骤之间循环进行直到能够被收敛。在E步骤中,给定一个未知变量的当前(或初始)估值,找到一个条件期望值,期望值在M步骤中被最大化并得到一个新估值。E步骤是基于演译方法,M步骤是运用归纳方法。在贝叶斯网络中,通过条件依赖的贝叶斯网络为一个概念(由专家提供)集进行建模,运用贝叶斯理论得到的(初始)假设(建立在条件依赖上的把握)进行预测(归纳)。然后,归纳推导从数据上进行检验。也要注意的是如果已存在的先验知识是在一些法则中得到,学习可能是对这些法则的简化过程,这样形成的逻辑组合被用来描述所要解决的问题。

 

In the Aristotelian approach to pattern recognition, observation of particulars and their explanation are essential for deriving a concept. As we already know, abduction plays a role here, especially for data exploration and characterization to explain or suggest a modification of the representation or an adaptation of the given classifier. Aristotelian learning often relies on the Occam’s razor principle which advocates to choose the simplest model or hypothesis among otherwise equivalent ones and can be implemented in a number of ways [8].

在模式识别亚里士多德式研究方法中,对细节的观察以及相应的解释是得到概念的本质方法。我们已经知道,溯因方法在这里起到了作用,特别是在数据探索和特征描述中,这两个被用来解释或提出对表示方法的修改或对已有识别器的适配。亚里士多德方法经常用到Occam剃刀法则:在多种等价物和多种实现方法中提倡选择最简单的模型或假设。

 

In summary, the Platonic scenario is dominantly inductive-deductive,while the Aristotelian scenario is dominantly inductive-abductive. Both frameworks have different merits and shortcomings. The strength of the Platonic approach lies in the proper formulation and use of subjective beliefs, expert knowledge and possibility to encode internal structural organization of objects. It is model-driven. In this way, however, the inductive generalization becomes limited, as there may be little freedom in the description to explore and discovery of new knowledge. The strength of the Aristotelian approach lies in a numerical induction and a well-developed mathematical theory of vector spaces in which the actual learning takes place. It is data-driven. The weakness,however, lies in the difficulty to incorporate the expert or background knowledge about the problem. Moreover, in many practical applications, it is known that the implicit assumptions of representative training sets, identical and identically distributed (iid) samples as well as stationary distributions do not hold.

总结一下,柏拉图研究用的主要方法是归纳和演译方法,亚里士多德主要是用归纳和溯因方法。两个方法体系有不同的优点和缺点。柏拉图方法的优势在于概念形式化和主观信心、专家知识的使用以及对对象内在结构性组织的构建,它是属于建模驱动方法。然而,这个方法中束缚了归纳性推广,在探索和发现新知识方面受到了约束。亚里士多德方法的优势在于数字化归纳和丰富的向量空间数学理论,学习过程实际上是在向量空间中进行,这是属于数据驱动方法。然而,缺点是难以把专家和背景知识应用到解决问题中。此外,在许多实际应用中,从典型训练集、具同一性和同分布(即固定分布)的样本中得到的隐假设没有加入专家和背景知识。

 

4.3 Two New Pattern Recognition Paradigms

4.3 两个新的模式识别模式

 

Two far-reaching novel paradigms have been proposed that deal with the drawbacks of the Platonic and Aristotelian approaches. In the Aristotelian scenario, Vapnik has introduced transductive learning [73], while in the Platonic scenario, Goldfarb has advocated a new structural learning paradigm[31, 32]. We think these are two major perspectives of the science of pattern recognition.

有两个新颖的且超前的模式识别思想被提了出来,它们分别用以解决柏拉图和亚里士多德方法的缺陷。对于亚里士多德研究方法,Vapnik提出转化推理学习方法,对于柏拉图研究方法,Goldfarb提出一种新的结构学习模式。我们认为这是模式识别科学领域中两个主要的观点。

 

Vapnik [73] formulated the main learning principle as: ‘If you possess a restricted amount of information for solving some problem, try to solve the problem directly and never solve a more general problem as an intermediate step.’ In the traditional Aristotelian scenario, the learning task is often transformed to the problem of function estimation, in which a decision function is determined globally for the entire domain (e.g. for all possible examples in a feature vector space). This is, however, a solution to a more general problem than necessary to arrive at a conclusion (output) for specific input data. Consequently, the application of this common-sense principle requires a reformulation of the learning problem such that novel (unlabeled) examples are considered in the context of the given training set. This leads to the transductive principle which aims at estimating the output for a given input only when required and may differ from an instance to instance. The training sample, considered either globally, or in the local neighborhoods of test examples, is actively used to determine the output. As a result, this leads to confidence measures of single predictions instead of globally estimated classifiers. It provides ways to overcome the difficulty of iid samples and stationary distributions. More formally, in a transductive reasoning, given an entailment A |= (B ∪ C), if the consequence B is observed as the result of A, then the consequence C becomes more likely.

Vapnik提出主要的学习法则是:如果用于解决问题的信息有限,则应当试着寻找直接解决问题的方法,不要去解决更为通用的问题,如中间问题。在传统的亚里士多德式研究过程中,学习的任务经常转为函数估计问题,其中的决策函数用于全局地决定整个问题域(如为了解决特征向量空间中所有的可能用例),然而,这是一个为了解决更为通用问题的方法,不是为特定输入数据而达到的解决方法(输出)。结果,运用这种普通法则的应用需要对学习问题重新进行形式化,这样新的(未标识)用例要被考虑进已有训练集的上下文中。这导致了转化推理的产生,这种方法是只有在需要的时候才从输入数据来估计输出数据,可能会在实例与实例之间进行比较,对于训练样本,可以是较为全面的测试用例,也可以是局部的相邻部分,决定了决策结果。所以,这里用的是对每个决策的信心度量,而不是对分类器进行全局性估量。这样可以克服样本要具有同一同分布和固定分布的困难。转化推理更为形式化的表示可以是这样的关系:A |= (B ∪ C),如果B被观察出来是A,则C因和B相似也被认为是A。

 

The truly transductive principle requires an active synergy of inductive,deductive and abductive principles in a conscious decision process. We believe it is practised by people who analyze complex situations, deduce and validate possible solutions and make decisions in novel ways. Examples are medical doctors, financial advisers, strategy planners or leaders of large organizations. In the context of automatic learning, transduction has applications to learning from partially labeled sets and otherwise missing information, information retrieval, active learning and all types of diagnostics. Some proposals can be found e.g. in [34, 46, 47, 73]. Although many researchers recognize the importance of this principle, many remain also reluctant. This may be caused by unfamiliarity with this idea, few existing procedures, or by the accompanying computational costs as a complete decision process has to be constantly inferred anew.

在意识决策过程中真正的转化推理法则需要是对归纳、演译和溯因法则进行互补和综合,我们相信人类分析复杂事物、推理和验证可能性结论及用新奇方法进行决策是这样进行的,例如象那些医生、金融顾问、战略规划者和大型组织的领导者。在自动化学习过程中,转化推理拥有从局部标识的数据集中进行学习的程序,还有分别从丢失信息、不完整信息、已学知识和各种诊断学中进行学习的程序。这里可以发现一些学习方案,例如可以见文献[34, 46, 47, 73]。虽然有许多研究者认识到了这个转化推理法则的重要性,也有很多人对此表示怀疑,这也许是因为对这个思想不熟悉、相应的程序很少,或者是因为这样一个完整的决策每次都要重新被推断需要较多的计算开销。

 

In the Platonic scenario, Goldfarb and his colleagues have developed structural inductive learning, realized by the so-called evolving transformation systems (ETS) [31, 32]. Goldfarb first noticed the intrinsic and impossible to overcome inadequacy of vector spaces to truly learn from examples [30]. The reason is that such quantitative representations loose all information on object structure; there is no way an object can be generated given its numeric encoding. The second crucial observation was that all objects in the universe have a formative history. This led Goldfarb to the conclusion that an object representation should capture the object’s formative evolution, i.e. the way the object is created through a sequence of suitable transformations in time. The creation process is only possible through structural operations. So, ‘the resulting representation embodies temporal structural information in the form of a formative, or generative, history’ [31]. Consequently, objects are treated as evolving structural processes and a class is defined by structural processes, which are ‘similar’. This is an inductive structural/symbolic class representation,the central concept in ETS. This representation is learnable from a (small) set of examples and has the capability to generate objects from the class.

在柏拉图科学研究方法中,Goldfarb和他的同事研发出基于结构归纳学习方法,运用到所谓的演变转化系统(ETS)中[31,32]。Goldfarb首先注意到一个固有的不可克服的困难,即因向量空间信息不充备而无法真正从用例中学会识别方法,这个原因是定量的表示方法无法确切地表示对象结构的所有信息,没有方法可以量化地表示一个对象。其次还有一个重要的观察是,发现宇宙中所有事物都有一个形式不断演变的历程。于是Goldfarb得到这样的结论,一个对象的表示方法应当抓住对象的形式演变过程,例如,这个方法要能够表示通过一系列适当的不断变化的事物创建过程,这个创建过程只有通过结构化操作才有可能实现。所以,“由此而产生的表示方法嵌入了带时间的结构信息,反映了一个演变或创建历程”[31]。因此,用演变式地结构处理来看待对象,用结构式处理来定义某个类别,这两个方面是“相似”的。这是一个采用归纳方法、结构化/符号化的类别表示方法,是ETS中的中心概念。这个表示方法可以在一个(小的)用例集中进行学习,拥有从某个类别中生成对象的能力。

 

The generative history of a class starts from a single progenitor and is encoded as a multi-level hierarchical system. On a given level, the basic structural elements are defined together with their structural transformations, such that both are used to constitute a new structural element on a higher level.This new element becomes meaningful on that level. Similarity plays an important role, as it is used as a basic quality for a class representation as a set of similar structural processes. Similarity measure is learned in training to induce the optimal finite set of weighted structural transformations that are necessary on the given level, such that the similarity of an object to the class representation is large. ‘This mathematical structure allows one to capture dynamically, during the learning process, the compositional structure of objects/events within a given inductive, or evolutionary, environment’ [31].

一个种类的生成历程起始于一个单一的源点,这个历程被描述成一个多层次的分等级的系统。在某个层次上,基本的结构元素和它们的结构转变被定义在一起,这样它们就可以一起被用来建立更高层次上新的结构元素,新元素在所在层次上就变成有意义起来。相似性起着一个重要的角色,是一个类别表示方法的基本性质,一个类别表示方法是通过对结构上相似性判断分出来的。相似性衡量方法通过训练被学习出来,通过归纲方法进行优化在某个层上必须要有的权重化结构转变信息的有限集,这样在某个种类表示方法中的某个对象的相似性是很大的。“在归纳推理中,或在演进式环境中,这个数学结构在学习过程中允许动态地获取对象或事件的合成结构。”[31]

 

Goldfarb’s ideas bear some similarity to the ones of Wolfram, presented in his book on ‘a new kind of science’ [80]. Wolfram considers computation as the primary concept in nature; all processes are the results of cellular automata(Cellular automata are discrete dynamical systems that operate on a regular lattice in space and time, and are characterized by ‘local’ interactions.)type of computational processes, and thereby inherently numerical. He observes that repetitive use of simple computational transformations can cause very complex phenomena, especially if computational mechanisms are used at different levels. Goldfarb also discusses dynamical systems, in which complexity is built from simpler structures, through hierarchical folding up (or enrichment). The major difference is that he considers structure of primary interest, which leads to evolving temporal structural processes instead of computational ones.

Goldfarb的思想与Wolfram有些相似,在他的书‘a new kind of science’[80]中有这方面的介绍。Woldfram认为计算是自然中首要的概念,所有的处理都是实现在胞元自动机类型上的计算过程(胞元自动机是离散的动态系统,在时间和空间中操作在有规则的格子上,具有局部交互的特点),显然是数字化的。他观察到反复使用简单的计算转换方法可以产生复杂的现象,特别是如果在不同层次进行计算。Goldfarb也论述类似的动态系统,认为复杂的事物可以通过分等级的折叠(或富集)方法,从更简单的结构来建立。主要不同的是他考虑主要有用的结构,且这种结构可以在结构上随时间的变化而进行演变,而不是计算出来。

 

In summary, Goldfarb proposes a revolutionary paradigm: an ontological model of a class representation in an epistemological context, as it is learnable from examples. This is a truly unique unification. We think it is the most complete and challenging approach to pattern recognition to this date, a breakthrough. By including the formative history of objects into their representation, Goldfarb attributes them some aspects of human consciousness. The far reaching consequence of his ideas is a generalized measurement process that will be one day present in sensors. Such sensors will be able to measure ‘in structural units’ instead of numerical units (say, meters) as it is currently done. The inductive process over a set of structural units lies at the foundation of new inductive informatics. The difficulty, however, is that the current formalism in mathematics and related fields is not yet prepared for adopting these far-reaching ideas. We, however, believe, they will pave the road and be found anew or rediscovered in the next decennia.

总之,Goldfarb提出的是一个具有创新性的识别模式:以认识论为背景,建立类别表示方法的存在论模型,且可以从用例中进行学习。这是一个真正的终极方法。我们认为这是迄今为止最为完整且极具有挑战性的方法。通过把对象的演变历史融入到表示方法中,Goldfarb加入了人类意识形为。他的思想遥不可及的是一般化的类似于生物的测量方法,这个方法有一天将会被用在传感器中,这样的传感器可以‘在结构单元’上进行测量,而不是现在所实现的数值单元(如米)。归纳推理在一组结构单元上进行处理,这些结构单元是归纳推理产生新信息的基础。然而,困难的是当前数学和相关领域上的形式体系并未能为采用这些遥不可及的思想做好准备,不过,我们相信这些思想是通往成功之路,且在几十年后会被再次找到或重新发现。

原创粉丝点击