CN114419716A

CN114419716A - A calibration method for face key point calibration in face images

Info

Publication number: CN114419716A
Application number: CN202210094018.7A
Authority: CN
Inventors: 闫佳庆; 胡博阳; 邓金钊; 李丹; 龙舟; 孙文浩; 李小俚
Original assignee: North China University of Technology
Current assignee: North China University of Technology
Priority date: 2022-01-26
Filing date: 2022-01-26
Publication date: 2022-04-29
Anticipated expiration: 2042-01-26
Also published as: CN114419716B

Abstract

The present invention provides a calibration method for calibrating facial key points for continuous face images, including: using the key points of the first face image obtained through the DAN network as candidate key points, Each key point of the image is calibrated in turn: obtain the regional gradient feature vector, calculate the similarity result α ₁ of the regional gradient feature vector before and after the frame; obtain the regional convolution feature, calculate the similarity of the regional convolution feature in the face image before and after the frame Result α ₂ ; the similarity result α ₁ and the similarity result α ₂ are added according to the proportion to obtain the similarity result β: if it is greater than the preset threshold, the candidate key point is the key point after the calibration of the face image of the next frame ; If it is less than the preset threshold, search in the preset key point search area until the key points greater than the threshold are obtained as the calibrated key points of the next frame of face image. The method improves the positioning accuracy of face key points and the anti-noise effect, so that it can be applied to fields that require higher accuracy of face key points.

Description

A calibration method for face key point calibration in face images

技术领域technical field

本发明涉及机器视觉技术领域，具体涉及一种人脸图像面部关键点标定的校准方法。The invention relates to the technical field of machine vision, in particular to a calibration method for calibrating facial key points in a human face image.

背景技术Background technique

近年来，针对人脸分析的研究越来越多，所谓人脸分析，是指在人脸图像的基础上，通过计算机视觉相关技术，对人的表情，身份进行识别，人脸关键点的精确定位是人脸分析任务中重要的基础环节。人脸关键点定位通常是所有人脸分析任务的预处理工作，后续还需要人脸对齐进行数据标准化。In recent years, there have been more and more researches on face analysis. The so-called face analysis refers to the recognition of human expressions and identities, and the accurate identification of face key points through computer vision-related technologies based on face images. Localization is an important basic link in face analysis tasks. Face key point location is usually the preprocessing work for all face analysis tasks, and face alignment is required for data standardization in the follow-up.

人脸关键点检测的方法可以分为两类：基于点分布(PDM)模型的检测方法和基于深度学习的检测方法。The methods of face keypoint detection can be divided into two categories: detection methods based on point distribution (PDM) model and detection methods based on deep learning.

其中，基于点分布模型的检测方法主要包括以下部分：对人脸关键点样本进行统计分析，对所有关键点的坐标串联得到向量表示，通过最小二乘法计算得到反映人脸关键点分布规律的模型—点分布模型。ASM与AAM这类传统算法会形成基于统计的形状模型，可解释性较强，但其形状模型与检测结果强烈依赖于数据集，泛化性能较差，同时由于其统计是在整体平均的基础上进行的，导致其在面对一些特殊数据时表现很差。基于深度学习的检测方法是将整张图片作为网络输入，充分避免了特征的稀疏性，能够学习到更多的信息；同时，深层次的卷积神经网络能够学习到深层次的语义特征。虽然深度学习技术的出现使得人脸关键点算法性能得到了一定提升，但是还存在一些缺点，比如人脸姿态和遮挡，虽然近年来出现了一些处理相关问题的方法，但是在图片质量较低的实时情境下，当前方法距离实际应用还有一定距离。Among them, the detection method based on the point distribution model mainly includes the following parts: performing statistical analysis on the face key point samples, concatenating the coordinates of all key points to obtain a vector representation, and obtaining a model reflecting the distribution law of face key points through the least squares method - Point distribution model. Traditional algorithms such as ASM and AAM will form a shape model based on statistics, which is highly interpretable, but its shape model and detection results are strongly dependent on the data set, and the generalization performance is poor. At the same time, because its statistics are based on the overall average , which results in its poor performance in the face of some special data. The detection method based on deep learning takes the entire image as the network input, which fully avoids the sparsity of features and can learn more information; at the same time, the deep convolutional neural network can learn deep semantic features. Although the emergence of deep learning technology has improved the performance of the face key point algorithm to a certain extent, there are still some shortcomings, such as face pose and occlusion. Although there have been some methods to deal with related problems in recent years, but in low-quality images In real-time situations, the current method is still far from practical application.

现有技术中，人脸关键点��定位方法主要包括在对包含人脸图像进行分解变换和对称变换之后利用几何关系得到估算定位。但是，在很多困难场景下，人脸关键点定位的结果仍然无法令人满意，影响到检测精度的外界因素很多，包括姿势，遮挡，表情，照明。在无约束的环境下，由于面部特征的自身或环境引起的变化，使得人脸关键点定位这项任务难度较大。传统的人脸关键点在处理一些精度要求不太高的任务如人脸识别时能取得较好结果，但面对人脸微表情识别，人脸姿态识别等精度要求较高的任务时误差较大。In the prior art, the method for locating key points of a face mainly includes using a geometric relationship to obtain an estimated position after decomposing and symmetric transforming the image containing the face. However, in many difficult scenes, the results of facial key point location are still unsatisfactory, and there are many external factors that affect the detection accuracy, including pose, occlusion, expression, and lighting. In an unconstrained environment, the task of facial key point localization is difficult due to the changes caused by the facial features themselves or the environment. The traditional face key points can achieve better results when dealing with tasks that require less precision, such as face recognition, but face micro-expression recognition, face pose recognition and other tasks that require higher precision. big.

为了解决这些问题，本发明提出了一种新的人脸图像面部关键点标定的校准方法。In order to solve these problems, the present invention proposes a new calibration method for the calibration of facial key points in human face images.

发明内容SUMMARY OF THE INVENTION

为解决上述问题，本发明提供一种人脸图像面部关键点标定的校准方法。In order to solve the above problems, the present invention provides a calibration method for calibrating facial key points in a face image.

本发明提供了如下的技术方案。The present invention provides the following technical solutions.

一种人脸图像面部关键点标定的校准方法，包括以下步骤：A calibration method for calibrating facial key points in a face image, comprising the following steps:

以通过训练后的DAN网络获取的第一张人脸图像各关键点的坐标为基准，对后一帧的人脸图像的各关键点依次进行校准，包括以下步骤：Based on the coordinates of the key points of the first face image obtained by the trained DAN network, the key points of the face image of the next frame are calibrated in turn, including the following steps:

将前一帧人脸图像中校准好的关键点坐标，作为后一帧的备选关键点的坐标，分别为中心建立两帧图像的梯度特征向量生成区域；Taking the calibrated key point coordinates in the face image of the previous frame as the coordinates of the candidate key points of the subsequent frame, respectively, establish the gradient feature vector generation area of the two frames of images as the center;

通过计算梯度特征向量生成区域的梯度大小与梯度方向，获得区域梯度特征向量，计算前后两帧人脸图像的区域梯度特征向量的��似性结果α₁；By calculating the gradient magnitude and gradient direction of the gradient feature vector generation region, the regional gradient feature vector is obtained, and the similarity result α ₁ of the regional gradient feature vectors of the two frames of face images before and after is calculated;

分别以关键点为中心建立两帧图像的卷积特征对比区域，对卷积特征对比区域进行多次卷积，获得区域卷积特征，计算前后两帧人脸图像中的区域卷积特征的相似性结果α₂；The convolution feature comparison area of the two frames of images is established with the key points as the center, and the convolution feature comparison area is convolved multiple times to obtain the regional convolution features, and the similarity of the regional convolution features in the two frames of face images before and after is calculated. Sexual result α ₂ ;

将相似性结果α₁与相似性结果α₂按比重相加，得到相似性结果β：The similarity result α ₁ and the similarity result α ₂ are added according to the proportion to obtain the similarity result β:

若大于预设相似性结果阈值，则备选关键点为后一帧人脸图像校准后的关键点；若小于预设相似性结果阈值，在预设的关键点搜索区域内搜索，直至获得大于阈值的关键点作为后一帧人脸图像校准后的关键点。If it is greater than the preset similarity result threshold, the candidate key point is the calibrated key point of the face image in the next frame; if it is less than the preset similarity result threshold, search in the preset key point search area until it obtains a key point greater than The key points of the threshold are used as the calibrated key points of the next frame of face image.

优选地，还包括：若搜索完整片搜索区域后仍没有高于预设阈值的点，则将搜索区域内所有的点以与备选关键点的相似度排序，将相似度排第一的点作为校准后的关键点。Preferably, it also includes: if there is still no point higher than the preset threshold after searching the complete search area, then all points in the search area are sorted by the similarity with the candidate key points, and the similarity is ranked first. as a key point after calibration.

优选地，所述DAN神经网络模型根据标记的人脸关键点坐标数据，之后利用CNN完成监督学习。Preferably, the DAN neural network model utilizes CNN to complete supervised learning according to the marked face key point coordinate data.

优选地，所述梯度特征向量生成区域的梯度大小与梯度方向的计算，包括以下步骤：Preferably, the calculation of the gradient size and gradient direction of the gradient feature vector generation region includes the following steps:

梯度大小分为横向梯度大小G_x与纵向梯度大小G_y为：The gradient size is divided into the horizontal gradient size G _x and the longitudinal gradient size G _y as:

G_x(x,y)＝I₂(x+1,y)-I₂(x,y)G _x (x,y)=I ₂ (x+1,y)-I ₂ (x,y)

G_y(x,y)＝I₂(x,y+1)-I₂(x,y)G _y (x,y)=I ₂ (x,y+1)-I ₂ (x,y)

式中，(x,y)为像素点坐标；I₂为图像Gamma校正后的亮度；In the formula, (x, y) is the pixel coordinate; I ₂ is the brightness of the image after Gamma correction;

梯度的幅值为：The magnitude of the gradient is:

梯度的方向为：The direction of the gradient is:

优选地，所述区域梯度特征向量的获取，包括以下步骤：Preferably, the acquisition of the regional gradient feature vector includes the following steps:

为梯度特征向量生成区域中心点距离相同的像素分配相同的权重，将像素的梯度向量与对应的权重相乘得到权重像素梯度向量，将权重像素梯度向量相加得到区域梯度特征向量。Assign the same weight to the pixels with the same distance from the center point of the gradient feature vector generation region, multiply the gradient vector of the pixel with the corresponding weight to obtain the weighted pixel gradient vector, and add the weighted pixel gradient vector to obtain the regional gradient feature vector.

优选地，所述相似性计算公式采用余弦相似度计算公式。Preferably, the similarity calculation formula adopts a cosine similarity calculation formula.

优选地，所述将相似性结果α₁与相似性结果α₂按比重相加，得到相似性结果β，公式如下：Preferably, the similarity result α ₁ and the similarity result α ₂ are added in proportion to obtain the similarity result β, and the formula is as follows:

β＝w₁α₁+w₂α₂,w₁+w₂＝1β=w ₁ α ₁ +w ₂ α ₂ , w ₁ +w ₂ =1

其中，将β设定阈值为0.95。Among them, the threshold value of β is set to 0.95.

优选地，所述在预设的关键点搜索区域内搜索，包括以下步骤：Preferably, the searching in the preset key point search area includes the following steps:

以备选关键点为中心，建立15*15的关键点搜索区域；以备选关键点为原点开始搜索，先横向搜索，后纵向搜索，直到出现高于阈值的点，即为后一帧人脸图像的关键点。With the candidate key point as the center, establish a 15*15 key point search area; start the search with the candidate key point as the origin, first search horizontally, then vertically, until there is a point higher than the threshold, that is, the next frame of people keypoints of the face image.

本发明的有益效果：Beneficial effects of the present invention:

本发明提出一种人脸图像面部关键点标定的校准方法，该方法提高了人脸关键点的抗噪效果，消除了关键点的抖动，提高了关键点定位的精度，能够完成微表情分类任务。The invention proposes a calibration method for calibrating facial key points of a face image, which improves the anti-noise effect of the key points of the face, eliminates the jitter of the key points, improves the positioning accuracy of the key points, and can complete the task of micro-expression classification .

附图说明Description of drawings

图1是本发明实施例的方法流程图；Fig. 1 is the method flow chart of the embodiment of the present invention;

图2是本发明实施例的人脸关键点定位示意图；2 is a schematic diagram of the location of key points of a human face according to an embodiment of the present invention;

图3是本发明实施例的HOG特征提取流程图3 is a flow chart of HOG feature extraction according to an embodiment of the present invention

图4(a)是本发明实施例的未经过Gamma处理的被试人脸图；Fig. 4 (a) is the subject's face map without Gamma processing according to the embodiment of the present invention;

图4(b)是本发明实施例的经过处理的被试人脸图；Fig. 4(b) is the processed face map of the tested subject according to the embodiment of the present invention;

图5是本发明实施例的关键点附近区域梯度向量生成及对比过程示意图；5 is a schematic diagram of a gradient vector generation and comparison process in the vicinity of a key point according to an embodiment of the present invention;

图6是本发明实施例的局部卷积操作及结果对比图；6 is a local convolution operation and a result comparison diagram of an embodiment of the present invention;

图7(a)是本发明实施例的连续帧人脸图像；Fig. 7 (a) is the continuous frame face image of the embodiment of the present invention;

图7(b)是本发明实施例的连续人脸图像中未经校准的人脸关键点定位示意图；Fig. 7 (b) is a schematic diagram of uncalibrated face key point positioning in the continuous face image of the embodiment of the present invention;

图7(c)是本发明实施例的连续人脸图像中经校准的人脸关键点定位示意图。Fig. 7(c) is a schematic diagram of the location of calibrated key points of faces in continuous face images according to an embodiment of the present invention.

具体实施方式Detailed ways

为了使本发明的目的、技术方案及优点更加清楚明白，以下结合附图及实施例，对本发明进行进一步详细说明。应当理解，此处所描述的具体实施例仅用以解释本发明，并不用于限定本发明。In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention.

实施例1Example 1

本发明的一种人脸图像面部关键点标定的校准方法，如图1所示，A calibration method for calibrating facial key points in a face image of the present invention is shown in FIG. 1 ,

S1：使用DAN神经网络进行人脸序列的第一张图像关键点定位：S1: Use the DAN neural network to locate the key points of the first image of the face sequence:

采用的数据集为300W，300W数据集是一个比较通用的人脸对齐数据集，该数据集约有4000个被试者的图像。每个被试者包含多张人脸，但是每批图像只有一张人脸图像被标注了68个关键点。The dataset used is 300W, and the 300W dataset is a relatively common face alignment dataset with about 4000 subjects' images. Each subject contains multiple faces, but only one face image per batch is annotated with 68 keypoints.

DAN是一种类似级联的神经网络，但是与其他的级联模型不同，其输入是整张人脸图，DAN可以有效地克服头部姿态带来的问题，达到较好的检测效果。DAN需要根据标记的人脸关键点坐标数据，利用CNN完成监督学习。DAN为多层级网络，第一层的输入为人脸关键点的标椎模板和原始图片，将输入送入前馈神经网络，前馈神经网络由十个卷积层，五个池化层，一个全连接层构成。每两个卷积层后接一个池化层，激活函数选用ReLU函数，输出为人脸关键点各位置的偏差值。第二层为姿势估计网络，该层网络主要计算图中人脸的姿态方向，同时根据标准人脸关键点模型进行仿射变换，将图像中的人脸提出并且摆正。第三层为关键点热图生成。关键点热图是图像中每一个关键点附近区域(选取区域为25*25)的概率图，该图像素值大小范围为0到1，位置距关键点越远，像素值越小。热力图这种基于位置的分布规律用高斯分布来模拟。在经过训练后，得到的模型可以预测人脸图像的关键点。如图2所示。DAN is a kind of similar cascaded neural network, but different from other cascaded models, its input is the whole face image, DAN can effectively overcome the problems caused by head posture and achieve better detection results. DAN needs to use CNN to complete supervised learning according to the marked face key point coordinate data. DAN is a multi-layer network. The input of the first layer is the face key point template and the original image, and the input is sent to the feedforward neural network. The feedforward neural network consists of ten convolutional layers, five pooling layers, one Fully connected layer. Each two convolutional layers is followed by a pooling layer, the activation function uses the ReLU function, and the output is the deviation value of each position of the key points of the face. The second layer is the pose estimation network, which mainly calculates the pose direction of the face in the image, and at the same time, performs affine transformation according to the standard face key point model, and proposes and straightens the face in the image. The third layer is keypoint heatmap generation. The key point heat map is a probability map of the area near each key point in the image (the selected area is 25*25). The pixel value of the image ranges from 0 to 1. The farther the position is from the key point, the smaller the pixel value. The location-based distribution of heatmaps is modeled with a Gaussian distribution. After training, the resulting model can predict the keypoints of face images. as shown in picture 2.

S2：以通过DAN网络获取的第一张人脸图像各关键点的坐标为基准，对连续帧的人脸图像的各关键点依次进行校准，包括以下步骤：S2: Based on the coordinates of the key points of the first face image obtained through the DAN network, the key points of the face images of consecutive frames are calibrated sequentially, including the following steps:

S2.1：将连续帧的人脸图像进行灰度化，并进行Gamma校正.S2.1: Grayscale face images of consecutive frames and perform Gamma correction.

图像灰度化将彩色图像转化为灰度图像，图像灰度化后的数据占用的内存更小，运算速度更快，同时转化为灰度图像后可以增强视觉上的对比效果，突出人脸关键点附近的区域。灰度化的公式如下：Image grayscale converts color images into grayscale images. The grayscaled data occupies less memory and operates faster. At the same time, after converting to grayscale images, it can enhance the visual contrast effect and highlight the key to the face. area near the point. The formula for grayscale is as follows:

I₀＝0.3I_r+0.59I_g+0.11I_b I ₀ =0.3I _r +0.59I _g +0.11I _b

该公式中的系数来自于人眼对于红色，绿色，蓝色的敏感程度。使用 opencv中的相关函数即可快速实现。Gamma校正的目的是为了调节图像的对比度，降低图像局部的阴影和光照变化所造成的影响，同时也可以抑制噪音的干扰。进行Gamma操作首先需要将图像像素进行归一化，归一化公式如下：The coefficients in this formula come from how sensitive the human eye is to red, green, and blue. It can be quickly implemented using the related functions in opencv. The purpose of Gamma correction is to adjust the contrast of the image, reduce the influence of local shadows and illumination changes in the image, and also suppress the interference of noise. To perform the Gamma operation, the image pixels need to be normalized first. The normalization formula is as follows:

I₁＝I₀/255I ₁ =I ₀ /255

对归一化后的像素进行Gamma补偿，Gamma补偿的公式如下：Gamma compensation is performed on the normalized pixels. The formula of Gamma compensation is as follows:

I₂＝I₁ ^gamma I ₂ =I ₁ ^gamma

进行反归一化操作，便得到了处理后的图像，如图4所示，图4(a)为未经过Gamma处理的被试人脸图，图4(b)为经过处理的被试人脸图。Perform the inverse normalization operation to obtain the processed image, as shown in Figure 4. Figure 4(a) is the face image of the subject without Gamma processing, and Figure 4(b) is the processed subject. face map.

S2.2：进行HOG特征提取，过程如图3所示，将前一帧人脸图像中校准好的关键点坐标，作为后一帧的备选关键点的坐标，分别建立两帧图像的梯度特征向量生成区域。S2.2: Perform HOG feature extraction. The process is shown in Figure 3. The calibrated key point coordinates in the previous frame of the face image are used as the coordinates of the candidate key points in the next frame, and the gradients of the two frames of images are established respectively. Feature vector generation area.

S2.3：通过计算梯度特征向量生成区域的梯度大小与梯度方向，获得区域梯度特征向量，计算前后两帧人脸图像的区域梯度特征向量的相似性结果 α₁.具体包括：S2.3: Obtain the regional gradient feature vector by calculating the gradient size and gradient direction of the gradient feature vector generation region, and calculate the similarity result α ₁ of the regional gradient feature vectors of the two frames of face images before and after. Specifically, it includes:

G_x(x,y)＝I₂(x+1,y)-I₂(x,y)G _x (x,y)=I ₂ (x+1,y)-I ₂ (x,y)

G_y(x,y)＝I₂(x,y+1)-I₂(x,y)G _y (x,y)=I ₂ (x,y+1)-I ₂ (x,y)

式中，(x,y)为像素点坐标；In the formula, (x, y) is the pixel coordinates;

梯度的幅值为：The magnitude of the gradient is:

梯度的方向为：The direction of the gradient is:

以每个关键点为中心，建立大小为5*5的梯度特征向量生成区域，每个关键点都会根据其对应的梯度特征向量生成区域梯度特征向量：为梯度特征向量生成区域距离中心点相同的像素分配相同的权重，将像素的梯度向量与对应的权重相乘得到权重像素梯度向量，将权重像素梯度向量相加得到区域梯度特征向量。With each key point as the center, a gradient feature vector generation area with a size of 5*5 is established. Each key point will generate a regional gradient feature vector according to its corresponding gradient feature vector: the gradient feature vector is generated for the region with the same distance from the center point. The pixels are assigned the same weight, the gradient vector of the pixel is multiplied by the corresponding weight to obtain the weighted pixel gradient vector, and the weighted pixel gradient vector is added to obtain the regional gradient feature vector.

相似性计算公式采用余弦相似度计算公式：The similarity calculation formula adopts the cosine similarity calculation formula:

得到梯度特征向量对比��α₁。关键点附近区域梯度向量生成及对比过程示意图，如图5所示。The gradient feature vector contrast value α ₁ is obtained. The schematic diagram of the gradient vector generation and comparison process in the vicinity of the key point is shown in Figure 5.

S2.4：分别以关键点为中心建立两帧图像的卷积特征对比区域，对卷积特征对比区域进行多次卷积，获得区域卷积特征，计算前后帧人脸图像中的区域卷积特征的相似性结果α₂：以每个关键点为中心，建立大小为7*7的卷积特征对比区域。在卷积对比区域进行三次卷积，卷积核大小均为3*3。局部卷积操作及结果对比图如图6所示。S2.4: Establish the convolution feature comparison area of the two frames of images with the key points as the center, perform multiple convolutions on the convolution feature comparison area to obtain the regional convolution features, and calculate the regional convolution in the face images of the front and rear frames. Feature similarity result α ₂ : With each key point as the center, a convolutional feature comparison area with a size of 7*7 is established. Three convolutions are performed in the convolution contrast area, and the size of the convolution kernel is 3*3. The local convolution operation and the result comparison are shown in Figure 6.

S2.5：将相似性结果α₁与相似性结果α₂按比重相加，得到相似性结果β：具体公式如下：S2.5: Add the similarity result α ₁ and the similarity result α ₂ according to the proportion to obtain the similarity result β: The specific formula is as follows:

将β设定阈值为0.95。下面具体说明一下关键点对比过程：Set the β threshold to 0.95. The following is a detailed description of the key point comparison process:

S2.6：关键点对比过程：S2.6: Key point comparison process:

记第一张人脸图片某关键点的坐标(x₁,y₁)，在第二张人脸图片以(x₁,y₁)为中心计算梯度特征向量相似度，卷积特征相似度以及最终的相似性结果β₀。若β₀大于0.95，则可以认为第二张人脸图像的(x₁,y₁)附近的特征与第一张人脸图像的(x₁,y₁)附近的特征相似度足够高，即第一张人脸图像的(x₁,y₁)与第二张人脸图像的(x₁,y₁)是同一个点，(x₁,y₁)也是第二张人脸图像的关键点。若对比结果低于该阈值，以(x₁,y₁)为中心，建立15*15的关键点搜索区域。以(x₁,y₁) 为原点开始搜索，先横向搜索，后纵向搜索，直到出现高于阈值的点(x₂,y₂)，Note the coordinates (x ₁ , y ₁ ) of _a key point in the _first face picture, and calculate the gradient feature vector similarity, convolution feature similarity and The final similarity result β ₀ . If β ₀ is greater than 0.95, it can be considered that the features near (x ₁ , y ₁ ) of the second face image are sufficiently similar to the features near (x ₁ , y ₁ ) of the first face image, that is, (x ₁ , y ₁ ) of the first face image is the same point as (x ₁ , y ₁ ) of the second face image, and (x ₁ , y ₁ ) is also the key to the second face image point. If the comparison result is lower than the threshold, a 15*15 key point search area is established with (x ₁ , y ₁ ) as the center. Start the search with (x ₁ , y ₁ ) as the origin, first search horizontally, then vertically, until a point (x ₂ , y ₂ ) higher than the threshold appears,

(x₂,y₂)即为第二张人脸图像的关键点。若搜索完整个搜索区域后仍没有高于阈值的点，则将搜索区域内所有的点以与(x₁,y₁)的相似度排序，将相似度排第一的点(x_n,y_n)作为关键点。对所有关键点重复上述过程，即可得到校正后的关键点。具体效果，如图7所示，图7(a)为连续人脸图像，图7(b)连续人脸图像中未经校准的人脸关键点，图7(c)为连续图像中经过校准的人脸关键点。(x ₂ , y ₂ ) is the key point of the second face image. If there is no point higher than the threshold after searching the entire search area, then all points in the search area are sorted by their similarity to (x ₁ , y ₁ ), and the first point of similarity (x _n , y _n ) as key points. Repeat the above process for all keypoints to get the corrected keypoints. The specific effect is shown in Figure 7. Figure 7(a) is a continuous face image, Figure 7(b) is an uncalibrated face key point in the continuous face image, and Figure 7(c) is a calibrated face in the continuous image. face key points.

以上仅为本发明的较佳实施例而已，并不用以限制本发明，凡在本发明的精神和原则之内所作的任何修改、等同替换和改进等，均应包含在本发明的保护范围之内。The above are only preferred embodiments of the present invention and are not intended to limit the present invention. Any modifications, equivalent replacements and improvements made within the spirit and principles of the present invention shall be included in the protection scope of the present invention. Inside.

Claims

1. a calibration method of face image facial key point calibration, is characterized in that, comprises:

Based on the coordinates of the key points of the first face image obtained through the trained DAN network, the key points of the face image in the next frame are calibrated in turn, including the following steps:

Grayscale the face images of consecutive frames and perform Gamma correction;

The calibrated key point coordinates in the face image of the previous frame are used as the coordinates of the candidate key points of the next frame, and the gradient feature vector generation regions of the two frames of images are respectively established around them;

By calculating the gradient magnitude and gradient direction of the gradient feature vector generation region, the regional gradient feature vector is obtained, and the similarity result α ₁ of the regional gradient feature vectors of the two frames of face images before and after is calculated;

The convolution feature comparison area of the two frames of images before and after is established with the candidate key points as the center, and the convolution feature comparison area is convolved multiple times to obtain the regional convolution feature, and the regional convolution in the two frames of face images before and after is calculated. Feature similarity result α ₂ ;

The similarity result α ₁ and the similarity result α ₂ are added according to the proportion to obtain the similarity result β:

If it is greater than the preset similarity result threshold, the candidate key point is the calibrated key point of the face image in the next frame; if it is less than the preset similarity result threshold, search in the preset key point search area until it obtains a key point greater than The key points of the threshold are used as the calibrated key points of the next frame of face image.

2. the calibration method of face image facial key point demarcation according to claim 1, is characterized in that, also comprises: if still does not have the point higher than preset threshold value after searching the complete piece search area, then will search area. All points are sorted by similarity with candidate key points, and the point with the first similarity is used as the calibrated key point.

3. The calibration method of face image facial key point demarcation according to claim 1, is characterized in that, described DAN neural network utilizes CNN to complete supervision learning afterward according to the marked face key point coordinate data.

4. the calibration method of face image face key point calibration according to claim 1, is characterized in that, the calculation of the gradient size and gradient direction of described gradient feature vector generation area, comprises the following steps:

The gradient size is divided into the horizontal gradient size G _x and the longitudinal gradient size G _y as:

G _x (x,y)=I ₂ (x+1,y)-I ₂ (x,y)

G _y (x,y)=I ₂ (x,y+1)-I ₂ (x,y)

In the formula, (x, y) is the pixel coordinate; I ₂ is the brightness of the image after Gamma correction;

The magnitude of the gradient is:

The direction of the gradient is:

5. the calibration method of human face image face key point calibration according to claim 4, is characterized in that, the acquisition of described regional gradient feature vector, comprises the following steps:

Assign the same weight to the pixels with the same distance from the center point of the gradient feature vector generation region, multiply the gradient vector of the pixel with the corresponding weight to obtain the weighted pixel gradient vector, and add the weighted pixel gradient vector to obtain the regional gradient feature vector.

6 . The calibration method for face key point calibration in a human face image according to claim 1 , wherein the similarity calculation formula adopts a cosine similarity calculation formula. 7 .

7. the calibration method of face image face key point demarcation according to claim 1, is characterized in that, described similarity result α ₁ and similarity result α ₂ are added by proportion, obtain similarity result β, formula as follows:

β=w ₁ α ₁ +w ₂ α ₂ , w ₁ +w ₂ =1

Among them, the threshold value of β is set to 0.95.

8. the calibration method of face key point calibration of human face image according to claim 1, is characterized in that, described searching in the preset key point search area, comprises the following steps:

With the candidate key point as the center, establish a 15*15 key point search area; start the search with the candidate key point as the origin, first search horizontally, then vertically, until there is a point higher than the threshold, that is, the next frame of people keypoints of the face image.