CN114419716A - A calibration method for face key point calibration in face images - Google Patents
A calibration method for face key point calibration in face images Download PDFInfo
- Publication number
- CN114419716A CN114419716A CN202210094018.7A CN202210094018A CN114419716A CN 114419716 A CN114419716 A CN 114419716A CN 202210094018 A CN202210094018 A CN 202210094018A CN 114419716 A CN114419716 A CN 114419716A
- Authority
- CN
- China
- Prior art keywords
- key point
- face
- gradient
- face image
- similarity
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Image Analysis (AREA)
Abstract
本发明提供针对连续人脸图像的一种面部关键点标定的校准方法,包括:以通过DAN网络获得的第一张人脸图像的各关键点作为备选关键点,对后一帧的人脸图像的各关键点依次进行校准:获得区域梯度特征向量,计算前后帧区域梯度特征向量的相似性结果α1;获得区域卷积特征,计算前后帧人脸图像中的区域卷积特征的相似性结果α2;将相似性结果α1与相似性结果α2按比重相加,得到相似性结果β:若大于预设阈值,则备选关键点为后一帧人脸图像校准后的关键点;若小于预设阈值,在预设的关键点搜索区域内搜索,直至获得大于阈值的关键点作为后一帧人脸图像校准后的关键点。本方法提高了人脸关键点的定位精度,抗噪效果,使其能够应用到对人脸关键点精度要求更高的领域中。
The present invention provides a calibration method for calibrating facial key points for continuous face images, including: using the key points of the first face image obtained through the DAN network as candidate key points, Each key point of the image is calibrated in turn: obtain the regional gradient feature vector, calculate the similarity result α 1 of the regional gradient feature vector before and after the frame; obtain the regional convolution feature, calculate the similarity of the regional convolution feature in the face image before and after the frame Result α 2 ; the similarity result α 1 and the similarity result α 2 are added according to the proportion to obtain the similarity result β: if it is greater than the preset threshold, the candidate key point is the key point after the calibration of the face image of the next frame ; If it is less than the preset threshold, search in the preset key point search area until the key points greater than the threshold are obtained as the calibrated key points of the next frame of face image. The method improves the positioning accuracy of face key points and the anti-noise effect, so that it can be applied to fields that require higher accuracy of face key points.
Description
技术领域technical field
本发明涉及机器视觉技术领域,具体涉及一种人脸图像面部关键点标定 的校准方法。The invention relates to the technical field of machine vision, in particular to a calibration method for calibrating facial key points in a human face image.
背景技术Background technique
近年来,针对人脸分析的研究越来越多,所谓人脸分析,是指在人脸图 像的基础上,通过计算机视觉相关技术,对人的表情,身份进行识别,人脸 关键点的精确定位是人脸分析任务中重要的基础环节。人脸关键点定位通常 是所有人脸分析任务的预处理工作,后续还需要人脸对齐进行数据标准化。In recent years, there have been more and more researches on face analysis. The so-called face analysis refers to the recognition of human expressions and identities, and the accurate identification of face key points through computer vision-related technologies based on face images. Localization is an important basic link in face analysis tasks. Face key point location is usually the preprocessing work for all face analysis tasks, and face alignment is required for data standardization in the follow-up.
人脸关键点检测的方法可以分为两类:基于点分布(PDM)模型的检测 方法和基于深度学习的检测方法。The methods of face keypoint detection can be divided into two categories: detection methods based on point distribution (PDM) model and detection methods based on deep learning.
其中,基于点分布模型的检测方法主要包括以下部分:对人脸关键点样 本进行统计分析,对所有关键点的坐标串联得到向量表示,通过最小二乘法 计算得到反映人脸关键点分布规律的模型—点分布模型。ASM与AAM这类 传统算法会形成基于统计的形状模型,可解释性较强,但其形状模型与检测 结果强烈依赖于数据集,泛化性能较差,同时由于其统计是在整体平均的基 础上进行的,导致其在面对一些特殊数据时表现很差。基于深度学习的检测 方法是将整张图片作为网络输入,充分避免了特征的稀疏性,能够学习到更 多的信息;同时,深层次的卷积神经网络能够学习到深层次的语义特征。虽 然深度学习技术的出现使得人脸关键点算法性能得到了一定提升,但是还存 在一些缺点,比如人脸姿态和遮挡,虽然近年来出现了一些处理相关问题的 方法,但是在图片质量较低的实时情境下,当前方法距离实际应用还有一定 距离。Among them, the detection method based on the point distribution model mainly includes the following parts: performing statistical analysis on the face key point samples, concatenating the coordinates of all key points to obtain a vector representation, and obtaining a model reflecting the distribution law of face key points through the least squares method - Point distribution model. Traditional algorithms such as ASM and AAM will form a shape model based on statistics, which is highly interpretable, but its shape model and detection results are strongly dependent on the data set, and the generalization performance is poor. At the same time, because its statistics are based on the overall average , which results in its poor performance in the face of some special data. The detection method based on deep learning takes the entire image as the network input, which fully avoids the sparsity of features and can learn more information; at the same time, the deep convolutional neural network can learn deep semantic features. Although the emergence of deep learning technology has improved the performance of the face key point algorithm to a certain extent, there are still some shortcomings, such as face pose and occlusion. Although there have been some methods to deal with related problems in recent years, but in low-quality images In real-time situations, the current method is still far from practical application.
现有技术中,人脸关键点���定位方法主要包括在对包含人脸图像进行分 解变换和对称变换之后利用几何关系得到估算定位。但是,在很多困难场景 下,人脸关键点定位的结果仍然无法令人满意,影响到检测精度的外界因素 很多,包括姿势,遮挡,表情,照明。在无约束的环境下,由于面部特征的 自身或环境引起的变化,使得人脸关键点定位这项任务难度较大。传统的人 脸关键点在处理一些精度要求不太高的任务如人脸识别时能取得较好结果, 但面对人脸微表情识别,人脸姿态识别等精度要求较高的任务时误差较大。In the prior art, the method for locating key points of a face mainly includes using a geometric relationship to obtain an estimated position after decomposing and symmetric transforming the image containing the face. However, in many difficult scenes, the results of facial key point location are still unsatisfactory, and there are many external factors that affect the detection accuracy, including pose, occlusion, expression, and lighting. In an unconstrained environment, the task of facial key point localization is difficult due to the changes caused by the facial features themselves or the environment. The traditional face key points can achieve better results when dealing with tasks that require less precision, such as face recognition, but face micro-expression recognition, face pose recognition and other tasks that require higher precision. big.
为了解决这些问题,本发明提出了一种新的人脸图像面部关键点标定的 校准方法。In order to solve these problems, the present invention proposes a new calibration method for the calibration of facial key points in human face images.
发明内容SUMMARY OF THE INVENTION
为解决上述问题,本发明提供一种人脸图像面部关键点标定的校准方法。In order to solve the above problems, the present invention provides a calibration method for calibrating facial key points in a face image.
本发明提供了如下的技术方案。The present invention provides the following technical solutions.
一种人脸图像面部关键点标定的校准方法,包括以下步骤:A calibration method for calibrating facial key points in a face image, comprising the following steps:
以通过训练后的DAN网络获取的第一张人脸图像各关键点的坐标为基 准,对后一帧的人脸图像的各关键点依次进行校准,包括以下步骤:Based on the coordinates of the key points of the first face image obtained by the trained DAN network, the key points of the face image of the next frame are calibrated in turn, including the following steps:
将前一帧人脸图像中校准好的关键点坐标,作为后一帧的备选关键点的 坐标,分别为中心建立两帧图像的梯度特征向量生成区域;Taking the calibrated key point coordinates in the face image of the previous frame as the coordinates of the candidate key points of the subsequent frame, respectively, establish the gradient feature vector generation area of the two frames of images as the center;
通过计算梯度特征向量生成区域的梯度大小与梯度方向,获得区域梯度 特征向量,计算前后两帧人脸图像的区域梯度特征向量的���似性结果α1;By calculating the gradient magnitude and gradient direction of the gradient feature vector generation region, the regional gradient feature vector is obtained, and the similarity result α 1 of the regional gradient feature vectors of the two frames of face images before and after is calculated;
分别以关键点为中心建立两帧图像的卷积特征对比区域,对卷积特征对 比区域进行多次卷积,获得区域卷积特征,计算前后两帧人脸图像中的区域 卷积特征的相似性结果α2;The convolution feature comparison area of the two frames of images is established with the key points as the center, and the convolution feature comparison area is convolved multiple times to obtain the regional convolution features, and the similarity of the regional convolution features in the two frames of face images before and after is calculated. Sexual result α 2 ;
将相似性结果α1与相似性结果α2按比重相加,得到相似性结果β:The similarity result α 1 and the similarity result α 2 are added according to the proportion to obtain the similarity result β:
若大于预设相似性结果阈值,则备选关键点为后一帧人脸图像校准后的 关键点;若小于预设相似性结果阈值,在预设的关键点搜索区域内搜索,直 至获得大于阈值的关键点作为后一帧人脸图像校准后的关键点。If it is greater than the preset similarity result threshold, the candidate key point is the calibrated key point of the face image in the next frame; if it is less than the preset similarity result threshold, search in the preset key point search area until it obtains a key point greater than The key points of the threshold are used as the calibrated key points of the next frame of face image.
优选地,还包括:若搜索完整片搜索区域后仍没有高于预设阈值的点, 则将搜索区域内所有的点以与备选关键点的相似度排序,将相似度排第一的 点作为校准后的关键点。Preferably, it also includes: if there is still no point higher than the preset threshold after searching the complete search area, then all points in the search area are sorted by the similarity with the candidate key points, and the similarity is ranked first. as a key point after calibration.
优选地,所述DAN神经网络模型根据标记的人脸关键点坐标数据,之 后利用CNN完成监督学习。Preferably, the DAN neural network model utilizes CNN to complete supervised learning according to the marked face key point coordinate data.
优选地,所述梯度特征向量生成区域的梯度大小与梯度方向的计算,包 括以下步骤:Preferably, the calculation of the gradient size and gradient direction of the gradient feature vector generation region includes the following steps:
梯度大小分为横向梯度大小Gx与纵向梯度大小Gy为:The gradient size is divided into the horizontal gradient size G x and the longitudinal gradient size G y as:
Gx(x,y)=I2(x+1,y)-I2(x,y)G x (x,y)=I 2 (x+1,y)-I 2 (x,y)
Gy(x,y)=I2(x,y+1)-I2(x,y)G y (x,y)=I 2 (x,y+1)-I 2 (x,y)
式中,(x,y)为像素点坐标;I2为图像Gamma校正后的亮度;In the formula, (x, y) is the pixel coordinate; I 2 is the brightness of the image after Gamma correction;
梯度的幅值为:The magnitude of the gradient is:
梯度的方向为:The direction of the gradient is:
优选地,所述区域梯度特征向量的获取,包括以下步骤:Preferably, the acquisition of the regional gradient feature vector includes the following steps:
为梯度特征向量生成区域中心点距离相同的像素分配相同的权重,将像 素的梯度向量与对应的权重相乘得到权重像素梯度向量,将权重像素梯度向 量相加得到区域梯度特征向量。Assign the same weight to the pixels with the same distance from the center point of the gradient feature vector generation region, multiply the gradient vector of the pixel with the corresponding weight to obtain the weighted pixel gradient vector, and add the weighted pixel gradient vector to obtain the regional gradient feature vector.
优选地,所述相似性计算公式采用余弦相似度计算公式。Preferably, the similarity calculation formula adopts a cosine similarity calculation formula.
优选地,所述将相似性结果α1与相似性结果α2按比重相加,得到相似性 结果β,公式如下:Preferably, the similarity result α 1 and the similarity result α 2 are added in proportion to obtain the similarity result β, and the formula is as follows:
β=w1α1+w2α2,w1+w2=1β=w 1 α 1 +w 2 α 2 , w 1 +w 2 =1
其中,将β设定阈值为0.95。Among them, the threshold value of β is set to 0.95.
优选地,所述在预设的关键点搜索区域内搜索,包括以下步骤:Preferably, the searching in the preset key point search area includes the following steps:
以备选关键点为中心,建立15*15的关键点搜索区域;以备选关键点为 原点开始搜索,先横向搜索,后纵向搜索,直到出现高于阈值的点,即为后 一帧人脸图像的关键点。With the candidate key point as the center, establish a 15*15 key point search area; start the search with the candidate key point as the origin, first search horizontally, then vertically, until there is a point higher than the threshold, that is, the next frame of people keypoints of the face image.
本发明的有益效果:Beneficial effects of the present invention:
本发明提出一种人脸图像面部关键点标定的校准方法,该方法提高了人 脸关键点的抗噪效果,消除了关键点的抖动,提高了关键点定位的精度,能 够完成微表情分类任务。The invention proposes a calibration method for calibrating facial key points of a face image, which improves the anti-noise effect of the key points of the face, eliminates the jitter of the key points, improves the positioning accuracy of the key points, and can complete the task of micro-expression classification .
附图说明Description of drawings
图1是本发明实施例的方法流程图;Fig. 1 is the method flow chart of the embodiment of the present invention;
图2是本发明实施例的人脸关键点定位示意图;2 is a schematic diagram of the location of key points of a human face according to an embodiment of the present invention;
图3是本发明实施例的HOG特征提取流程图3 is a flow chart of HOG feature extraction according to an embodiment of the present invention
图4(a)是本发明实施例的未经过Gamma处理的被试人脸图;Fig. 4 (a) is the subject's face map without Gamma processing according to the embodiment of the present invention;
图4(b)是本发明实施例的经过处理的被试人脸图;Fig. 4(b) is the processed face map of the tested subject according to the embodiment of the present invention;
图5是本发明实施例的关键点附近区域梯度向量生成及对比过程示意图;5 is a schematic diagram of a gradient vector generation and comparison process in the vicinity of a key point according to an embodiment of the present invention;
图6是本发明实施例的局部卷积操作及结果对比图;6 is a local convolution operation and a result comparison diagram of an embodiment of the present invention;
图7(a)是本发明实施例的连续帧人脸图像;Fig. 7 (a) is the continuous frame face image of the embodiment of the present invention;
图7(b)是本发明实施例的连续人脸图像中未经校准的人脸关键点定位示 意图;Fig. 7 (b) is a schematic diagram of uncalibrated face key point positioning in the continuous face image of the embodiment of the present invention;
图7(c)是本发明实施例的连续人脸图像中经校准的人脸关键点定位示意 图。Fig. 7(c) is a schematic diagram of the location of calibrated key points of faces in continuous face images according to an embodiment of the present invention.
具体实施方式Detailed ways
为了使本发明的目的、技术方案及优点更加清楚明白,以下结合附图及 实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施 例仅用以解释本发明,并不用于限定本发明。In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention.
实施例1Example 1
本发明的一种人脸图像面部关键点标定的校准方法,如图1所示,A calibration method for calibrating facial key points in a face image of the present invention is shown in FIG. 1 ,
S1:使用DAN神经网络进行人脸序列的第一张图像关键点定位:S1: Use the DAN neural network to locate the key points of the first image of the face sequence:
采用的数据集为300W,300W数据集是一个比较通用的人脸对齐数据 集,该数据集约有4000个被试者的图像。每个被试者包含多张人脸,但是 每批图像只有一张人脸图像被标注了68个关键点。The dataset used is 300W, and the 300W dataset is a relatively common face alignment dataset with about 4000 subjects' images. Each subject contains multiple faces, but only one face image per batch is annotated with 68 keypoints.
DAN是一种类似级联的神经网络,但是与其他的级联模型不同,其输 入是整张人脸图,DAN可以有效地克服头部姿态带来的问题,达到较好的 检测效果。DAN需要根据标记的人脸关键点坐标数据,利用CNN完成监督 学习。DAN为多层级网络,第一层的输入为人脸关键点的标椎模板和原始 图片,将输入送入前馈神经网络,前馈神经网络由十个卷积层,五个池化层, 一个全连接层构成。每两个卷积层后接一个池化层,激活函数选用ReLU函数,输出为人脸关键点各位置的偏差值。第二层为姿势估计网络,该层网络 主要计算图中人脸的姿态方向,同时根据标准人脸关键点模型进行仿射变换, 将图像中的人脸提出并且摆正。第三层为关键点热图生成。关键点热图是图 像中每一个关键点附近区域(选取区域为25*25)的概率图,该图像素值大 小范围为0到1,位置距关键点越远,像素值越小。热力图这种基于位置的 分布规律用高斯分布来模拟。在经过训练后,得到的模型可以预测人脸图像 的关键点。如图2所示。DAN is a kind of similar cascaded neural network, but different from other cascaded models, its input is the whole face image, DAN can effectively overcome the problems caused by head posture and achieve better detection results. DAN needs to use CNN to complete supervised learning according to the marked face key point coordinate data. DAN is a multi-layer network. The input of the first layer is the face key point template and the original image, and the input is sent to the feedforward neural network. The feedforward neural network consists of ten convolutional layers, five pooling layers, one Fully connected layer. Each two convolutional layers is followed by a pooling layer, the activation function uses the ReLU function, and the output is the deviation value of each position of the key points of the face. The second layer is the pose estimation network, which mainly calculates the pose direction of the face in the image, and at the same time, performs affine transformation according to the standard face key point model, and proposes and straightens the face in the image. The third layer is keypoint heatmap generation. The key point heat map is a probability map of the area near each key point in the image (the selected area is 25*25). The pixel value of the image ranges from 0 to 1. The farther the position is from the key point, the smaller the pixel value. The location-based distribution of heatmaps is modeled with a Gaussian distribution. After training, the resulting model can predict the keypoints of face images. as shown in picture 2.
S2:以通过DAN网络获取的第一张人脸图像各关键点的坐标为基准, 对连续帧的人脸图像的各关键点依次进行校准,包括以下步骤:S2: Based on the coordinates of the key points of the first face image obtained through the DAN network, the key points of the face images of consecutive frames are calibrated sequentially, including the following steps:
S2.1:将连续帧的人脸图像进行灰度化,并进行Gamma校正.S2.1: Grayscale face images of consecutive frames and perform Gamma correction.
图像灰度化将彩色图像转化为灰度图像,图像灰度化后的数据占用的内 存更小,运算速度更快,同时转化为灰度图像后可以增强视觉上的对比效果, 突出人脸关键点附近的区域。灰度化的公式如下:Image grayscale converts color images into grayscale images. The grayscaled data occupies less memory and operates faster. At the same time, after converting to grayscale images, it can enhance the visual contrast effect and highlight the key to the face. area near the point. The formula for grayscale is as follows:
I0=0.3Ir+0.59Ig+0.11Ib I 0 =0.3I r +0.59I g +0.11I b
该公式中的系数来自于人眼对于红色,绿色,蓝色的敏感程度。使用 opencv中的相关函数即可快速实现。Gamma校正的目的是为了调节图像的 对比度,降低图像局部的阴影和光照变化所造成的影响,同时也可以抑制噪 音的干扰。进行Gamma操作首先需要将图像像素进行归一化,归一化公式 如下:The coefficients in this formula come from how sensitive the human eye is to red, green, and blue. It can be quickly implemented using the related functions in opencv. The purpose of Gamma correction is to adjust the contrast of the image, reduce the influence of local shadows and illumination changes in the image, and also suppress the interference of noise. To perform the Gamma operation, the image pixels need to be normalized first. The normalization formula is as follows:
I1=I0/255I 1 =I 0 /255
对归一化后的像素进行Gamma补偿,Gamma补偿的公式如下:Gamma compensation is performed on the normalized pixels. The formula of Gamma compensation is as follows:
I2=I1 gamma I 2 =I 1 gamma
进行反归一化操作,便得到了处理后的图像,如图4所示,图4(a)为未 经过Gamma处理的被试人脸图,图4(b)为经过处理的被试人脸图。Perform the inverse normalization operation to obtain the processed image, as shown in Figure 4. Figure 4(a) is the face image of the subject without Gamma processing, and Figure 4(b) is the processed subject. face map.
S2.2:进行HOG特征提取,过程如图3所示,将前一帧人脸图像中校 准好的关键点坐标,作为后一帧的备选关键点的坐标,分别建立两帧图像的 梯度特征向量生成区域。S2.2: Perform HOG feature extraction. The process is shown in Figure 3. The calibrated key point coordinates in the previous frame of the face image are used as the coordinates of the candidate key points in the next frame, and the gradients of the two frames of images are established respectively. Feature vector generation area.
S2.3:通过计算梯度特征向量生成区域的梯度大小与梯度方向,获得区 域梯度特征向量,计算前后两帧人脸图像的区域梯度特征向量的相似性结果 α1.具体包括:S2.3: Obtain the regional gradient feature vector by calculating the gradient size and gradient direction of the gradient feature vector generation region, and calculate the similarity result α 1 of the regional gradient feature vectors of the two frames of face images before and after. Specifically, it includes:
梯度大小分为横向梯度大小Gx与纵向梯度大小Gy为:The gradient size is divided into the horizontal gradient size G x and the longitudinal gradient size G y as:
Gx(x,y)=I2(x+1,y)-I2(x,y)G x (x,y)=I 2 (x+1,y)-I 2 (x,y)
Gy(x,y)=I2(x,y+1)-I2(x,y)G y (x,y)=I 2 (x,y+1)-I 2 (x,y)
式中,(x,y)为像素点坐标;In the formula, (x, y) is the pixel coordinates;
梯度的幅值为:The magnitude of the gradient is:
梯度的方向为:The direction of the gradient is:
以每个关键点为中心,建立大小为5*5的梯度特征向量生成区域,每个 关键点都会根据其对应的梯度特征向量生成区域梯度特征向量:为梯度特征 向量生成区域距离中心点相同的像素分配相同的权重,将像素的梯度向量与 对应的权重相乘得到权重像素梯度向量,将权重像素梯度向量相加得到区域 梯度特征向量。With each key point as the center, a gradient feature vector generation area with a size of 5*5 is established. Each key point will generate a regional gradient feature vector according to its corresponding gradient feature vector: the gradient feature vector is generated for the region with the same distance from the center point. The pixels are assigned the same weight, the gradient vector of the pixel is multiplied by the corresponding weight to obtain the weighted pixel gradient vector, and the weighted pixel gradient vector is added to obtain the regional gradient feature vector.
相似性计算公式采用余弦相似度计算公式:The similarity calculation formula adopts the cosine similarity calculation formula:
得到梯度特征向量对比���α1。关键点附近区域梯度向量生成及对比过程 示意图,如图5所示。The gradient feature vector contrast value α 1 is obtained. The schematic diagram of the gradient vector generation and comparison process in the vicinity of the key point is shown in Figure 5.
S2.4:分别以关键点为中心建立两帧图像的卷积特征对比区域,对卷积 特征对比区域进行多次卷积,获得区域卷积特征,计算前后帧人脸图像中的 区域卷积特征的相似性结果α2:以每个关键点为中心,建立大小为7*7的卷 积特征对比区域。在卷积对比区域进行三次卷积,卷积核大小均为3*3。局 部卷积操作及结果对比图如图6所示。S2.4: Establish the convolution feature comparison area of the two frames of images with the key points as the center, perform multiple convolutions on the convolution feature comparison area to obtain the regional convolution features, and calculate the regional convolution in the face images of the front and rear frames. Feature similarity result α 2 : With each key point as the center, a convolutional feature comparison area with a size of 7*7 is established. Three convolutions are performed in the convolution contrast area, and the size of the convolution kernel is 3*3. The local convolution operation and the result comparison are shown in Figure 6.
S2.5:将相似性结果α1与相似性结果α2按比重相加,得到相似性结果β: 具体公式如下:S2.5: Add the similarity result α 1 and the similarity result α 2 according to the proportion to obtain the similarity result β: The specific formula is as follows:
β=w1α1+w2α2,w1+w2=1β=w 1 α 1 +w 2 α 2 , w 1 +w 2 =1
将β设定阈值为0.95。下面具体说明一下关键点对比过程:Set the β threshold to 0.95. The following is a detailed description of the key point comparison process:
S2.6:关键点对比过程:S2.6: Key point comparison process:
记第一张人脸图片某关键点的坐标(x1,y1),在第二张人脸图片以(x1,y1)为 中心计算梯度特征向量相似度,卷积特征相似度以及最终的相似性结果β0。 若β0大于0.95,则可以认为第二张人脸图像的(x1,y1)附近的特征与第一张人 脸图像的(x1,y1)附近的特征相似度足够高,即第一张人脸图像的(x1,y1)与第二 张人脸图像的(x1,y1)是同一个点,(x1,y1)也是第二张人脸图像的关键点。若对 比结果低于该阈值,以(x1,y1)为中心,建立15*15的关键点搜索区域。以(x1,y1) 为原点开始搜索,先横向搜索,后纵向搜索,直到出现高于阈值的点(x2,y2),Note the coordinates (x 1 , y 1 ) of a key point in the first face picture, and calculate the gradient feature vector similarity, convolution feature similarity and The final similarity result β 0 . If β 0 is greater than 0.95, it can be considered that the features near (x 1 , y 1 ) of the second face image are sufficiently similar to the features near (x 1 , y 1 ) of the first face image, that is, (x 1 , y 1 ) of the first face image is the same point as (x 1 , y 1 ) of the second face image, and (x 1 , y 1 ) is also the key to the second face image point. If the comparison result is lower than the threshold, a 15*15 key point search area is established with (x 1 , y 1 ) as the center. Start the search with (x 1 , y 1 ) as the origin, first search horizontally, then vertically, until a point (x 2 , y 2 ) higher than the threshold appears,
(x2,y2)即为第二张人脸图像的关键点。若搜索完整个搜索区域后仍没有 高于阈值的点,则将搜索区域内所有的点以与(x1,y1)的相似度排序,将相似 度排第一的点(xn,yn)作为关键点。对所有关键点重复上述过程,即可得到校 正后的关键点。具体效果,如图7所示,图7(a)为连续人脸图像,图7(b)连 续人脸图像中未经校准的人脸关键点,图7(c)为连续图像中经过校准的人脸 关键点。(x 2 , y 2 ) is the key point of the second face image. If there is no point higher than the threshold after searching the entire search area, then all points in the search area are sorted by their similarity to (x 1 , y 1 ), and the first point of similarity (x n , y n ) as key points. Repeat the above process for all keypoints to get the corrected keypoints. The specific effect is shown in Figure 7. Figure 7(a) is a continuous face image, Figure 7(b) is an uncalibrated face key point in the continuous face image, and Figure 7(c) is a calibrated face in the continuous image. face key points.
以上仅为本发明的较佳实施例而已,并不用以限制本发明,凡在本发明 的精神和原则之内所作的任何修改、等同替换和改进等,均应包含在本发明 的保护范围之内。The above are only preferred embodiments of the present invention and are not intended to limit the present invention. Any modifications, equivalent replacements and improvements made within the spirit and principles of the present invention shall be included in the protection scope of the present invention. Inside.
Claims (8)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202210094018.7A CN114419716B (en) | 2022-01-26 | 2022-01-26 | Calibration method for face image face key point calibration |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202210094018.7A CN114419716B (en) | 2022-01-26 | 2022-01-26 | Calibration method for face image face key point calibration |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN114419716A true CN114419716A (en) | 2022-04-29 |
| CN114419716B CN114419716B (en) | 2024-03-15 |
Family
ID=81277438
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202210094018.7A Active CN114419716B (en) | 2022-01-26 | 2022-01-26 | Calibration method for face image face key point calibration |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN114419716B (en) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN115776410A (en) * | 2023-01-29 | 2023-03-10 | 深圳汉德霍尔科技有限公司 | Face data encryption transmission method for terminal identity authentication |
| CN116311490A (en) * | 2022-09-07 | 2023-06-23 | 深圳市优必选科技股份有限公司 | Hunchback sitting posture detection method, device, equipment and storage medium |
| CN116503899A (en) * | 2023-03-31 | 2023-07-28 | 中银金融科技有限公司 | Gesture recognition method and device |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109657609A (en) * | 2018-12-19 | 2019-04-19 | 新大陆数字技术股份有限公司 | Face identification method and system |
| US20190392202A1 (en) * | 2018-10-30 | 2019-12-26 | Baidu Online Network Technology (Beijing) Co., Ltd. | Expression recognition method, apparatus, electronic device, and storage medium |
| CN112149728A (en) * | 2020-09-22 | 2020-12-29 | 成都智遥云图信息技术有限公司 | Rapid multi-modal image template matching method |
| CN112308037A (en) * | 2020-11-25 | 2021-02-02 | 郑州苏一电子科技有限公司 | Facial paralysis detection method based on visual perception and audio information |
| CN112927202A (en) * | 2021-02-25 | 2021-06-08 | 华南理工大学 | Method and system for detecting Deepfake video with combination of multiple time domains and multiple characteristics |
| CN113077470A (en) * | 2021-03-26 | 2021-07-06 | 天翼爱音乐文化科技有限公司 | Method, system, device and medium for cutting horizontal and vertical screen conversion picture |
-
2022
- 2022-01-26 CN CN202210094018.7A patent/CN114419716B/en active Active
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20190392202A1 (en) * | 2018-10-30 | 2019-12-26 | Baidu Online Network Technology (Beijing) Co., Ltd. | Expression recognition method, apparatus, electronic device, and storage medium |
| CN109657609A (en) * | 2018-12-19 | 2019-04-19 | 新大陆数字技术股份有限公司 | Face identification method and system |
| CN112149728A (en) * | 2020-09-22 | 2020-12-29 | 成都智遥云图信息技术有限公司 | Rapid multi-modal image template matching method |
| CN112308037A (en) * | 2020-11-25 | 2021-02-02 | 郑州苏一电子科技有限公司 | Facial paralysis detection method based on visual perception and audio information |
| CN112927202A (en) * | 2021-02-25 | 2021-06-08 | 华南理工大学 | Method and system for detecting Deepfake video with combination of multiple time domains and multiple characteristics |
| CN113077470A (en) * | 2021-03-26 | 2021-07-06 | 天翼爱音乐文化科技有限公司 | Method, system, device and medium for cutting horizontal and vertical screen conversion picture |
Non-Patent Citations (2)
| Title |
|---|
| 秦晓飞;盛凯;朱?;杨勇;赵刚;贾程;李成名;鲁小东;周坚风;: "基于注意力模型的人脸关键点检测算法", 光学仪器, no. 02, 15 April 2020 (2020-04-15) * |
| 郑光远;刘峡壁;韩光辉;: "医学影像计算机辅助检测与诊断系统综述", 软件学报, no. 05, 11 January 2018 (2018-01-11) * |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116311490A (en) * | 2022-09-07 | 2023-06-23 | 深圳市优必选科技股份有限公司 | Hunchback sitting posture detection method, device, equipment and storage medium |
| CN115776410A (en) * | 2023-01-29 | 2023-03-10 | 深圳汉德霍尔科技有限公司 | Face data encryption transmission method for terminal identity authentication |
| CN115776410B (en) * | 2023-01-29 | 2023-05-02 | 深圳汉德霍尔科技有限公司 | Face data encryption transmission method for terminal identity authentication |
| CN116503899A (en) * | 2023-03-31 | 2023-07-28 | 中银金融科技有限公司 | Gesture recognition method and device |
Also Published As
| Publication number | Publication date |
|---|---|
| CN114419716B (en) | 2024-03-15 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN109886121B (en) | Human face key point positioning method for shielding robustness | |
| CN112883896B (en) | A method of micro-expression detection based on BERT network | |
| CN109583340B (en) | A video object detection method based on deep learning | |
| CN114529946B (en) | Pedestrian re-identification method, device, equipment and storage medium based on self-supervised learning | |
| CN107516316B (en) | A Method for Segmenting Static Human Image by Introducing Focusing Mechanism in FCN | |
| CN112801015B (en) | Multi-mode face recognition method based on attention mechanism | |
| WO2020177432A1 (en) | Multi-tag object detection method and system based on target detection network, and apparatuses | |
| CN111062239A (en) | Human target detection method, device, computer equipment and storage medium | |
| WO2020015752A1 (en) | Object attribute identification method, apparatus and system, and computing device | |
| CN106570480B (en) | A Human Action Classification Method Based on Gesture Recognition | |
| CN111274921A (en) | A method for human action recognition using pose mask | |
| CN114419716B (en) | Calibration method for face image face key point calibration | |
| CN112434655A (en) | Gait recognition method based on adaptive confidence map convolution network | |
| CN115862119B (en) | Attention mechanism-based face age estimation method and device | |
| CN110458235B (en) | Motion posture similarity comparison method in video | |
| CN116110118B (en) | Pedestrian re-recognition and gait recognition method based on space-time feature complementary fusion | |
| WO2021051526A1 (en) | Multi-view 3d human pose estimation method and related apparatus | |
| CN110827304A (en) | Traditional Chinese medicine tongue image positioning method and system based on deep convolutional network and level set method | |
| WO2018035794A1 (en) | System and method for measuring image resolution value | |
| CN114118303A (en) | Face key point detection method and device based on prior constraint | |
| CN103971112A (en) | Image feature extracting method and device | |
| CN112200056A (en) | Face living body detection method and device, electronic equipment and storage medium | |
| CN111488766A (en) | Target detection method and device | |
| CN111445496B (en) | Underwater image recognition tracking system and method | |
| CN107766864A (en) | Extract method and apparatus, the method and apparatus of object identification of feature |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |







