您的位置: 专家智库 > >

张昕然

作品数:17 被引量:63H指数:5
供职机构:东南大学更多>>
发文基金:国家自然科学基金国家教育部博士点基金江苏省自然科学基金更多>>
相关领域:电子电信自动化与计算机技术更多>>

文献类型

  • 11篇期刊文章
  • 5篇专利
  • 1篇学位论文

领域

  • 9篇电子电信
  • 4篇自动化与计算...

主题

  • 13篇语音
  • 12篇语音情感
  • 9篇语音情感识别
  • 9篇情感识别
  • 5篇语谱图
  • 4篇识别方法
  • 3篇声学特征
  • 3篇情感
  • 3篇维度
  • 2篇信号
  • 2篇语音信号
  • 2篇直方图
  • 2篇特征提取
  • 2篇统计直方图
  • 2篇图像
  • 2篇情感类型
  • 2篇情感特征
  • 2篇情感维度
  • 2篇自动识别
  • 2篇自动识别方法

机构

  • 17篇东南大学
  • 3篇南京工程学院
  • 3篇烟台大学
  • 2篇盐城工学院
  • 1篇贵州大学
  • 1篇南京大学

作者

  • 17篇张昕然
  • 16篇赵力
  • 10篇陶华伟
  • 10篇查诚
  • 6篇梁瑞宇
  • 4篇宋鹏
  • 4篇徐新洲
  • 3篇王青云
  • 3篇魏昕
  • 3篇黄程韦
  • 2篇杨晶
  • 2篇余华
  • 2篇王如刚
  • 1篇杨平
  • 1篇张旭苹
  • 1篇周琳
  • 1篇吴尘
  • 1篇柳晶晶

传媒

  • 4篇Journa...
  • 3篇信号处理
  • 2篇东南大学学报...
  • 1篇声学学报
  • 1篇数据采集与处...

年份

  • 1篇2019
  • 1篇2018
  • 2篇2017
  • 9篇2016
  • 4篇2015
17 条 记 录,以下是 1-10
排序方式:
面向语音情感识别的语谱图特征提取算法被引量:17
2015年
为研究信号相关性在语音情感识别中的作用,提出了一种面向语音情感识别的语谱图特征提取算法.首先,对语谱图进行处理,得到归一化后的语谱图灰度图像;然后,计算不同尺度、不同方向的Gabor图谱,并采用局部二值模式提取Gabor图谱的纹理特征;最后,将不同尺度、不同方向Gabor图谱提取到的局部二值模式特征进行级联,作为一种新的语音情感特征进行情感识别.柏林库(EMO-DB)及FAU Ai Bo库上的实验结果表明:与已有的韵律、频域、音质特征相比,所提特征的识别率提升3%以上;与声学特征融合后,所提特征的识别率较早期声学特征至少提高5%.因此,利用这种新的语音情感特征可以有效识别不同种类的情感语音.
陶华伟査诚梁瑞宇张昕然赵力王青云
关键词:情感识别语谱图图像纹理特征局部二值模式
基于LDA+kernel-KNNFLC的语音情感识别方法被引量:8
2015年
结合K近邻、核学习方法、特征线重心法和LDA算法,提出了用于情感识别的LDA+kernel-KNNFLC方法.首先针对先验样本特征造成的计算量庞大问题,采用重心准则学习样本距离,改进了核学习的K近邻方法;然后加入LDA对情感特征向量进行优化,在避免维度冗余的情况下,更好地保证了情感信息识别的稳定性.最后,通过对特征空间再学习,结合LDA的kernel-KNNFLC方法优化了情感特征向量的类间区分度,适合于语音情感识别.对包含120维全局统计特征的语音情感数据库进行仿真实验,对降维方案、情感分类器和维度参数进行了多组对比分析.结果表明,LDA+kernel-KNNFLC方法在同等条件下性能提升效果最显著.
张昕然查诚徐新洲宋鹏赵力
关键词:语音情感识别K近邻线性判别分析
一种语音情感维度区域的自动识别方法
本发明公开了一种语音情感维度区域的自动识别方法,属于语音识别技术领域。我们采用了一种特征空间重构的方法进行分类器的优化。第一,我们提取和优化基本声学特征作为区分情感区域的基准;第二,我们采用特征空间重构的方法将多个情感特...
黄程韦赵力张昕然余华杨晶徐新洲陶华伟
文献传递
一种语音情感维度区域的自动识别方法
本发明公开了一种语音情感维度区域的自动识别方法,属于语音识别技术领域。我们采用了一种特征空间重构的方法进行分类器的优化。第一,我们提取和优化基本声学特征作为区分情感区域的基准;第二,我们采用特征空间重构的方法将多个情感特...
黄程韦赵力张昕然余华杨晶徐新洲陶华伟
Auditory attention model based on Chirplet for cross-corpus speech emotion recognition被引量:1
2016年
To solve the problem of mismatching features in an experimental database, which is a key technique in the field of cross-corpus speech emotion recognition, an auditory attention model based on Chirplet is proposed for feature extraction.First, in order to extract the spectra features, the auditory attention model is employed for variational emotion features detection. Then, the selective attention mechanism model is proposed to extract the salient gist features which showtheir relation to the expected performance in cross-corpus testing.Furthermore, the Chirplet time-frequency atoms are introduced to the model. By forming a complete atom database, the Chirplet can improve the spectrum feature extraction including the amount of information. Samples from multiple databases have the characteristics of multiple components. Hereby, the Chirplet expands the scale of the feature vector in the timefrequency domain. Experimental results show that, compared to the traditional feature model, the proposed feature extraction approach with the prototypical classifier has significant improvement in cross-corpus speech recognition. In addition, the proposed method has better robustness to the inconsistent sources of the training set and the testing set.
张昕然宋鹏查诚陶华伟赵力
一种用于语音情感识别的自学习语谱图特征提取方法
本发明公开了一种用于语音情感识别的自学习语谱图特征提取方法,首先对已知情感的标准语料库中的语音进行预处理,得到量化后的语谱图灰度图像;然后计算所得到的语谱图灰度图像的Gabor语谱图;再采用可辨别特征学习算法对提取到的L...
赵力陶华伟魏昕梁瑞宇查诚张昕然
文献传递
Speech emotion recognition via discriminant-cascading dimensionality reduction被引量:1
2016年
In order to accurately identify speech emotion information, the discriminant-cascading effect in dimensionality reduction of speech emotion recognition is investigated. Based on the existing locality preserving projections and graph embedding framework, a novel discriminant-cascading dimensionality reduction method is proposed, which is named discriminant-cascading locality preserving projections (DCLPP). The proposed method specifically utilizes supervised embedding graphs and it keeps the original space for the inner products of samples to maintain enough information for speech emotion recognition. Then, the kernel DCLPP (KDCLPP) is also proposed to extend the mapping form. Validated by the experiments on the corpus of EMO-DB and eNTERFACE'05, the proposed method can clearly outperform the existing common dimensionality reduction methods, such as principal component analysis (PCA), linear discriminant analysis (LDA), locality preserving projections (LPP), local discriminant embedding (LDE), graph-based Fisher analysis (GbFA) and so on, with different categories of classifiers.
王如刚徐新洲黄程韦吴尘张昕然赵力
关键词:DISCRIMINANTANALYSIS
A novel speech emotion recognition algorithm based on combination of emotion data field and ant colony search strategy被引量:3
2016年
In order to effectively conduct emotion recognition from spontaneous, non-prototypical and unsegmented speech so as to create a more natural human-machine interaction; a novel speech emotion recognition algorithm based on the combination of the emotional data field (EDF) and the ant colony search (ACS) strategy, called the EDF-ACS algorithm, is proposed. More specifically, the inter- relationship among the turn-based acoustic feature vectors of different labels are established by using the potential function in the EDF. To perform the spontaneous speech emotion recognition, the artificial colony is used to mimic the turn- based acoustic feature vectors. Then, the canonical ACS strategy is used to investigate the movement direction of each artificial ant in the EDF, which is regarded as the emotional label of the corresponding turn-based acoustic feature vector. The proposed EDF-ACS algorithm is evaluated on the continueous audio)'visual emotion challenge (AVEC) 2012 dataset, which contains the spontaneous, non-prototypical and unsegmented speech emotion data. The experimental results show that the proposed EDF-ACS algorithm outperforms the existing state-of-the-art algorithm in turn-based speech emotion recognition.
查诚陶华伟张昕然周琳赵力杨平
Tunable photonic microwave generated by multi-wavelength Brillouin fiber laser
2017年
Aimed at the problem of narrow tunability and low frequency microwave signal generated by the optical method,a novel approach to stabilizing the tunable photonic microwave generated by the multi-wavelength Brillouin fiber laser is proposed and is experimentally demonstrated.A singlelongitudinal-mode Brillouin fiber laser is designed,and by using the laser,a multi-wavelength Brillouin fiber laser with more than eleven orders of Stokes wave is observed.The wavelength spacing of the adjacent Stokes wave is 0.085 nm.If the Brillouin pump power is increased,the number of Stokes wave output can be further increased.The tunable microwave signals of 10.8 and 21.6 GHz are obtained by heterodyning the Rayleigh wave and Stokes wave of the multiwavelength Brillouin fiber laser.In the experiment,by tuning the pump wavelength and temperature of the gain fiber,microwave signals at different frequencies are generated.The tunable frequency range can be further expanded by using a temperature controller with a wider adjustment range,and the generated microwave signal exhibits high stability on frequency.
王如刚张昕然赵力张旭苹
关键词:MULTIWAVELENGTH
一种用于语音情感识别的自学习语谱图特征提取方法
本发明公开了一种用于语音情感识别的自学习语谱图特征提取方法,首先对已知情感的标准语料库中的语音进行预处理,得到量化后的语谱图灰度图像;然后计算所得到的语谱图灰度图像的Gabor语谱图;再采用可辨别特征学习算法对提取到的L...
赵力陶华伟魏昕梁瑞宇查诚张昕然
文献传递
共2页<12>
聚类工具0