The CSI effect

On the perception of forensic science.

Deep learning and voice comparison: phonetically-motivated vs. automatically-learned features

Broadband spectrograms of French vowels /ɑ̃/, /a/, /ɛ/, /e/, /i/, /ə/, and /ɔ/ extracted from radio broadcast corpora were used to recognize 45 speakers with a deep convolutional neural network (CNN). The same network was also trained with 62 …

Towards phonetic interpretability in deep learning applied to voice comparison

A deep convolutional neural network was trained to classify 45 speakers based on spectrograms of their productions of the French vowel /ɑ̃/ Although the model achieved fairly high accuracy – over 85 % – our primary focus here was phonetic …


Forensic Voice Comparison