Hierarchical Speech Analysis Method for Age, Gender, and Emotion Detection
Summary
The USPTO published patent application US20260100196A1, assigned to Tencent America LLC. It covers a hierarchical, two-stage machine learning method that detects a speaker's age, gender, and emotion from a speech signal. The first learning stage detects an initial age and gender; the second stage refines both and determines the speaker's emotion. The application relates to AI-driven speech processing technology.
What changed
The USPTO published patent application US20260100196A1, assigned to Tencent America LLC, covering a two-stage machine learning method for analyzing speech signals. The first learning stage receives the preprocessed speech signal and detects an initial age and an initial gender. The second learning stage takes those initial estimates as inputs, refines them, and determines the speaker's emotion. This hierarchical processing is aimed at improving detection accuracy across multiple speaker characteristics.
Technology companies developing speech recognition, emotion detection, or speaker analysis systems should review this patent application for its potential implications for product development and intellectual property strategy. The application covers AI-based audio processing, with potential applications in customer service, security, healthcare, and consumer electronics.
What to do next
- Monitor for updates
Archived snapshot
Apr 10, 2026
GovPing captured this document from the original source. If the source has since changed or been removed, this is the text as it existed at that time.
HIERARCHICAL SPEECH ANALYSIS: A PROGRESSIVE APPROACH TO AGE, GENDER, AND EMOTION DETECTION
Application US20260100196A1 Kind: A1 Apr 09, 2026
Assignee
Tencent America LLC
Inventors
Hao ZHANG, Dong YU
Abstract
A method includes receiving a speech input signal of a speaker; preprocessing the speech signal to generate a preprocessed speech signal; detecting, via a first learning stage that receives the preprocessed speech signal, an initial age and an initial gender of the speaker; and determining, via a second learning stage that receives the initial age and the initial gender as inputs, a refined age of the speaker, a refined gender of the speaker, and an emotion of the speaker.
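The data flow in the abstract can be sketched in a few lines of Python. This is only an illustrative skeleton, not the applicants' implementation: the names `preprocess`, `analyze`, `InitialEstimate`, and `RefinedEstimate` are hypothetical, peak normalization is an assumed stand-in for whatever preprocessing the application uses, and the two stub stages return placeholder values where trained models would go. Passing the preprocessed signal to the second stage alongside the initial estimates is also an assumption; the abstract states only that the second stage receives the initial age and gender.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class InitialEstimate:
    age: float   # years
    gender: str


@dataclass
class RefinedEstimate:
    age: float
    gender: str
    emotion: str


# Hypothetical stage signatures; real stages would wrap trained models.
StageOne = Callable[[List[float]], InitialEstimate]
StageTwo = Callable[[List[float], InitialEstimate], RefinedEstimate]


def preprocess(signal: Sequence[float]) -> List[float]:
    """Peak-normalize the waveform (an assumed preprocessing step)."""
    peak = max((abs(x) for x in signal), default=0.0)
    if peak == 0.0:
        return [0.0 for _ in signal]
    return [x / peak for x in signal]


def analyze(signal: Sequence[float],
            stage_one: StageOne,
            stage_two: StageTwo) -> RefinedEstimate:
    """Run preprocessing, then stage one, then stage two on its output."""
    pre = preprocess(signal)
    initial = stage_one(pre)          # initial age and gender
    return stage_two(pre, initial)    # refined age, gender, plus emotion


def stub_stage_one(pre: List[float]) -> InitialEstimate:
    # Placeholder for a trained model: fixed values, for illustration only.
    return InitialEstimate(age=30.0, gender="unknown")


def stub_stage_two(pre: List[float], initial: InitialEstimate) -> RefinedEstimate:
    # Placeholder: passes the initial estimates through unchanged and picks
    # an emotion label from an arbitrary mean-amplitude threshold.
    level = sum(abs(x) for x in pre) / len(pre) if pre else 0.0
    emotion = "aroused" if level > 0.5 else "neutral"
    return RefinedEstimate(age=initial.age, gender=initial.gender, emotion=emotion)


result = analyze([0.0, 0.5, -1.0, 2.0], stub_stage_one, stub_stage_two)
print(result)
```

The point of the sketch is the wiring: the second stage cannot run until the first has produced its estimates, which is what makes the method hierarchical rather than three independent classifiers.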
CPC Classifications
G10L 25/63, G06N 3/045, G10L 25/24, G10L 25/30
Filing Date
2024-10-03
Application No.
18905436
About this page
Source document text, dates, docket IDs, and authority are extracted directly from USPTO.
The plain-English summary, classification, and "what to do next" steps are AI-generated from the original text. Cite the source document, not the AI analysis.