Skip to main content

Interface: VoiceInputFrame

Audio frame delivered to the engine when streaming microphone input.

Properties

channels

channels: number;

Number of audio channels encoded within the frame.


data

data: Buffer;

Raw PCM data for the frame.


encoding

encoding: "pcm16";

Encoding used for data.


isSpeechEnd?

optional isSpeechEnd: boolean;

Signals when the caller believes speech content has ended.


length

length: number;

Frame length in PCM samples.


sampleRate

sampleRate: number;

Sample rate applied to the frame.


speakerId?

optional speakerId: null | string;

Optional speaker identifier for diarization-aware engines.


timestamp

timestamp: number;

Timestamp in milliseconds when the frame was captured.