What Transcription Returns
Each completed transcription job gives you:Speaker-Labeled Transcript
Every segment is tagged with the speaker role —
agent or customer — along with start time, end time, and a confidence score.Smart Summary
A 2–3 sentence plain-language summary of the call: what was discussed, what the customer’s situation was, and what the outcome or next step is.
Sentiment Analysis
Overall call sentiment (positive, neutral, negative) with per-segment sentiment scores so you can see how the conversation evolved.
Intent Classification
A structured intent label — interested, callback requested, not interested, complaint, escalation — derived from the full conversation context.
Speaker Diarization
NeuronLens automatically separates and labels the two sides of a call. You do not need to submit separate audio channels — the diarization model identifies who is speaking based on acoustic patterns and conversational structure. Pass"speakers": 2 in your request to signal a two-party call. If your recording includes a conference or a three-way call, set the value accordingly.
For best diarization accuracy on single-channel (mono) recordings, ensure the audio is at least 8 kHz sample rate. Stereo recordings with agent and customer on separate channels produce the most precise speaker separation.
Submitting a Recording
Send aPOST request to /v1/transcription with the audio URL and the features you want to enable.
Request Parameters
A publicly accessible URL or a signed URL pointing to your audio file. NeuronLens fetches the file at processing time and does not store your audio beyond that window.
BCP-47 language code for the primary language spoken in the call. For example,
hi-IN for Hindi, en-IN for Indian English, ta-IN for Tamil. See the supported languages table below.Number of distinct speakers in the recording. For standard two-party agent–customer calls, use
2.List of analysis features to run. Accepted values:
"transcription", "summary", "sentiment", "intent". Pass all four to get the full analysis in a single job.Optional. If provided, NeuronLens posts the completed result to this URL when processing finishes, so you do not need to poll.
Optional. A free-form key-value object you can attach to the job for your own reference — for example,
{ "agent_id": "ag_001", "campaign_id": "camp_xyz" }.Initial Response
NeuronLens returns a job object immediately. Processing happens asynchronously.Unique identifier for this transcription job. Use it to fetch results.
Current job state:
pending, processing, or completed. A failed status includes an error field with a reason.Approximate seconds until the job completes. Actual time depends on audio duration and current queue depth.
Fetching Results
Poll the job status endpoint untilstatus is completed, or wait for your webhook callback.
Completed Response
Plain-language 2–3 sentence summary of the call outcome.
Top-level sentiment:
positive, neutral, or negative.Classified customer intent for the call.
Ordered array of speech segments. Each segment includes
speaker, text, start_time, end_time, and confidence.Bulk Submission
To submit multiple recordings at once, use the batch endpoint:batch_id and an array of individual job_id values. Results are delivered per job to your webhook URL as each one completes.
Supported Audio Formats
| Format | Extension | Notes |
|---|---|---|
| WAV | .wav | Recommended; supports PCM and compressed variants |
| MP3 | .mp3 | Widely supported; slight quality trade-off vs. WAV |
| OGG Vorbis | .ogg | Common in WebRTC-based recording setups |
| FLAC | .flac | Lossless; larger file size |
| MPEG-4 Audio | .m4a | Common in mobile recording applications |
Supported Languages
Audio files must be accessible via a public URL or a time-limited signed URL (minimum 15 minutes validity). NeuronLens fetches the file once during processing and does not store your audio beyond that window. Make sure your storage bucket does not require IP allowlisting that would block VInfer’s processing servers.
| Language | Code | Notes |
|---|---|---|
| Hindi | hi-IN | Including Hinglish (code-switched Hindi–English) |
| Indian English | en-IN | Tuned for Indian accents across regions |
| Tamil | ta-IN | |
| Telugu | te-IN | |
| Marathi | mr-IN | |
| Bengali | bn-IN | |
| Kannada | kn-IN | |
| Gujarati | gu-IN | |
| Malayalam | ml-IN | |
| Punjabi | pa-IN | |
| Standard English | en-US | For calls with non-Indian participants |