Logo
blank Skip to main content

Ensuring Proper VoIP Call Quality of Your Audio/Video Conferencing Solution: Tips and Tricks

QA

Audio and video conferencing software is getting more popular, and new applications of this type keep entering the market. Thus, itโ€™s essential to ensure your end users are pleased with your solution and donโ€™t have reasons to choose a competitorโ€™s software over yours.

The key is to make sure your development and quality assurance teams focus on ensuring flawless sound. Your target audience probably uses Zoom-like solutions for calling friends and family, attending educational events, and holding work meetings, so itโ€™s crucial for them to be able to clearly hear what others are saying. If you donโ€™t pay enough attention to detecting and fixing issues related to audio quality, you risk ending up with unsatisfied end users and an uncompetitive product.

In this article, we discuss common VoIP quality issues to expect when working on a solution that enables audio and/or video calling functionality. We also share our experience of what tests to perform and what tools to use when measuring sound quality.

VoIP as the basis of audio conferencing solutions

If you want your product to allow end users to make calls and host audio conferences and video meetings, you need to implement Voice over Internet Protocol (VoIP) technology. VoIP lets users make voice calls over the internet, providing benefits over conventional phone calls including cost savings, greater flexibility, and advanced features.

VoIP stands behind many popular communication solutions including Skype, WhatsApp, and FaceTime. Businesses also use it to create their own custom solutions. According to Statista, the global video conferencing market is expected to increase up to $19.1 billion by 2027. And Market Data Forecast predicts that the audio conferencing service market can reach $32.4 billion by the end of 2026.

For your product to survive in the competitive market of audio and video conferencing solutions, you need to make sure it provides flawless audio quality. Letโ€™s start with a brief explanation of how VoIP technology works before we move to common audio issues and testing.

During an audio or video conferencing session, VoIP transmits audio signals over the internet rather than over the cellular network. The VoIP call quality is heavily dependent on the internet connection speed.

When working on VoIP-based software, you must consider four crucial factors:

  1. Bandwidth โ€” This is the difference between the upper and lower frequencies in a continuous band of frequencies. If there are any bandwidth-related issues, audio might sound muffled or distorted.
  2. Sample rate โ€” This is the number of digital samples taken per second when recording the original sound. The higher the sample rate, which is typically expressed in kHz, the better the sound quality. For standard telephony, the sample rate is usually 8 kHz or 16 kHz, while for streaming audio, itโ€™s typically 44.1 kHz.
  3. Bitrate โ€” This is the amount of data contained in a digital audio file, measured in kilobits per second (kbps). A higher bitrate is often an indicator of better audio quality.
  4. Audio codecs โ€” These are algorithms that compress and decompress digital audio.

Signals that occupy a narrow range of frequencies or that have a small fractional bandwidth are called narrowband. Wide-bandwidth data transmission that transports multiple signals over a wide range of frequencies is called broadband. Hereโ€™s how narrowband and broadband differ:

  • To maximize efficiency, narrowband signals rely on the adaptive multirate (AMR) speech codec. AMR compresses and transmits a live stream using a limited range of audio frequencies: 200 Hz to 3.4 kHz. If the sound quality is compromised, itโ€™s usually due to low bandwidth causing the AMR codec to switch to a lower bitrate.
  • Broadband audio offers superior quality with a high-definition format tailored specifically for VoIP. It uses adaptive multi-rate wideband (AMR-WB) speech codecs that cover a wider frequency range: 50 Hz to 7 kHz. AMR-WB allows for better audio reception and for capturing higher and lower sounds to provide richer and more reliable sound quality.

With this information in mind, let’s move to exploring the issues to expect when attempting to ensure high quality in VoIP-based solutions.

Read also:
Specifics of Testing Sound Redirection in Terminal Sessions

6 common audio quality issues in VoIP-based solutions

Audio quality has a significant impact on how information is perceived and assimilated. If sound delays or errors occur, end users can be unsatisfied with your product. To understand how to improve your VoIP call quality, you first need to know what problems to expect.

Letโ€™s explore several potential issues that can affect audio quality and make it difficult for users to understand each other:

6 common issues that affect audio quality in VoIP-based solutions

1. Limited bandwidth can lead to a decline in overall audio quality. If the available bandwidth is limited, audio quality can suffer due to compression, data loss, or delays. 

This can make it difficult for participants to hear each other, especially if they have different accents or speak quickly. The negative impact is especially severe in remote meetings, where participants may rely entirely on audio cues to understand each other.

2. Reverberations occur when sound waves bounce off hard surfaces, such as walls or ceilings, creating a lingering echo. They can make audio sound harsh or unnatural.

3. Crosstalk occurs when multiple participants speak at once, creating overlapping audio signals that can be difficult to separate.

4. Background noise should be minimized, as it can make it difficult to hear what participants are saying, leading to miscommunication or misunderstandings.

The volume of background noise also affects how well it is picked up by the microphone. A microphone may not pick up faint sounds, such as quiet speech or distant noises. But if the hum of a fan or the sound of traffic outside is quite loud, the microphone will pick up unwanted background noise. It’s vital to consider microphone testing in ensuring the proper audio quality of your solution, because it will help you see how different types of microphones work with your software.

5. Clipping and dropping. Clipping occurs when a word is cut off, while dropping occurs when an entire word is missing, changing the meaning of what was said.

These issues can be caused by dropped packets, which can occur due to:

  • Bandwidth limitations
  • Misconfigured interfaces
  • Microburst link floods
  • Underpowered network equipment
  • Cabling faults
  • Out-of-order packets

If users report clipping at the start or end of words, the reason could be the Voice Activity Detection (VAD) in the VoIP hardware. This can be fixed by adjusting sound level settings or disabling VAD / silence suppression until the issue is resolved.

6. Latency in digital audio refers to the delay that occurs when an analog signal is converted to digital form and then outputted from computer memory. Below, we explain in detail how to test and minimize latency.

With that in mind, letโ€™s move on to the active part of ensuring decent sound quality in your VoIP-based solution and discuss where to start.

Read also:
System Performance Bottleneck Analysis: Data Center Running VoIP and MS SQL

Ensuring proper sound quality: where to start?

The first thing to do to evaluate your solutionโ€™s sound quality is to analyze the sound spectrum.

A sound spectrum is a representation of sound that displays the different frequencies present. To perform sound spectrum analysis, you need to use a spectrum analyzer ใƒผ a tool that breaks down a signal into individual frequencies and shows the level of each frequency in a graph.

A spectrum analyzer can help you detect unwanted sounds, such as wind and DC offset, by showing a visual representation of sound frequencies. Such defects are difficult to identify just by listening to a recording. A visual representation of how the spectrum of frequencies changes over time is called a spectrogram.

By analyzing sound with a spectrum analyzer, you can:

  • Identify the properties of the sound, like its frequency and whether there are any additional frequencies that donโ€™t relate to your sound
  • Identify and remove unwanted sounds in music production
  • Study the phonetics of speech
  • Find hard-to-hear defects like high-pitched sounds (15 kHz in the example below) which may not be noticeable to the human ear
Example of what a spectrogram looks like
Screenshot 1. Example of what a spectrogram looks like

Spectrum analyzers differ in the methods they use to obtain the spectrum of a signal. The most common types are:

  1. Swept-tuned analyzer, also called a sweep or swept spectrum analyzer โ€” works like a radio receiver with a display indicating the output level
  2. FFT analyzer โ€” computes a time sequence of periodograms using the fast Fourier transform (FFT) algorithm

Once youโ€™re done with spectrum analysis, itโ€™s time for tests.

Tests and tools for measuring sound quality

From our experience, performing these four crucial tests will help you measure the sound quality of your VoIP-based solution:

 tests and for each test
  1. A latency test measures the delay between the time a signal is sent and received. The lower the latency, the better the VoIP call quality. Based on latency test results, you can conclude whether a network setup is suitable for real-time VoIP communication.
  1. The noise floor (room tone) helps to establish the level of background noise in a VoIP call. A lower noise floor indicates less background noise, which can result in a clearer call. You can use noise reduction tools like noise suppression and echo cancellation to minimize unwanted background noise in the call.
  1. Loudness is a measure of the volume of the VoIP call. Itโ€™s important to ensure that the call volume is not too low or too high. Measuring loudness can help to make sure the call volume is at an appropriate level for the intended use.
  1. Microphone testing can help you determine the quality of a microphone being used for VoIP communications and check how different types of microphones work with your solution. Most tools for mic testing offer a variety of tests to assess the microphoneโ€™s performance in different scenarios, including noise reduction, frequency response, and sensitivity.

Now, letโ€™s explore each test in detail.

1. Latency testing

Latency testing is one of the key tests your team should conduct to achieve a quality product. Before showing you how to test it, letโ€™s dive into some details.

To be able to thoughtfully test latency and minimize delays in your solution, you first need to understand what causes them. Here are the factors that most influence latency:

  • The data input buffer size is the main cause of delays and depends on the processorโ€™s capabilities
  • The analog-to-digital conversion (ADC) and digital-to-analog conversion (DAC) stages add a minor delay ใƒผ 0.5ms combined (0.25ms each) on average
  • The data transfer protocol usually introduces a negligible delay

An efficient audio driver is what helps you minimize latency. Letโ€™s explore its place in audio inputโ€“output to a digital workstation (DAW), where the DAC stage converts the digital signal back to analog form:

A block diagram of audio inputโ€“output to a DAW

The sample rate determines the number of samples per second. The higher the sample rate, the more samples will represent the original analog waveform. Since time remains constant, with a higher sample rate, each individual sample represents a shorter amount of time compared to a lower sample rate.

Sample rate, buffer size, and latency are all interconnected. By adjusting the buffer size, which is measured in samples, you can change the resulting latency. A larger buffer size results in higher latency. The sample rate, on the other hand, determines the number of samples taken per second. A higher sample rate means each individual sample is shorter, resulting in less latency.

To calculate the total latency of a system, you can use the following formula:

(Buffer size / sample rate) ร— 2

However, itโ€™s important to take into account fixed delays, such as analog-to-digital and digital-to-analog conversion and audio driver delays.

Here are two practical examples to help illustrate the relationship between sample rate, buffer size, and latency:

1. Letโ€™s say youโ€™re recording audio using a DAW:

  • If you set the buffer size to 256 samples and the sample rate to 44.1 kHz, the total latency of the system would be (256 / 44100) ร— 2 = 0.0116 seconds, or 11.6 milliseconds.
  • If you increase the buffer size to 512 samples, the total latency will increase to 23.2 milliseconds.
  • If you increase the sample rate to 88.2 kHz while keeping the buffer size at 256 samples, the total latency will decrease to 5.8 milliseconds.

2. In the context of video conferencing, the buffer size and sample rate can affect the delay between audio and video. Letโ€™s explore a few examples:

  • If youโ€™re using a video conferencing app with a buffer size of 256 samples and a sample rate of 48 kHz, you may notice that the audio and video are slightly out of sync.
  • If you increase the buffer size to 512 samples and decrease the sample rate to 44.1 kHz, the delay may be even more noticeable.
  • To reduce the delay, you can try increasing the sample rate while keeping the buffer size small or using a dedicated low-latency audio interface.

How can you test latency?

To show you an example of latency testing, weโ€™ll use the open-source cross-platform Audacity software.

Before we start testing latency, letโ€™s briefly discuss possible devices:

  • An external microphone. To record overdubs, set up the microphone in front of a speaker or headphones and turnโ€ฏSoftware Playthroughโ€ฏoff.
  • A laptopโ€™s onboard microphone. To record overdubs, simply use the onboard microphone and speakers.
  • A desktop computer. In this case, youโ€™ll need to connect the line input and output ports with a loopback cable. The type of cable required will depend on the way youโ€™re connecting your recording equipment to the computer. If youโ€™re connecting your recording equipment to the line input and line output jacks on your computer, youโ€™ll need a cable with a stereo mini plug at each end.
  • If youโ€™re using a USB audio interface device for input and/or output, plug the loopback cable into the appropriate jacks in the USB interface. Depending on your interface, you may need a specific type of loopback cable.

Note that using some USB microphones might not be the best way to record overdubs.โ€ฏThe problem with some of these microphones is that the only way you can hear yourself in your headphones is by turningโ€ฏSoftware Playthroughโ€ฏon.โ€ฏSoftware Playthrough introduces its own delay that you will hear in your headphones.โ€ฏFor example, in our project, using a USB microphone added 65 milliseconds of delay.

Now, letโ€™s discuss Audacity settings for latency testing.

Recording preferences

1. In the top menu bar, choose onโ€ฏEditโ€ฏand > Preferences.

Choosing preferences
Screenshot 2. Choosing preferences

2. In theโ€ฏRecording tab, check the box for Play other tracks while recording (overdub) as shown in the screenshot below:

Checking the Play other tracks while recording (overdub) box
Screenshot 3. Checking the Play other tracks while recording (overdub) box

With Audacityโ€™s default settings, theโ€ฏRecordโ€ฏbuttonโ€ฏwill append a recording onto your existing track. To record to a new track for multitrack overdubbing, youโ€™ll need to press Shiftโ€ฏand selectโ€ฏRecord New track,โ€ฏorโ€ฏuse the shortcutโ€ฏShift + R. You can change the default keyboard shortcut inโ€ฏRecording Preferencesโ€ฏby checking Record on a new track.

Device preferences

1. In the Sampling section of the Quality tab, we set the default settings for our app (you can select other values if your app requires): 

  • Default Sample Formatโ€ฏโ€” 16-bit
  • Default Sample Rate โ€” 48000 Hz
Configurations in the Sampling section
Screenshot 4. Configurations in the Sampling section

2. In the Latency section of the Devices tab, set the following:

  • Latency compensationโ€ฏtoโ€ฏ0โ€ฏ so that you get an absolute reading
  • Leave theโ€ฏBuffer lengthโ€ฏvalue at its default value

Then clickโ€ฏOK.

Configurations in the Latency section
Screenshot 5. Configurations in the Latency section

The Buffer length value will affect latency, but the effect can vary between systems. 100 milliseconds is a safe value that wonโ€™t put too much load on your CPU. Too low of a value may put too much load on the CPU, causing dropouts in the recording. If youโ€™re feeling adventurous, you can repeat this test with different values for buffer length. But in the end, you should settle on values for both buffer length and latency compensation and leave them be.

3. We recommend you go to the View page and enable the Device Toolbar:

Enabling the Device Toolbar
Screenshot 6. Enabling the Device Toolbar

If you change the recording, playback, devices, real audio host, or audio hostโ€ฏinโ€ฏthe Device Toolbar, youโ€™ll need to run this test again. Each Audacity test is only representative for the specific inputs, outputs, and hosts used during the test.

Selection Toolbar

1. Inโ€ฏthe Selection Toolbar,โ€ฏmake sure that Snap-To is set to Off. Above the second and third group of numbers, make sure thatโ€ฏStartโ€ฏand Length of Selectionโ€ฏis selected.

2. Click on one of the downward-pointing arrows in the digitโ€™s boxes to the right ofโ€ฏSnap-Toโ€ฏand select hh:mm:ss + milliseconds.

Configurations in the Selection toolbar
Screenshot 7. Configurations in the Selection toolbar

Performing the latency test

As weโ€™re using a standalone microphone to record overdubs in our example, weโ€™ll start with setting up the microphone in front of a speaker or headphones and turningโ€ฏSoftware Playthroughโ€ฏoff. Weโ€™ll use the Realtek (R) Audio laptop speakers and a USB microphone (USB Audio Device).

Note: Turn off unnecessary applications before starting the test so that your machine can allocate most resources to playing and recording audio.

Hereโ€™s how to test latency without an application or a driver:

  1. Set up the devices in the Device toolbar.
  2. Click theโ€ฏRecord new trackโ€ฏbuttonโ€ฏin theโ€ฏTransport Toolbar and record your voice.
  3. Pressโ€ฏShiftโ€ฏand click Record new trackโ€ฏโ€ฏin theโ€ฏTransport Toolbar. Then, the recorded voice will be played back and recorded to a new track.โ€ฏThe top track (Audio 1) is the original recording, and the bottom track (Audio2) is the looped-back recording. Note the delay.
  4. Zoom in so you can see the sound spikes on the top track and its delayed version on the bottom track.
  5. Usingโ€ฏthe Selection Tool, drag a selection from the start of the voice in Audio 1 and ending at the start of the delayed voice in Audio 2. To check the results, click theโ€ฏMuteโ€ฏbutton on Audio 1 and play Audio 2 to hear where the sound begins.

The selections go from the original voice to the delayed version in the bottom recording. 

You can now find out the latency directly from the second panel of numbers:

Latency is shown directly in the second panel of numbers
Screenshot 8. Latency is shown directly in the second panel of numbers

Without using our application, we can see that we haveโ€ฏ157 milliseconds ofโ€ฏlatency.

Hereโ€™s how to test latency with an application or driver:

1. For our app, we set the devices to USB Audio Device as the microphone and Realtek (R) Audio for the speakers. Also, we set the toggles that enable a neural network on the microphone and speakers: microphone On, speaker Off.โ€ฏ

Selecting devices
Screenshot 9. Selecting devices

2. Clickโ€ฏTransport and choose Rescan Audio Devices to detect your device:

Choosing Rescan Audio Devices
Screenshot 10. Choosing Rescan Audio Devices

3. Set up the devices in the Device toolbar (Microphone โ€“ your application or driver device, speaker โ€” Realtek (R) Audio). Note: For measuring latency with the neural network enabled, we continue using the default speakers.

4. Repeat steps 2โ€“5 from the previous test.

Here are the results:

Latency test results
Screenshot 11. Latency test results

When using our driver, we have 406 milliseconds of latency with the neural network enabled. Please note that in these examples, we only explored two configurations. In real life, you need to test all possible configurations to detect latency.

Read also:
Improving a Windows Audio Driver to Obtain a WHQL Release Signature

2. Noise floor measurement

The noise floor refers to the level of noise in an electrical circuit or component. It can take the form of a hum, hiss, rumble, or other low-level sounds. To measure it, you can use the Noise Gate plugin, which is preinstalled in Audacity.

First, you need to configure the plugin. Noise Gate in Audacity has three main settings:

  1. Level reductionโ€ฏis best to set between โ€ฏ-10dB and -20dB, as you want to leave some of the room sound for a more natural feel.
  2. Gate threshold should be set 3dB to 5dB above the noise floor.
  3. Attack/Decay, in our experience, is best to set around 80ms, as going any lower starts to cut the end of sentences. If you notice choppiness at the end of sentences, try increasing this parameterโ€™s value.

An important note on how Noise Gate works: If input audio is below the threshold, the plugin closes and reduces the audio volume by an amount you specify in level reduction. If input audio is above the threshold, the plugin stays open and does nothing.

Then, you can start measuring the noise floor using Audacity:

  1. Open the recording in Audacity (if youโ€™ve already recorded it), or make a new recording.
  2. On the far left-hand side of the recorded track, click the down arrow.
  3. Click the Waveform view (it may already be selected):
Choosing the Waveform view
Screenshot 12. Choosing the Waveform view

4. Right-click the left scale (0.1 and so on) and choose Logarithmic (dB):

Choosing the Logarithmic option
Screenshot 13. Choosing the Logarithmic option

5. Zoom in on the recording (by pressing Ctrl + 1) to make the trackโ€™s size larger.

6. Play the recording until you hear a breath or a section without voice. When you hear it, make a note of approximately the noise level where it begins. In the screenshot below, you can see that its loudest level is -20 dB.

Finding the loudest noise level
Screenshot 14. Finding the loudest noise level

3. Loudness measurement

Weโ€™ll show how to measure loudness using two different tools:

  1. Loudness Meter effect in Adobe Audition
  2. Youlean Loudness Meter plugin in Audacity

1. Measuring loudness using the Loudness Meter effect in Adobe Audition

Adobe Audition is professional audio editing software for recording, mixing, and mastering audio files for a variety of applications.

It has a built-in Loudness Meter tool. Hereโ€™s how to start working with it:

  1. Open the Effects Set tab on the left side of the user interface.
  2. Click the small triangle to open the drop-down menu and select Special > Volume Meter, and the effect will be applied automatically.
  3. To save your settings for future use, select Effects > Special > Loudness Meter > Effect, then check the Loudness Meter dialog box, adjust the settings (see the next step), and click Apply.
  4. To manually change the measurement levels in the Loudness Meter, right-click or double-click it to open the Fx Track Editor โ€“ Loudness Meter dialog box and click Settings. If editing levels manually is inconvenient, you can use a settings template based on commonly used standards or default settings.
  5. In the Settings tab, youโ€™ll see the default Loudness settings that are good to go.
  6. To switch to the channel map editor, click the icon in the upper left corner of the Fx Track Editor and check the Loudness Meter box.

Once all preparation steps are done, letโ€™s move to measuring track loudness:

  1. Right-click (in Windows) or hold Ctrl and click (in macOS) on the Loudness Meter effect and choose Edit Selected Effect from the drop-down menu.
  2. Press the spacebar or the Play button on the program monitor to play. During playback, the loudness meter monitors the loudness of all playback channels and displays the results as a range of blue, green, and yellow values.
  3. As a result, you get an accurate loudness measurement and can use it as the basis for deciding how to change the loudness levels.

2. Applying the Youlean Loudness Meter plugin to measure track loudness in Audacity

This is a free loudness meter plugin that can be used in many popular DAWs. Hereโ€™s a step-by-step guide on how to use the Youlean Loudness Meter with Audacity:

  1. Download and install the Youlean Loudness Meter plugin from the Youlean website.
  2. Open Audacity and import your audio file.
  3. Go to the Effects menu and select Add/Remove Plug-ins.
  4. Click Add and select the Youlean Loudness Meter plugin from the location where you installed it.
  5. Close the Add/Remove Plug-ins window and select the portion of your audio you want to measure.
  6. Go to the Effect menu and select Youlean Loudness Meter.
  7. In the Youlean Loudness Meter window, select the appropriate Loudness Standard for your region (e.g., EBU R128 for Europe, ATSC A/85 for North America).
  8. Click Analyze to measure the loudness of your audio.

The Youlean Loudness Meter will display the loudness measurements for your selected portion of audio. You can adjust the playback level of your audio and re-run the analysis to see the effect on the loudness measurement.

You can also use the Youlean Loudness Meter to measure the overall loudness of your entire audio file. To do this, select the entire audio file and run the analysis as described above.

4. Microphone testing

Testing microphones is essential for delivering a high-quality audio conferencing solution. You should check how different types of microphones work with your product to ensure wide compatibility for end users. When detecting VoIP call quality issues during such tests, you should investigate the reasons behind them and introduce fixes to improve your productโ€™s compatibility with customer devices.

Letโ€™s discuss how to test a microphone using Mictests.com. This online test is a useful tool for testing a microphone, finding out technical information about it, and checking whether it works correctly.

How can you check a microphone on mictests.com?

1. Select the desired microphone.

Microphone testing using Mictests.com
Screenshot 15. Microphone testing using Mictests.com

2. Clickโ€ฏCheck my microphone.

3. Grant access to your microphone.

4. Say something or make some noise.

Microphone testing results
Screenshot 16. Microphone testing results

5. After a few seconds, youโ€™ll see the results on the left side of your screen or in theโ€ฏMicrophone Informationโ€ฏblock.

Apart from the tests mentioned in this section, you also should check:

  • Total harmonic distortion (THD), which measures harmonic distortion in a signal. In audio systems, lower distortion means that components in a microphone or other equipment more accurately reproduce audio. The cause of harmonic distortion might be that the output signal has components that are not present in the input signal.
  • Audio power is the electrical power transferred from an audio amplifier to a loudspeaker, measured in watts.
  • Frequency response describes the range of frequencies and audio tones a piece of equipment can produce.
  • Signal-to-noise ratio measures the quality of sound compared to background noise.

Performing all key tests will help you better evaluate the sound quality provided by your VoIP-based solution, identify issues, and fix them before delivering your product to end users.

Related services

Custom Software and Application Development Services

Conclusion

Building Skype-like applications is challenging. With so many products on the market, you need to ensure top-notch quality of each and every functionality to make your solution stand out. And thoroughly testing sound quality is one of the essentials.

Conducting measurements described in this article can help you assess the quality of VoIP calls and identify issues that can compromise them. Using these tests, you can detect and fix issues and achieve better audio quality in VoIP calls, resulting in clearer communication and a better user experience.

However, working on a real-life product requires much more knowledge, skills, and work than weโ€™ve mentioned in this article. At Apriorit, we have experienced teams of software developers and quality assurance specialists who will gladly assist you with delivering competitive Zoom-like software.

Contact us to start working on your VoIP-based application!

Have a question?

Ask our expert!

Tell us about your project

Send us a request for proposal! Weโ€™ll get back to you with details and estimations.

Book an Exploratory Call

Do not have any specific task for us in mind but our skills seem interesting?

Get a quick Apriorit intro to better understand our team capabilities.

Book time slot

Contact us