Fitness trackers fail on calories burned calculations
June 6, 2017
Fitness trackers accurately measure heart rate but not calories burned, according to a study at Stanford University School of Medicine.
The inquiry into the accuracy of seven wristband activity monitors showed that six out of seven devices measured heart rate within five per cent. None, however, measured energy expenditure well.
Millions of people wear some kind of wristband activity tracker and use the device to monitor their own exercise and health, often sharing the data with their physician. But are the data accurate?
Such people can take heart in knowing that if the device measures heart rate, it’s probably doing a good job, a team of researchers at the Stanford University School of Medicine reports. But if it measures energy expenditure, it’s probably off by a significant amount.
An evaluation of seven devices in a diverse group of 60 volunteers showed that six of the devices measured heart rate with an error rate of less than five per cent. The team evaluated the Apple Watch, Basis Peak, Fitbit Surge, Microsoft Band, Mio Alpha 2, PulseOn and Samsung Gear S2. Some devices were more accurate than others, and factors such as skin colour and body mass index affected the measurements.
In contrast, none of the seven devices measured energy expenditure accurately, the study found. Even the most accurate device was off by an average of 27 per cent. And the least accurate was off by 93 per cent.
“People are basing life decisions on the data provided by these devices,” said Euan Ashley, professor of cardiovascular medicine at Stanford.
But he said consumer devices weren’t held to the same standards as medical-grade devices, and it was hard for doctors to know what to make of heart-rate data and other data from a patient’s wearable device.
A paper reporting the researchers’ findings was published online in the Journal of Personalized Medicine. Ashley was the senior author. Lead authorship was shared by graduate student Anna Shcherbina, visiting assistant professor Mikael Mattsson and senior research scientist Daryl Waggott.
Manufacturers may test the accuracy of activity devices extensively, according to Ashley, but it’s hard for consumers to know how accurate such information is or the process that the manufacturers used in testing the devices. So Ashley and his colleagues set out independently to evaluate activity trackers that met criteria such as measuring both heart rate and energy expenditure and being commercially available.
“For a lay user, in a non-medical setting, we want to keep that error under ten per cent,” Shcherbina said.
Sixty volunteers – 31 women and 29 men – wore the seven devices while walking or running on treadmills or using stationary bicycles. Each volunteer’s heart was measured with a medical-grade electrocardiograph. Metabolic rate was estimated with an instrument for measuring the oxygen and carbon dioxide in breath – a good proxy for metabolism and energy expenditure. Results from the wearable devices were then compared with the measurements from the two gold standard instruments.
“The heart rate measurements performed far better than we expected,” said Ashley, “but the energy expenditure measures were way off the mark. The magnitude of just how bad they were surprised me.”
The take-home message, he said, is that a user can pretty much rely on a fitness tracker’s heart rate measurements. But basing the number of doughnuts you eat on how many calories your device says you burned is a really bad idea, he said.
Neither Ashley nor Shcherbina could be sure why energy-expenditure measures were so far off. Each device uses its own proprietary algorithm for calculating energy expenditure, they said. It’s likely the algorithms are making assumptions that don’t fit individuals very well, said Shcherbina.
“All we can do is see how the devices perform against the gold-standard clinical measures,” she said. “My take on this is that it’s very hard to train an algorithm that would be accurate across a wide variety of people because energy expenditure is variable based on someone’s fitness level, height and weight, etc.”
Heart rate, she said, was measured directly, whereas energy expenditure must be measured indirectly through proxy calculations.
Ashley’s team saw a need to make their evaluations of wearable devices open to the research community, so they created a web site that showed their own data. They welcome others to upload data related to device performance.
The team is already working on the next iteration of their study, in which they are evaluating the devices while volunteers wear them as they go about a normal day, including exercising in the open, instead of walking or running on a laboratory treadmill.
“In phase two,” said Shcherbina, “we actually want a fully portable study. So volunteers’ ECG will be portable and their energy calculation will also be done with a portable machine.”
The work is an example of Stanford Medicine’s focus on precision health, the goal of which is to anticipate and prevent disease in the healthy and precisely diagnose and treat disease in the ill.
Other Stanford co-authors were clinical nurse specialist Heidi Salisbury, clinical exercise physiologist Jeffrey Christle, Trevor Hastie, professor of statistics and of biomedical data science, and Matthew Wheeler, clinical assistant professor of cardiovascular medicine.