News

Sandia National Laboratories launches desktop data mining effort to aid COVID-19 research efforts

Sandia National Laboratories is turning to machine learning to assist COVID-19 researchers in combing through existing coronavirus studies and data.

Through a mix of algorithms, analytics, and data mining, Sandia wants to make it easy for scientists to find the data they need right at their office computers. Dubbing the effort rapid-response data science, the labs already ran a seven-day effort to demonstrate its use. Focused on key coronavirus documents, the labs attempted to help biosecurity and public health experts isolate those documents from the thousands of studies currently out there.

In just 10 minutes, they managed to utilize language and character similarities to reduce 29,000 published studies to 87.

“Medical and epidemiological experts can have near-immediate access to existing pertinent research without being data scientists,” Travis Bauer, Sandia computer scientist, said. “With some refinement, this new process can clarify questions our public health experts need answered to fast-track COVID-19 research, particularly as new studies quickly emerge.”

The investigation was a response to a federal fall for a machine-readable COVID-19 dataset, given the wealth of data out there. During the investigation, Bauer and his team ran two studies. One utilized two experts, the other had three. In an initial stage of that research, they made an index of existing research, graphed them, then put them into a searchable natural-language matrix, which they also indexed, to aid searchability and relevance. They then ran those documents against three visualization algorithms.

Of these, two proved potentially useful, with additional tuning. With an analysis algorithm, they then scored all COVID-19 documents through color, plotting them along a two-dimensional graph. This then allowed experts to explore the different colored clusters as batches of COVID-19 related documents. This process proved largely accurate in terms of relevance, though Bauer added that tuning the algorithms will improve performance.

“Even on my office laptop computer, we can sort millions of documents and make them available to the user,” Bauer said. “Technologically, it’s possible to rapidly research and adapt to experts’ needs as they work through a data set. The agility and speed with which the user interface can be developed with the right team on desktop computer systems can provide an ability to respond to specific queries quickly and adapt with the changing needs of the user.”

Chris Galford

Recent Posts

DHS publishes guidelines for securing critical infrastructure and weapons against AI threats

Mere days after the Department of Homeland Security formed a new Artificial Intelligence (AI) Safety…

14 hours ago

U.S. Army and European Command awards KBR $771M contract

KBR will continue to provide life support, equipment readiness, training and supply chain solutions for…

14 hours ago

Spectrum and National Security Act introduced to modernize spectrum policy, revamp FCC authority

In a bid to update federal spectrum and communications network policy, restore the auction authority…

2 days ago

Department of Homeland Security forms AI Safety and Security Board

As a new means to advise the Secretary of Homeland Security and stakeholders, and promote…

2 days ago

National security upgrades, pay raises and more pushed in Servicemember Quality of Life Improvement Act

Members of Congress recently paraded a mix of recommended updates to benefit military service members…

3 days ago

Embattled TikTok in jeopardy as President Biden signs legislative ban

The ByteDance-owned TikTok faces an uphill battle in the United States after President Joe Biden…

6 days ago

This website uses cookies.