Skip to content

Biomedical Research Employing Surveillance Technology

U.S. researchers from the Chan Zuckerberg Initiative, a philanthropic organization, have compiled a dataset charting occurrences of open-source software in biomedical research publications. The dataset was prepared using an AI system to identify software references in 3.9 million open access...

Usage of Monitoring Software in Biomedical Scientific Studies
Usage of Monitoring Software in Biomedical Scientific Studies

Biomedical Research Employing Surveillance Technology

The Chan Zuckerberg Initiative (CZI) has created a significant dataset of open-source software mentions in biomedical research papers, offering valuable insights into the role of open-source software in the field. This dataset, which is a part of CZI's open science efforts, can be found on their official open-source or open science websites, such as the Early Open Source Software program page at https://chanzuckerberg.com/eoss.

The dataset was curated using an AI system that scanned through 3.9 million open access biomedical research papers and 16.9 million papers provided to the team. In the first subset, the AI system identified over 19 million total mentions and 1.6 million unique mentions of software in 2.5 million papers. In the second subset, the AI system found 48 million total mentions and 934,704 unique mentions of software in 2.9 million papers.

Researchers can use this dataset to better understand the role of open-source software in biomedical research and potentially improve its use in future studies. The dataset allows researchers to identify successful uses of open-source software in the biomedical field. In fact, the team behind the dataset has provided a repository link for 185,000 of the unique software mentions in the first subset.

To access this dataset, follow these steps:

  1. Visit the Chan Zuckerberg Initiative’s open source/software pages at https://chanzuckerberg.com/eoss, where you can find curated lists, datasets, or links related to software used in biomedical research.
  2. Check for any associated GitHub repositories or public data portals linked on CZI’s official site, as these are common places for open data sharing.
  3. Contact CZI or check publications linked to their bioinformatics projects, such as Nextflow workflows or other bioinformatics tools they support, for references to datasets or supplemental data.

While no direct URL for a specific dataset of software mentions was found in these search results, the primary source for such data would be through CZI's official open-source and open science initiatives webpages and repositories. If you require the exact dataset and it is not publicly listed, reaching out directly to CZI via their contact or collaboration channels could provide access or guidance on obtaining the dataset.

This dataset has the potential to advance science and technology research by providing insights into the use of open-source software in biomedical research, ultimately leading to improved outcomes in the field.

  1. The dataset curated by the Chan Zuckerberg Initiative, containing mention of open-source software in biomedical research papers, was created using artificial intelligence and can be found on their official open-source or open science websites.
  2. Researchers can utilize this dataset to study the role of open-source software in the biomedical field and potentially increase its effectiveness in future medical-conditions research.
  3. The dataset allows researchers to identify successful implementations of open-source software in health-and-wellness, as evidenced by the repository link provided for 185,000 unique software mentions in the first subset.
  4. The potential applications of this dataset extend beyond biomedical research, as it has the capacity to advance technology and science research and ultimately lead to better outcomes in various fields.

Read also:

    Latest