User Stories
Featured
NOAA
NOAA leverages Globus in proof of concept
Auth API, Globus Compute
Globus Auth and Globus Compute enable NOAA to overcome capacity challenges and build federated, distributed pipelines which will require several hundred thousand cores or more.
-
University of Michigan, Michigan State, Wayne State and Indiana University
OSiRIS Project for Distributed, Multi-Institutional Storage
Data portal, Platform
Uses Globus Connect data transfer nodes (each at 2 x 25Gbps minimum) for wide-area data transfer.
University of Pittsburgh
Building a portal for the Human BioMolecular Atlas Program
Data portal
The University of Pittsburgh, together with the Pittsburgh Supercomputing Center, is one of five funded components contributing to the infrastructure for the NIH-funded Human BioMolecular Atlas Program (HuBMAP), which relies heavily on Globus for software infrastructure to build a frictionless research platform.
Argonne National Lab
Sharing Data Using ACDC
Data portal, Share, Transfer
ACDC is the ALCF Community Data Co-op service configured to allow PIs to broadly share their data hosted on Eagle using Globus, by applying simplified interfaces.
UCSD School of Medicine
Streamlining Reproducible Genomics Analysis with GenePattern and the Globus Platform
Auth API, Transfer, Platform
Researchers needed an easy-to-use platform for reproducible bioinformatics research, so the GenePattern Gateway for Genomic Analysis was developed. The Globus platform services are integrated into GenePattern and enable easy access to, and rapid transfers of, large data sets.
UCSD
Accelerating Research at UCSD with Globus
Researchers are now able to reliably transfer and share large data sets, and even share protected data with ease. With the increase in Globus managed endpoints, researchers at UC San Diego can easily transfer and share data both internally and with researchers outside UC San Diego.
-
University of Michigan
Genomic Researchers Rely on Globus for Large Data Transfers
Transfer
The Biomedical Research Core Facility, like all facilities dealing with human genome sequencing, is experiencing explosive data growth. Fact: every two days they are delivering more data than they did in their first 20 years of existence. The Advanced Genomics core team employs both “push and pull” methods to handle data transfers with Globus.
International Brain Laboratory
Globus Provides Reliable Transfer Service for Collaborative Research at the International Brain Laboratory
Transfer
The International Brain Laboratory (IBL) is a collaborative research organization that focuses on neuroscience and the study of brain-wide circuits for complex behavior. IBL created a modular system architecture to address its data management challenges.
Environmental Molecular Sciences Laboratory (EMSL)
Globus Facilitated Data Access, Transfer and Multi-institutional Sharing
The Environmental Molecular Sciences Laboratory (EMSL) is a Department of Energy, Office of Science, User Facility funded by the Biological and Environmental Research program. EMSL set up a data management pipeline together with the Pacific Northwest Center for Cryo-EM (PNCC) and Oregon Health & Science University (OHSU) that enables researchers to build detailed 3D models and determine the atomic structures of biomolecules using cryo-electron microscopy (cryo-EM).
Cornell
Cornell Biotech Data Lifecycle Management, Onsite and in Amazon S3
Transfer
Globus facilitates file transfer for data lifecycle management, from generation to analysis platforms to archive. We use Globus to share data with users of our facility, either from our centrally managed bioinformatics infrastructure or from investigators’ own machines. We also use Amazon S3 endpoints to move data for archival storage; once in S3, lifecycle rules move the data to Amazon Glacier.
Argonne National Lab, Lawrence Berkeley National Lab, Oak Ridge National Lab, Brookhaven National Lab, Cold Spring Harbor Lab
DOE's KBase Program
Transfer
KBase is a data platform for the biological sciences that promises to improve scientific analysis and publication. One capability of KBase is to gather all scientific data related to a particular study in one place (facilitated by the Globus data management and file transfer system), making it easy for future researchers to track this data down and build upon existing work.
Case Western Reserve University
Sharing Data in an Instrument Workflow
The Globus transfer service enabled the data to be quickly transferred to the CWRU data center, thereby freeing up the storage space in the server connected to the microscope. Globus Connect Server was installed, and the Globus sharing capability allowed the facility manager to see the raw data, upload it to a Globus share endpoint, and share the data with researchers, both within CWRU and at other institutions. With the Globus Command Line Interface (CLI), the facility managers were able to automate the syncing and resyncing of the files to ensure that all newly created files off the Titan-Krios microscope were properly transferred.
NIH
Data Sharing via Globus in the NIH Intramural Program
Share, Transfer
The HPC facility at the National Institutes of Health’s Intramural campus has had a Globus endpoint since 2012. Globus is used routinely for data transfer along with other methods, but the Globus data-sharing capabilities in particular have been enthusiastically embraced by the NIH HPC users.
European Bioinformatics Institute
Data Distribution for European Bioinformatics Institute (EBI)
Transfer
EBI is one of the world’s leading providers of life science data to a global community. Globus supports moving data from where it is generated, to and from archives, and to where users wish to analyze it.
University of Chicago, Argonne National Laboratory
Brain Mapping and Neurobiology
Share, Transfer
The Kasthuri lab at the University of Chicago and Argonne is pioneering new techniques for brain mapping of the fine structure of the nervous system – ‘connectomics’ and ‘projectomics’. They hope to help answer questions like: how do brains learn as they grow up? And how do brains differ across individuals and across species? And how can we reverse engineer brain function in our own computers and robots? Globus is used by these researchers for file transfer and sharing.
Massachusetts General Hospital, Harvard University
Serving Neuroscientific Studies at Massachusetts General Hospital
Transfer
Globus facilitates intra-project data movement for neuroscientific studies across multiple acquisition sites, with eventual data publication at the Massachusetts General Hospital.
University of Illinois
Blue Waters at NCSA
Transfer
The Blue Waters team at the National Center for Supercomputing Applications utilizes Globus sharing so users can quickly, effectively, and securely share data sets with their research community or the broader public.
University of California - San Diego, Science Gateways Community Institute (SGCI)
COSMIC2 Cryo-EM Project
Data portal, Auth API, Transfer
Used Globus Auth to manage user identities and access to resources, so scientists can use an existing login (XSEDE, their university, Google, etc.) to access the gateway. Also used Globus to move terabyte-sized datasets to the gateway, so scientists can analyze them using the XSEDE HPC resource Comet at the San Diego Supercomputer Center (SDSC) and then review results on the portal.
Australian Research Data Commons
Measurement of Material Properties Using Microscopy Techniques and Globus to Transfer Large Datasets
Transfer
Several cryo-electron microscopes have been deployed in institutions across Australia to measure material properties. These instruments have generated huge volumes of data, and Globus was used to move the large datasets between various labs and HPC facilities across long distances.
AbbVie
Globus Connect Personal Assists with Internal Data Transfers in a Multinational Company
Transfer
AbbVie, a large multinational company focused on discovering and delivering transformational medicines and products and heavily involved in genomics projects, must manage explosive data growth. With distributed teams wanting to work on the same data sets, it became necessary to move and share large amounts of data. Globus Connect Personal is used for internal data transfers, letting users transfer data rapidly.
University of Pittsburgh Medical Center (UPMC)
Secure Health Sciences Research Informatics at University of Pittsburgh
Data portal, Share, Platform
UPMC’s Research Informatics Office (RIO) is responsible for clinical data extraction, transformation, honest brokering, and provisioning for hundreds of large and small research projects, as well as the infrastructure for HuBMAP in partnership with the Pittsburgh Supercomputing Center. To efficiently support these activities, particularly including protected data sharing with many investigators across multiple institutions, RIO has adopted the Federated Identity, Data Movement, Search and Group Management features of Globus.
New Zealand eScience Infrastructure (NeSI)
Building a Genomic Data Repository Using the Globus Platform
Auth API, Transfer, Platform
A few years ago NeSI adopted Globus as their de facto national data transfer platform for research.
-
Oak Ridge National Laboratory
Atmospheric Radiation Measurement (ARM) at Oak Ridge National Lab
Transfer
The Atmospheric Radiation Measurement Data Center (ADC) is a long-term archive and distribution facility for various ground-based, aerial and model data products in support of atmospheric and climate research. Users employ Globus for transferring terabytes of data from ADC to their home institutions, and ADC is using Globus for its operations including transferring data between clusters as well as disaster recovery.
Argonne National Lab
HACC Cosmology Simulation Data Portal
Data portal, Transfer, Platform
Scientists at Argonne are developing a Hardware/Hybrid Accelerated Cosmology Code (HACC) data portal to provide easy access to simulation products and analysis capabilities for the world’s cosmologists. The portal will use Petrel, a research data service at the Argonne Leadership Computing Facility (ALCF), offering fast data transfer and authentication via Globus.
University of Wisconsin, NERSC, Deutsches Elektronen-Synchrotron
IceCube Collaboration
Transfer
IceCube, the world’s largest neutrino detector, uses Globus to archive its data for long-term preservation. Users transfer data with Globus from Madison, Wisconsin to storage locations at NERSC in California and at DESY in Berlin.
National Center for Atmospheric Research
NCAR Research Data Archive
Share, Transfer
Since late 2014, the Research Data Archive (RDA) at the National Center for Atmospheric Research (NCAR) has used Globus services to support its online data portal. The Science DMZ network in place at NCAR allows Globus to deliver scalable, efficient, and reliable data transfers out of the RDA.
Texas Advanced Computing Center (TACC) at the University of Texas at Austin
NHERI DesignSafe-CI
Transfer, Platform
The Natural Hazards Engineering Research Infrastructure (NHERI) is a distributed, multi-user, national facility funded by the National Science Foundation (NSF). For moving large volumes of data, or large numbers of files (>50) or folders, the Globus tools are the recommended way of moving data in and out of NHERI’s DesignSafe data depot.
National Center for Supercomputing Applications (NCSA)
Terra Basic Fusion Project at NCSA / Blue Waters
Transfer
We downloaded 1.3 PB of L1B Terra data from various data centers to the Blue Waters system. Then we used Globus to integrate massive-scale processing between archive storage and computational resources to produce 2.5 PB of a new fused weather data product.
Southern California Earthquake Center (SCEC), NCSA Blue Waters, ORNL Titan
CyberShake Project for Predicting Earthquake Impact
Transfer
Researchers used Globus to transfer intermediate results between Titan and Blue Waters for a CyberShake project. Workflows ran for nearly a month, and during that time, Globus was used to transfer over 700TB between OLCF and NCSA.
Texas A&M, Louisiana State University, San Diego Supercomputer Center
Globus Auth and the S3 Connector with XSEDE Jetstream
Auth API, Platform
The XSEDE community and the Jetstream cloud service provider are using Globus Auth to simplify and streamline user authentication and add support for identity linking, especially for campus credentials. Jetstream also offers object storage that’s compatible with Amazon Web Services’ Simple Storage Service (S3), and researchers can easily and reliably move research datasets into and out of Jetstream’s object storage using the Globus S3 storage connector.
NCSA, Georgia Tech
Georgia Tech Astrophysics Research at NCSA
Transfer
An astrophysics professor at Georgia Tech uses Globus to retrieve 12 TB of data from NCSA Blue Waters long-term storage in under 10 minutes, achieving 30+ Gbps transfer rates, as part of ongoing work in simulating galaxy formation.
Texas A&M, Louisiana State University, San Diego Supercomputer Center
SIMULOCEAN Science Gateway
Platform
A research scientist at Texas A&M has been working to deploy a containerized coastal model on XSEDE resources, work that started at LSU. The goal is to develop and deploy enhancements into the SIMULOCEAN science gateway, integrating new Docker features of Bridges and Globus capabilities for authentication, file transfer and sharing. The PI also collaborated with SDSC.
Lawrence Berkeley National Lab, Argonne Leadership Computing Facility (ALCF)
Globus Accelerates Climate Data Movement
Transfer
In a recent paper entitled “An Assessment of Data Transfer Performance for Large‐Scale Climate Data Analysis and Recommendations for the Data Infrastructure for CMIP6,” experts from Lawrence Berkeley National Laboratory (Berkeley Lab) and ESnet (the Energy Sciences Network, http://www.es.net/) document the data transfer workflow, data performance, and other aspects of transferring approximately 56 terabytes of climate model output data for further analysis.
Earth System Grid Federation (ESGF)
Building the Next Generation Earth System Grid Federation and the Globus Platform
Transfer, Platform
ESGF is an international collaboration between many groups for the software that powers most global climate change research. The output data is used by scientists all over the world. A new team is working to modernize the data backplane and is developing a new architecture, ESGF2, based on the Globus platform. Globus Transfer is already being leveraged, and was recently used to make two redundant copies of the 7.5 PB of ESGF data by transferring the data via ESnet in under 90 days.
National Center for Atmospheric Research, University Corporation for Atmospheric Research
NCAR/UCAR Data Transfer and Sharing
Share, Transfer
The Computational Information Systems Laboratory (CISL) at NCAR recommends using Globus to transfer files between systems—for example, between XSEDE facilities and the resources that CISL manages. NCAR also uses Globus to distribute data from their Research Data Archive (RDA).
University of Colorado
UC Boulder Oceanography Study
Transfer
While these two phenomena are small in scale, preliminary research has shown that each can have a dramatic impact on the larger ocean and thus the climate of the entire planet. However, the model resolution required to study the two processes together is unprecedented: the project requires regular movement of massive data files.
Max Planck Computing and Data Facility
Globus simplifies data sharing with collaborators
Share
MPCDF has been using Globus as a production (DataHub) service for data transfer since 2017. Globus was implemented because it allows for fast, reliable and secure large-scale data transfer. It also gives researchers a smooth user experience through its “fire and forget” capabilities, where transferring large amounts of data through a point-and-click interface is trouble-free, even for datasets that are many GBs or even TBs in size.
NCSA
Blue Waters at NCSA
Share, Transfer
The Blue Waters team at the National Center for Supercomputing Applications utilizes Globus sharing so users can quickly, effectively, and securely...
-
National Energy Research Scientific Computing Center (NERSC), Argonne Leadership Computing Facility (ALCF)
ATLAS Experiment with Cori and Theta
The ATLAS experiment is a general purpose particle detector based at CERN’s Large Hadron Collider (LHC), the world’s largest and most powerful particle accelerator. ATLAS scientists use Globus to transfer data between Theta, the ALCF’s Intel-Cray system which serves as a bridge to exascale, and NERSC’s Cori supercomputer.
Department of Energy, Oak Ridge National Lab, Argonne National Lab
Quantum Predictions at DOE's CPSFM
Transfer
Scientists at the DOE’s Center for Predictive Simulations of Functional Materials (CPSFM) are focused on taking quantum Monte Carlo (QMC) modeling to the next level. Scientists use Globus to publish their findings to the Materials Data Facility at Argonne and to move data between DOE supercomputer centers, as well as between individual supercomputers and users’ own workstations.
-
University of Illinois, NCSA
The University of Illinois' Nano-manufacturing Hub
Search API, Platform
The University of Illinois’ Nano-manufacturing hub is working to implement a global, community-curated database of graphene growth recipes. They have created an application inside a HubZero instance that allows users to capture their recipes along with SEM images and spectrographic analysis of the sample. Each recipe is then submitted to MDF, the Materials Data Facility built on the Globus platform, where additional metadata is extracted and the dataset is indexed with Globus Search.
University of Chicago, University of Illinois Urbana-Champaign, Argonne National Lab
Materials Data Facility (MDF) Portal
Data portal, Platform
MDF has leveraged and built upon a variety of Globus services, including data publication, data search, transfer, and Globus Auth, for their materials science data discovery portal.
University of Chicago, Argonne National Laboratory
DLHub: Supporting Machine Learning in Materials Science
The Data and Learning Hub for Science (DLHub) serves as an automated facilitator and interconnection point for ML models and associated data transformation and analysis tools. It allows researchers to describe and publish such tools in ways that support discovery and reuse; run published tools over the network (with tools executed on a scalable hosted infrastructure); and link models, other tools, and data sources into complete ML/AI pipelines that can themselves be published, discovered, and run. DLHub relies on Globus services for data management.
-
University of Winnipeg
Globus Aids University in Efforts to Increase Crop Productivity through HPC and ML for Digital Agriculture
Dilbarjot and Michael Beck, members of the Physics and Applied Computer Science departments at the University of Winnipeg, are part of a team generating labeled datasets to train Machine Learning models to recognize specific features in images captured from crops that indicate a need for specific mitigation techniques, such as targeted pest controls, irrigation changes, etc.
Transportation Energy Resources from Renewable Agriculture Phenotyping Reference Platform (TERRA-REF)
Building an Advanced Crop Analytics Portal
Platform
Used Globus to build a portal that will provide open access to huge datasets that will guide breeding decisions, facilitate collaboration, and allow unprecedented data sharing.
-
Stanford
Working with Research Data at Stanford
Stanford researchers use Globus to simplify working with data and accelerate transfers.
National Science Foundation
Enabling the Open Storage Network
Share
The Open Storage Network (OSN) is creating a storage appliance which will read and write from disks at speed, with a capacity of about 1.5 PB. In conjunction with the National Data Service and Johns Hopkins University, Globus is helping to build a distributed storage platform for OSN, based on object storage with Globus Auth federated identity authorization to promote cross-institutional data sharing for OSN users.
University of South Dakota
South Dakota Data Store (SDDS)
The South Dakota Data Store (SDDS) provides over 1.2PB of capacity across a Sharing Tier for high-reliability, high-availability, network-accessible storage; and an Archival Tier hosted on a magnetic tape library for long-term offsite archival-grade storage. Globus provides the necessary authentication, data sharing, and transfer capabilities to make SDDS a truly statewide resource.
Oak Ridge National Laboratory
Using Shared Endpoints for Data Publication at Oak Ridge
Share
ORNL developed a Digital Object Identifier (DOI) service where the data resides at ORNL. Globus is used to let users construct their data, workflow, and additional information before submitting for publication. They are also using multiple shared endpoints to construct and retrieve DOIs, which are searchable in multiple ways, including through DataCite and ORNL’s internal service.
University of Michigan
Tiered Data Storage at University of Michigan
Share
Globus is the glue by which the University of Michigan moves data between services. It is their most recommended way to move data in bulk between services. In addition, their Bioinformatics Core uses Globus Connect to share data with clients.
University of Saskatchewan (USASK)
University of Saskatchewan's Research Storage Service
Data portal, Platform
USASK’s research storage service “DATASTORE” provides access via Globus for speed, automation, and more complex data management, as well as for connection with Compute Canada. They have also integrated Globus with their identity provider and created a customized USASK look-and-feel login page.
Harvard University
Modernizing Data Workflows at Harvard
Transfer
Harvard moves over 750 TB a year with Globus, in projects ranging from species migration studies to image processing.
Oak Ridge National Laboratory, OLCF
ORNL Science DMZ and Bridging CADES Workflows
Transfer, Platform
The Compute and Data Environment for Science (CADES) at Oak Ridge National Lab provides compute and data infrastructure resources for scientific discovery via a Science DMZ architecture based on Globus. Use case examples involve Atmospheric Radiation Measurement (ARM) Climate Research Facility, where Globus is used for simplified transfer and to create advanced workflows; the CrossBOW Project for big data operational workflows; and Genetics research.
Rice University
Rice University Using Globus for Google Drive
To facilitate reliable data movement and sharing for Rice researchers, Rice subscribes to Globus for research data management. Rice uses Globus for file transfer and sharing, and they also utilize the Globus for Google Drive connector to simplify data access and archiving with Google Drive.
University of Minnesota's Supercomputing Institute (MSI)
MSI HPC Data Archiving and Digital Preservation
Transfer
MSI implemented Spectra Logic Black Pearl storage with the Globus connector to solve the problem of costly and complex approaches to digital preservation and archiving for hundreds of terabytes of critical research data.
West Virginia University
West Virginia University Uses Globus to Move and Share Data
Share, Transfer
West Virginia University uses the Globus research data management platform to transfer data between campus storage resources and XSEDE centers across the U.S. The University’s researchers and research computing staff also use Globus to move data across WVU’s science DMZ to other locations on campus, such as lab systems or researchers’ personal systems.
Rice University
Rice Relies on Globus for Data Management and Cloud Archiving
Share, Transfer
To facilitate reliable data movement and sharing for Rice researchers, the CRC subscribes to Globus for research data management. Rice uses Globus for file transfer and sharing, and they also utilize the Globus for Google Drive connector to simplify data access and archiving with Google Drive.
National Data Service (NDS), National Center for Supercomputing Applications (NCSA), San Diego Supercomputer Center (SDSC)
National Data Service Adds Cloud-Based Interactive Compute Capabilities
Auth API, Platform
The National Data Service’s Labs Workbench platform, built on Globus, is a scalable, web-based system intended to support turnkey deployment of encapsulated data management and analysis tools to support exploratory analysis and development on cloud resources that are physically “near” the data and associated high-performance computing (HPC) systems.
-
University of Chicago, Argonne National Laboratory
Molecular Data in Google Drive
A group in Japan needed to share a set of 221M molecular calculations, clocking in at 830GB, with collaborators at UC/ANL. This dataset represents the largest compilation of molecular calculations to date, with a broad set of machine learning opportunities ranging from drug discovery to identification of novel molecules for organic electronics. By using the Globus for Google Drive connector, they were able to easily make the data available in Google Drive via fire-and-forget transfer – they estimate this method was at least 10x faster than any other means of sharing the data.
Argonne National Lab
Simplifying Research with Globus for Box
Transfer, Globus for Box
The Box connector feature makes my research much easier. Without it, here is my process to get data from Box (that my collaborators shared with me) to the Argonne Leadership...
Clemson University
Machine Learning in the Cloud at DICE Lab
Transfer, Platform
The Data Intensive Computing Ecosystems (DICE) lab at Clemson University is utilizing CloudyCluster for scientific computations in the area of scalable machine learning, and in particular, in topic modeling.
Argonne National Lab
Materials Science Artificial Intelligence (AI) Project Supports Data Service Initiative
Share, Transfer, Platform
Data Science and Learning Division (DSL) scientists have undertaken a project to develop a lab service for interactive, scalable, reproducible data science, leveraging machine learning methods to reduce simulation costs and increase data quality and value for researchers.
University of Queensland, Australia's Academic and Research Network (AARNet)
Global Collaborations Using Machine Learning with MRI Images
Transfer
The University collaborates globally on the use of machine learning in an effort to speed up the reconstruction and affordability of Magnetic Resonance Images (MRI). The secure and fast data sharing enabled by Globus allowed for the rapid transfer of large data sets which made collaborations easier and faster.
Argonne National Lab
End-end AI Frameworks for Scientific Discovery at Scale
Globus Compute
Scientists wanted to study gravitational waves, and as a result of the exponential growth of data, days of supercomputing time were needed to keep up with all the data. Models were trained using supercomputers and were then shared and accessed through DLHub. Users were then able to reuse and refine the existing models in combination with funcX.
-
Leibniz Supercomputing Centre (LRZ)
LRZ Data Science Storage Service
Share, Transfer, Platform
LRZ’s Data Science Storage (DSS) service solves the demands and requirements of data intensive science with a data-centric management approach built with Globus. With DSS, LRZ researchers can store, access, transfer and share data worldwide.
University of Chicago - Booth School of Business
Kilts Center Relies on Globus Sharing for Nielsen Data Distribution
Share, Transfer
Kilts Center turned to Globus to provide the secure data distribution platform they needed, as well as the portal for data access and sharing. Now it is easy for Kilts Center to share new Nielsen datasets with their many researchers with just a few clicks: researchers request to join the distribution and receive permissions to access a storage endpoint where the data lives; then they simply log in using their institutional credentials and access the data files.