A community data repository and curation service for Structure, Models, Therapeutics, Simulations related computations for research into the SARS-CoV-2 / COVID-19 / ”Coronavirus” pandemic maintained by The Molecular Sciences Software Institute (MolSSI) and BioExcel. This virus has spurred the entire scientific world into action looking for treatments and answers to help minimize its spread, its symptoms, and its fatalities. To help researchers and the community quickly provide their resources and expertise, this repository is meant to serve as a curated central repository of MM and MD input files, trajectories, analysis scripts, well-validated algorithms, and related data for immediate community access.
This project may serve as a prototype into a future, more general Hub of structural and simulation Data project. There may come a time as this project evolves that it naturally does become more general (or change to another specific focus), however, this project focuses on the SARS-CoV-2 pandemic for now and future evolutions can be dealt with as they arise.
A Central Reference for Community Simulation Data
The nature of this pandemic requires rapid, and flexible response. This repository, and the data within are designed to get the information out quickly. The repo seeks to provide the following information in a centralized location:
- Structures
- Provenance of structure including any modifications made. Will accept manually written provenance. If modifications made by program, please include script along with pre-modified files and list of required programs to re-generate.
- Models
- Extensions of the structures which are ready to go in a drug discovery Pipeline
- Therapeutics
- Known small molecules, drug-likes, and other therapeutics which have been tested for treating this virus
- Simulation Configuration Files
- Programs required to run (minimal subset which gets all dependencies, with minimal versions),
- Basic script and/or working instructions to execute these configuration files,
- Structure files (required for non-internet accessible structures, e.g. not required for things on the Protein Data Bank)
- Analysis Data
- Complete set of any of the previous data and exact details for replication
- Analysis scripts with placeholders for output trajectory files.
- Pointers to the trajectories if possible
- Trajectories - Not accepted in their entirety to the GitHub repository,
- Pointers will be accepted and noted. (contact information, where it’s stored, how to request, URLs, etc)
- Program used to generate this trajectory.
- Structures (or links there to) which were used to initiate the simulation which yielded this trajectory.
- Config files and or scripts which were used to run the simulation that generate the trajectory (if possible)
- Papers - as links
- Links to preprints and publications
- Inclusion of the files for the paper if they fall into the other categories above.
- Notation of any simulation package or software used to get the data in the paper
- Papers from multiple groups should be added in multiple folders (see Contributing below)
- Movies - hyperlinks only, please do not include full movie files.
- Links to long term-hosted movies (e.g. YouTube).
- Structures (or links there to) which were used to initiate this movie.
- Config files and or scripts which were used to run the simulation that generate the trajectory (if possible)
- The trajectory which this movie references (if possible, see rules on Trajectories above)
- Algorithms - Well-validated and easily deployable algorithms such as functional Monte Carlo and machine learning methods and heuristic property calculators
- Code Snippets
- Programing Language (if restricted)
- List of required software and minimal dependencies (if any)
Due to the rapidly evolving nature of this hub, organization and requirements may change with time. However, no changes to the repository will remove user-submitted data. The maintainers may reach out to the original contributor to ask for additional information as needed.
The Website Itself
All content for this website (excluding external links) exist on a single Github repository. Data, HTML pages, CSS, etc. all live there.
This page is generated automatically from the GitHub through by Hugo and deployed through Netlify.
The symbols used to indicate “publication” and “preprint” are adapted from Thomas Shafee through Wikimedia Commons and are reused under its CC BY 4.0 license.