File Repositories
This page describes the RepositoryV1
and RepositoryV2
, which are file systems. See All Repositories for details.
Table of Contents
RepositoryV1 vs RepositoryV2
RepositoryV1
RepositoryV1 is the legacy storage backend for files. It is simply a file system based on how file paths are given in the Solidity metadata. file.
An example metadata source file paths look like this for the "full_match" contract 0x801f3983c7baBF5E6ae192c84E1257844aDb4b4D
on Ethereum Mainnet (1):
{
"sources": {
"erc20/IERC20.sol": {
"keccak256": "0xa38ec4e151e4d397d05bdfb94e6e4eb91e57a9fca3bc1c655289a4adf31a58fa",
"license": "MIT",
"urls": [
"bzz-raw://312e850e36efbf0f2450896c213b23dc0a28150e051bcbf933a8b9211627c44b",
"dweb:/ipfs/QmWsyisPjDwTJrTMhsGZa4JHiCS63mWfsyVQKbaijWGdmK"
]
},
"erc20/airdrop.sol": {
"keccak256": "0xea27a3e2c4179a064caf9fe9a198addd526fd1d1ea467ea474a0c069e6eac957",
"urls": [
"bzz-raw://6a86bc69b99876768bdbddba504410cf60b33681e1203a36d98840bf2ab8a42b",
"dweb:/ipfs/QmRZSqNfAPduoPoUJ6BM4NpBTbTKBqg5Mz5YBNpaUz4TfQ"
]
}
},
}
These files will be like below (see in repo):
contracts/full_match/1/0x801f3983c7baBF5E6ae192c84E1257844aDb4b4D/
├── metadata.json
└── sources/
└── erc20/
├── IERC20.sol
└── airdrop.sol
The problem with this is the part "erc20/airdrop.sol"
is not necessarily a valid file path but a "source unit name" in Solidity, i.e. arbitrary strings. This may cause issues on file systems as well as when pinning to IPFS.
RepositoryV2
RepositoryV2 is the format where we normalize the file names with their keccak256 hashes (source files must have a keccak256
field in the metadata). So the example above would look like this:
contracts/full_match/1/0x801f3983c7baBF5E6ae192c84E1257844aDb4b4D/
├── metadata.json
└── sources/
└── 0xa38ec4e151e4d397d05bdfb94e6e4eb91e57a9fca3bc1c655289a4adf31a58fa
└── 0xea27a3e2c4179a064caf9fe9a198addd526fd1d1ea467ea474a0c069e6eac957
The files are exactly the same so their IPFS hashes will not change, and you can look up the metadata file to find the original path-alike source unit names.
IPFS
Unfortunatelly publishing under IPNS is temporarily disabled. This is because of the difficulty of managing the whole filesystem over IPFS (with MFS etc.) and updating the IPNS regularly.
We still pin all the files on IPFS so you can access them over their individual CIDs (e.g. QmVij3h9z536ZG5cRpUmTfdoN9KR1Xp4ix2P7to9dPHgE5
).
Look at the Download section to learn how to download the whole repository.
Web
Moved to repo.sourcify.dev.
Download
We compress the RepositoryV2 weekly and publish on Cloudflare R2 under https://repo-backup.sourcify.dev ( .dev
redirects to .app
domain, which also belongs to Sourcify).
repo-backup.sourcify.dev will redirect to a manifest.json
file:
manifest.json
{
"description": "Manifest file for when the Sourcify file repository was uploaded",
"timestamp": 1726030203254,
"dateStr": "2024-09-11T04:50:03.254904Z",
"files": [
{
"path": "/sourcify-repository-2025-01-06T03-00-24/full_match.1.00.tar.gz"
"sizeInBytes": 2097152000
},
{
"path": "/sourcify-repository-2025-01-06T03-00-24/full_match.1.01.tar.gz"
"sizeInBytes": 800472503
},
...
]
}
The files are split by match type, chain and the first byte of the contract address.
E.g. /full_match.1.2F.tar.gz
will contain all the full_match contracts for chain 1 starting with 0x2F
.
You can download all files in the files
array and unzip them:
Download the manifest file (
-L
to follow redirects):curl -L -O https://repo-backup.sourcify.dev/manifest.json
Extract file paths and download each file:
jq -r '.files[].path' manifest.json | xargs -I {} curl -L -O https://repo-backup.sourcify.dev/{}
Concatenate the downloaded parts:
cat sourcify-repository-*.part.gz.* > sourcify-repository.gz
Unzip the concatenated file:
tar -xvzf sourcify-repository.gz