Everything you need to know about GitHub Arctic Code Vault
You might have recently seen something like this in your GitHub account or other GitHub account and be wondering what is this all about.
So let’s take a look at what is an Arctic Code Vault Contributor and who are the ones who gets this batch
GitHub, the world’s largest open-source platform for software and programs has safely locked the data of huge value and magnitude in a coal mine in Longyearbyen’s Norwegian town in the Arctic region.
Back in November 2019, GitHub Arctic Code Vault was first announced.
The GitHub Arctic Code Vault is a data repository preserved in the Arctic World Archive (AWA), a very-long-term archival facility 250 meters deep in the permafrost of an Arctic mountain. The archive is located in a decommissioned coal mine in the Svalbard archipelago, closer to the North Pole than the Arctic Circle.
Last year, GitHub said that it plans to capture a snapshot of every active public repository on 02/02/2020 and preserve that data in the Arctic Code Vault.
The project began on February 2, when the firm took a snapshot of all of GitHub’s active public repositories to store them in the vault. They initially intended to travel to Norway and personally escort the world’s open-source technology to the Arctic but their plans were derailed by the global pandemic. Then, they had to wait until 8 July for the Arctic Data Vault data to be deposited.
GitHub announced that the code was successfully deposited in the Arctic Code Vault on July 8, 2020. Over the past several months, GitHub worked with its archive partners Piql to write the 21TB of GitHub repository data to 186 reels of piqlFilm (digital photosensitive archival film).
GitHub’s strategic software director, Julia Metcalf, has written a blog post on the company’s website notifying the completion of GitHub’s Archive Program on July 8th. Discussing the objective of the Archive Program, Metcalf wrote “Our mission is to preserve open-source software for future generations by storing your code in an archive built to last a thousand years.”
The Arctic Code Vault is only a small part of the wider GitHub Archive Program, however, which sees the company partner with the Long Now Foundation, Internet Archive, Software Heritage Foundation, Microsoft Research, and others.
How the cold storage will last 1,000 years
Svalbard has been regulated by the international Svalbard Treaty as a demilitarized zone. Home to the world’s northernmost town, it is one of the most remote and geopolitically stable human habitations on Earth.
The AWA is a joint initiative between Norwegian state-owned mining company Store Norske Spitsbergen Kulkompani (SNSK) and very-long-term digital preservation provider Piql AS. AWA is devoted to archival storage in perpetuity. The film reels will be stored in a steel-walled container inside a sealed chamber within a decommissioned coal mine on the remote archipelago of Svalbard. The AWA already preserves historical and cultural data from Italy, Brazil, Norway, the Vatican, and many others.
What’s in the 02/02/2020 snapshot?
The 02/02/2020 snapshot archived in the GitHub Arctic Code Vault will sweep up every active public GitHub repository, in addition to significant dormant repos.
The snapshot will include every repo with any commits between the announcement at GitHub Universe on November 13th and 02/02/2020, every repo with at least 1 star and any commits from the year before the snapshot (02/03/2019–02/02/2020), and every repo with at least 250 stars.
The snapshot will consist of the HEAD of the default branch of each repository, minus any binaries larger than 100KB in size — depending on available space, repos with more stars may retain binaries. Each repository will be packaged as a single TAR file. For greater data density and integrity, most of the data will be stored QR-encoded and compressed. A human-readable index and guide will itemize the location of each repository and explain how to recover the data.
The company further shared that every reel of the archive includes a copy of the “Guide to the GitHub Code Vault” in five languages, written with input from GitHub’s community and available at the Archive Program’s own GitHub repository.
The archive will also include human-readable reel which documents the technical history and cultural context of the archive’s contents, which the company calls as the Tech Tree. It will primarily consist of the existing works, selected to provide a detailed understanding of modern computing, open-source and its applications, modern software development, popular programming languages, etc.
What is the reason for doing this?
This project aims to preserve open-source software for future generations by storing it in an archive built to last a thousand years.
They hope that one day, the open-source data can be used by historians or future civilizations to understand the dawn of computing: the present.
In addition to the repositories, GitHub also saved a few classic works of humanity and an introductory letter in case it’s discovered after an apocalypse, or by aliens, or by something that doesn’t know much about present humanity. “This archive, the GitHub Code Vault, was established by the GitHub Archive Program, whose mission is to preserve open-source software for future generations”.
Who gets this batch?
The snapshot included any public repository that had at least 250 stars, that had at least one star and had been updated in the past year, or that had no stars but had been updated in the previous eighty days. If you’ve ever uploaded to GitHub, you probably had got your name and a creation stored in the arctic. Clicking on the Arctic Code Vault Contributor badge in the highlights section of a profile will reveal which of a user’s projects were saved in this snapshot.
GitHub created the Arctic Code Vault Badge to honor the millions of developers worldwide who contributed to the open-source project. This badge is displayed in the highlights section of the developer’s GitHub profile.
So if you have the Arctic Code Vault Contributor badge then congratulations your code or project will be safe for 1000years at least and hopefully, someone in those times would find it useful.
Have a look at this video and see where your code/project is stored and how they are stored-
GitHub Archive Program: the journey of the world’s open source code to the Arctic
At GitHub Universe 2019, we introduced the GitHub Archive Program along with the GitHub Arctic Code Vault. Our mission is to preserve open source software for future generations by storing your code in an archive built to last a thousand years.
Your code is safe and sound in the Arctic
At GitHub Universe 2019 , we introduced the GitHub Archive Program along with the GitHub Arctic Code Vault . Our mission is to preserve open source software for future generations by storing your code in an archive built to last a thousand years.
On February 2, 2020, we took a snapshot of all active public repositories on GitHub to archive in the vault. Over the last several months, our archive partners Piql , wrote 21TB of repository data to 186 reels of piqlFilm (digital photosensitive archival film). Our original plan was for our team to fly to Norway and personally escort the world’s open source code to the Arctic, but as the world continues to endure a global pandemic, we had to adjust our plans. We stayed in close contact with our partners, waiting for the time when it was safe for them to travel to Svalbard. We’re happy to report that the code was successfully deposited in the Arctic Code Vault on July 8, 2020.
Join us as we follow the code in its journey to the Arctic, and take a look at a few other things we’ve been up to here at the GitHub Archive Program.
The journey of the world’s open source code to the Arctic Circle
Your code’s journey begins in Piql’s facility in Drammen, Norway where the boxes with 186 film reels were shipped to Oslo Airport and then loaded into the belly of the plane which provides passenger service to Svalbard. Svalbard, roughly 600 miles (1000 km) north of the European mainland, just recently opened up to visitors from countries within the Schengen Area and the European Economic Area.
The code landed in Longyearbyen, a town of a few thousand people on Svalbard, where our boxes were met by a local logistics company and taken into intermediate secure storage overnight. The next morning, it traveled to the decommissioned coal mine set in the mountain, and then to a chamber deep inside hundreds of meters of permafrost, where the code now resides fulfilling their mission of preserving the world’s open source code for over 1,000 years.
Introducing the Arctic Code Vault Badge
Millions of developers around the world contributed to the open source software now stored in the Arctic Code Vault. To recognize and celebrate these contributions, we designed the Arctic Code Vault Badge, which is shown in the highlights section of a developer’s profile on GitHub. Hover and you can discover some of the repositories an individual contributed to.
An update from our Archive Program partners
Internet Archive
The Internet Archive is a well-known, widely beloved non-profit digital library which provides free public access to collections of digitized materials. In partnership with the GitHub Archive Program, the Internet Archive (IA) commenced its ongoing archive of GitHub public repositories on April 13 of this year. At present, IA is using a two-pronged approach. First, their well-known Wayback Machine is accessing and archiving raw GitHub data as WARCs, or Web ARChive files. As of this writing they have archived some 55TB of data . Second, they have the goal of making entire archived GitHub repositories available via “git clone,” while also keeping repo comments, issues, and other metadata easily accessible on the web. This second initiative is well underway and initial archiving is expected to commence this month.
Software Heritage Foundation
Software Heritage is a non profit, multi-stakeholder initiative launched by Inria in collaboration with UNESCO with the goal to collect, preserve and share the source code of our software commons. They already archive more than 130 million projects , with their full development history, and we are delighted to announce that 100 million of these are from GitHub. Thanks to the collaboration announced at GitHub Universe 2019 , the archival engine is being improved with the goal to keep it up to speed with GitHub ‘s growth, but if the project you are interested in, or its latest version, is not archived yet, you do not need to wait, it’s easy to trigger its archival right now in a few clicks on https://save.softwareheritage.org .
Project Silica
Project Silica is developing the first storage technology designed and built from the media up for cloud-scale storage of long-lived data. By leveraging recent discoveries in ultrafast laser optics, data is stored in quartz glass, through a process that permanently changes the physical structure of the glass material. Quartz glass is a durable storage media that offers unparalleled data lifetimes of upwards of tens of thousands of years. It is resilient to electromagnetic interference, water, and heat, making it the ideal storage medium for ensuring the world’s open source software is forever preserved for future generations. As a partner in the GitHub Archive Program , Project Silica is committed to driving storage innovation, and developing a storage technology that addresses the need for a sustainable and reliable storage technology for the world’s long-lived data. We’ve archived 6,000 of the world’s most popular repositories as a proof of concept for future archives.
What’s next?
Code, culture, history, and technology: The Tech Tree
Every reel of the archive includes a copy of the “Guide to the GitHub Code Vault” in five languages, written with input from GitHub’s community and available at the Archive Program’s own GitHub repository . In addition, the archive will include a separate human-readable reel which documents the technical history and cultural context of the archive’s contents. We call this the Tech Tree.
Inspired by the Long Now’s Manual for Civilization , the Tech Tree will consist primarily of existing works, selected to provide a detailed understanding of modern computing, open source and its applications, modern software development, popular programming languages, etc. It will also include works which explain the many layers of technical foundations that make software possible: microprocessors, networking, electronics, semiconductors, and even pre-industrial technologies. This will allow the archive’s inheritors to better understand today’s world and its technologies, and may even help them recreate computers to use the archived software.
Encapsulating the world’s cultural context and technical history is a challenging prospect, and we expect the Tech Tree to evolve and iterate over time. We will soon publish to the Archive Program’s GitHub repository a very initial draft list of works selected for the Tech Tree, along with, importantly, a request for community input. We look forward to incorporating ideas and suggestions from the GitHub community before the Tech Tree is added to the Arctic Code Vault.
Полный список значков и достижений на GitHub
Отображение значков достижений в вашем профиле совершенно необязательно.
По умолчанию достижения может видеть любой, кто просматривает ваш общедоступный профиль.
Вы можете отказаться от отображения достижений в своем профиле, перейдя в Настройки профиля.
Значок | Название | Можно получить? | Как получить? |
---|---|---|---|
![]() |
Heart On Your Sleeve | Проходит тестирование | Отреагируйте на что-либо на GitHub с помощью эмодзи ❤️. |
![]() |
Open Sourcerer | Проходит тестирование | Вы сделали PR в несколько общедоступных репозиториев и эти PR были смёржены. |
![]() |
Starstruck | Да | Созданный вами репозиторий должен получить 16 звёзд или больше. |
![]() |
Quickdraw | Да | Выдаётся, если вы хотя бы один раз закрыли issue или смёржили pull request в течение 5 минут после открытия. |
![]() |
Pair Extraordinaire | Да | Был смёржен один или несколько pull request , который вы делали в соавторстве с другими разработчиками. |
![]() |
Pull Shark | Да | Было принято (смёржено) два открытых вами pull request -а (или больше). |
![]() |
Galaxy Brain | Да | Автор дискуссии принял два (или больше) ваших ответа (нажал Mark as answer ). |
![]() |
YOLO | Да | Выдаётся, если хотя бы один ваш pull request был принят без замечаний (автор не написал ни одного треда и смёржил правки). |
![]() |
Public Sponsor | Да | Выдаётся, если вы хотя бы один раз спонсировали opensource-проект или разработчика на GitHub. |
![]() |
Mars 2020 Contributor | Нет | Сделали вклад в один из репозиториев проекта Mars 2020 Helicopter Mission. Выдавался за хотя бы 1 случай вклада. |
![]() |
Arctic Code Vault Contributor | Нет | Ваш код был включён в программу 2020 GitHub Archive Program. Выдавался за хотя бы 1 случай вклада. |
Градация некоторых достижений
Для получения большинства значков необходимо произвести некоторое действие однократно, но в ряде случаев действие необходимо произвести несколько раз.
Достижение | По-умолчанию | Бронзовый | Серебряный | Золотой |
---|---|---|---|---|
Starstruck | ![]() |
![]() |
![]() |
![]() |
16 stars | 128 stars | 512 stars | 4096 stars | |
Pair Extraordinaire | ![]() |
![]() |
![]() |
![]() |
1 PR | 10 pull requests | 24 pull requests | 48 PR | |
Pull Shark | ![]() |
![]() |
![]() |
![]() |
2 PR | 16 pull requests | 128 pull requests | 1024 PR | |
Galaxy Brain | ![]() |
![]() |
![]() |
![]() |
2 answers | 8 answers | 16 answers | 32 answers | |
Heart On Your Sleeve | ![]() |
![]() |
![]() |
![]() |
. | . | . | . | |
Open Sourcerer | ![]() |
![]() |
![]() |
![]() |
. | . | . | . |
Все остальные значки на GitHub даются за однократное выполнение условий.
Цвет значков и тон Emoji
Внешний вид некоторых значков зависит от предпочитаемого вами тона кожи Emoji.
Вы можете изменить предпочитаемый тон кожи, перейдя в Настройки внешнего вида.
Badge | ||||||
---|---|---|---|---|---|---|
Starstruck | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
Quickdraw | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
Другие значки на GitHub
Значок | Название | Как получить? |
---|---|---|
Pro | Используйте тарифный план GitHub Pro | |
Discussion answered | Ваш ответ в дискуссиях был отмечен как правильный | |
Developer Program Member | Станьте зарегистрированным участником программы GitHub Developer Program | |
Security Bug Bounty Hunter | Помогали в поиске уязвимостей на GitHub Security | |
GitHub Campus Expert | Принимали участие в GitHub Campus Program | |
Security advisory credit | Ваши рекомендации по безопасности были приняты в GitHub Advisory Database |
Есть предложения?
Если у вас есть вопросы или предложения, как улучшить информацию на этой странице, вы всегда можете написать в issues.
GitHub разместил свой архив в арктическом хранилище Arctic World Archive
Компания GitHub рассказала в своем блоге, что 8 июля 2020 года архив открытых исходных кодов сервиса был успешно размещен в арктическом хранилище Arctic World Archive на острове Шпицберген.
Чтобы заархивировать и перевести на физических носителях весь GitHub понадобилось более пяти месяцев кропотливой работы. 2 февраля 2020 года специалисты компании сделали копию всего открытого исходного кода, хранившегося на сервисе — это вклад работы более 37 миллионов пользователей, который включает около 100 миллионов активных публичных репозиториев.
Разработчики, код которых попал в текущую версию архива, помечены в профиле на GitHub специальной меткой «Arctic Code Vault Contributor». Проверьте свои аккаунты в GitHub, вы можете быть приятно удивлены этой ачивке.
Объем архива составил 21 ТБ. Далее эти данные были перенесены в виде фотоснимков на 186 пленочных накопителей piqlFilm. Это 3500-футовые (1066-метровые) пленочные катушки с галогенидами серебра в полиэфире, которые созданы специально норвержской компанией Piql. Они могут хранить без потерь большие объемы информации сотни лет, максимум заявлено — 1000 лет.
Предполагалось, что команды GitHub полетит в Норвегию, где должна была присоединиться в городе в Драммен к специалистам Piql, чтобы вместе сопровождать перевозку столь ценного груза до подземного хранилища. Однако, из-за пандемии, пришлось изменить часть этого плана, а также отодвинуть сроки отправки. Только в начале июля 2020 года 9 больших коробок с 186 пленочными барабанами были отправлены в аэропорт Осло, а затем были доставлены на Шпицберген. Далее архив GitHub оказался в городке Лонгйире, где местная логистическая компания оказала услуги по его промежуточному безопасному хранению. Фактически, только 8 июля 2020 года архив с пленками GitHub был успешно депонирован в арктическое хранилище в бывшей угольной шахте на глубине 250 метров в вечной мерзлоте.
Piql приложила к архиву специальное руководство, которое позволяет определить местоположение каждого проекта и объясняет, как восстановить данные. Каждая катушка включает в себя копию руководства по кодовому хранилищу GitHub на пяти разных языках мира. Одна катушка содержит техническую историю и культурную ценность содержимых сведений. Кроме того, в архиве представлены работы, в деталях объясняющие основы современной информатики, вычислений и разработки открытого кода. По словам инициаторов проекта, это позволит будущим поколениям понять, каким был современный мир, и даже воссоздать компьютеры и технологии в случае необходимости.
На случай возникновения проблем с хранилищем «Arctic World Archive», GitHub рассматривает возможность создания дублирующих архивов длительного хранения.