Web archiving is a solved problem, you record the website in an interactive environment [1], and everything what happens on-screen will be saved in a single file in an open [2] and standardized [3] file format authored by Internet Archive and endorsed by Library of Congress for preservation [4].
You can store the resulting WARC file wherever, be it on S3 or under your pillow.
As an archivist, I urge everybody here to not reinvent the wheel, please..
The technical side is a solved problem, the legal side not so much.
You practically cannot preserve and make available something if the copyright holders don't want you to. If the copyright situation is complicated you bear the risk.
You can say, that this is how it is supposed to be, but it is not like it works in the non-digital realm. You could argue that we'd need something like digital monument protection, where artifacts can be preserved against the copyright holders will.
The film has been digitized by us (Národní filmový archiv) to make screening DCP for the Pordenone Silent Film Festival, where the film will be unveiled.
The raw scans were also provided to our counterparts in Cinemateca Brasiliera who are now in charge to decide about the film's fate.
I surely hope they will make it accessible under a non-permissible license.
The scan of duplicate negative was done in 2K (2048x1556).
We had considered doing the scan in 6K, but the tests proved it unnecessary as it didn't yield any more detail except more grain from the film emulsion.
An example of a nonpermissive license is the General Public License (GPL) that ensures users have the freedom and responsibility to share their changes with the community.