Explaining File Compression Formats
Explaining File Compression Formats
Advertisement

LEAVE YOUR COMMENT

LATEST COMMENTS

@ExplainingComputers Says:
As some people have noted below, in the word cloud "bz2" appears incorrectly as "b2z". But all is correct in the BZip 2 section at 7:11. And I have corrected the thumbnail. Also note that GZip compressed TAR files have the extensions tgz or tar.gz, not tar.tgz, as noted in the third bullet point around 7:53. My apologies. This video was checked, corrected and re-rendered many times . . . but at the moment, things like this are slipping through. :(
@DopeXen Says:
6:43
@genericelaf1007 Says:
Really love the style of this man.!!
@dannyboots Says:
Zip, 7z, and RAR I use all the time
@ankitkumarsingh6185 Says:
WinUHA should have been mentioned
@desiredditor Says:
7zip is the best app
@k4piii Says:
Sorry about this ,but your voice is perfect to sleep
@lucasmoratoaraujo8433 Says:
I love this guy! And this hairstyle is not easiest one to pull off, but does it nonetheless. Thanks for sharing you knowledge with the world! ❤
Says:
Great explanation, thanks!
@sebastjansslavitis3898 Says:
mostly useless thing nowadays, the biggest files are pictures and videos and they don't compress. it works great on text files, but I don't know how much text you need to worry about space... maybe if you own a library :D password protection and archiving is only thing that matters
@JfromUK_ Says:
Great video! I knew some of this but it filled in a lot, and was so complete, well paced and expertly put together. Big 👍🏻 from me, as a newbie to your channel 😊
@americanswan Says:
I'm conflicted. I love RAR. It offers great features. But I genuinely don't have much need for compressing anything.
@IamTheHolypumpkin Says:
Sometimes file compression can achieve amazing results. With xz. I was able to compress a diskimage from 20 GiB to about 2 GiB.
@nico3534 Says:
this one of the craziest intro tracks iv ever heard
@elblanco5 Says:
I install the free 7-zip on windows almost as soon as the updates for the OS finish. It's great, supports a ton of formats, it's free and source is available. There's also version for Linux and MacOS.
@xeridea Says:
Compression is still used commonly on the internet. Generally web pages compress the HTML, CSS, JS, etc files on the fly to the browser (using fast compression option), as it still generally saves time loading, and is useful for your phone on mobile data, and those still on DSL. It was a huge deal in the 80s though, connecting to BBSs over slow dialup, compression was a godsend.
@Jeroen74 Says:
No ARJ 😢
@SegNode Says:
I almost clicked off the video at the beginning since the intro was a little slow and boring, but I'm glad I stayed around because the rest of the video held my attention well and was very informative 😊. Thank you for making it.
@ShadowKestrel Says:
one curious strength of gzip is that both compression and decompression support streams, whereas most formats require entire files at a time. This is especially useful for sending lots of data over networks
@jsrodman Says:
Comparing xz vs 7zip is just comparing your settings since they use the same format.
@jsrodman Says:
its a mistake to suggest reliance on password protected zip files. The method used in pkzip is defeatable in fractions of a second. The method used in winzip is better, but as winzip (into zip really) has been unmaintained for 20 years, it too is defeatable in a short timescale, likely hours even with a good password on a single machine.
@jamesmnguyen Says:
My favorite thing about data compression is that the resulting data is generally more random looking. Or to put it another way, pure random data cannot be compressed.
@lidarman2 Says:
We did a clever trick at work for 8 bit lidar data compression. We take a chunk of data which in this case is a set of successive time series values, load them in a 2D array and save as a PNG file. This reduces the data storage size losslessly but also allows one to look at the thumbnails and visualize the data when browsing the data set.
@feynstein1004 Says:
4:55 You say that like it's a bad thing 😁
@kenaitchison7712 Says:
This video is painful to listen to to my American ears.
@El-Ge Says:
For so many years I am using 7z for it is free
@pierrekilgoretrout3143 Says:
thank you, interesting summary! You confirmed my choice of 7zip for compactness, and zip for openness (my partners can open without additional sw) If everybody is using windows I can also create a self-extractive 7z archive password protected, however not everybody will trust an exe!
@gosnooky Says:
Compression is more prevalent than most people realize. Nearly every request through the web for text-based files (HTML, CSS, JSON, JavaScript, Plain text) is gzip (or Brotli) compressed on the server and decompressed automatically by the browser. It can save tons of money when using cloud-based services where bandwidth usage is metered, or on people's metered data connections on mobile. The system I work on needs to send tens of megabytes of text-like files every second from one server to many mirrored (load-balanced) smaller cloud nodes, so everything is compressed. Our AWS bill for this particular service would be 15x higher without compression. It's also good for the planet - smaller files, less network attenuation, less power usage.
@sirkerzenhalter8191 Says:
Win.RAR and its never ending trial And we love it
@belgiumball2308 Says:
did bro seriously download winzip?
@meowthx1985 Says:
Well Chris it appears 7-Zip 24.00 beta just been released. Also I was provided WinZip for my Windows 95 when I bought the Amiga Emulator in 1998 from HMV. WinZip was good but after being took over by Corel and having to pay for the registered version is off putting and it doesn't compress as much as .rar nor .7z. The thing is I use 7-zip with its ability to compress my Counter-Strike maps as .bz2 for my FastDL server at ultra compression mode. But the question is will we get better compression to reduce our files at the very minimum? Besides .gzip and .zip have very poor compression results, which is why I use .7z for my PC and .tar.xz for my Linux for archiving.
@doragonmeido Says:
where is peazip?
@TheReptileDragons Says:
I dont understand why there are commercial compression or archiving methods or programs. Why would someone create a file that requires you to buy software to decompress it when there are several alternatives that are free and have faster compression/decompression and/or higher compression ratio?
@flyingbluelion Says:
lossless compression sometimes produces bigger files. Lossy compression will always make a smaller file. Choose wisely. As always, for best results in technology, investigate WHY!
@philosoaper Says:
WinRAR was the first piece of PC software I ever bought that wasn't a game. I don't remember exactly when but I think I was still using one of Windows 95 betas (Chicago) at the time. ... my back hurts just thinking about it
@christoffer4862 Says:
There was another great experimental compression format called NanoZip but it disappeared.
@fimapearl9132 Says:
smart jewish guys :D none of this would exists without israel
@isheamongus811 Says:
The Linux kernel contains code for zip?
@Yanni_X Says:
i always wondered and will still continue to why people create .rar-files, which explicitly need a proprietary & commercial software to compress (& for a long time decompress) while zip was mostly "free". At least since windows introduced .zip-support i never understood winrar-people... I don't see advantages here except if you purposefully want to annoy people that don't have winrar installed.
@mewnz Says:
I remember ARJ being very popular in the early 90s on BBS hosts.
@user-ty3hp7wq6z Says:
Too bad zstd isn't discussed further. Created by Facebook and donated to Open Source with great balance of speed, memory and compression.
@LeYuzer Says:
NanoZip or 7zip-zstd and use z-standard. Others are trash
@MrItrollaround Says:
Ok so apparently, I've been using YouTube before it was even owned by Google, now I just learned that links are no longer allowed??? WTF? I guess I'll repost my entire comment again. For those who are interested in data compression as a whole and just would like to learn a bit about the process. The ZPAQ Author, Matt Mahoney, has a site called "Data compression explained." Check out that site as its very detailed in how compression works on images, videos, sound general purpose, and etc. Talks about Deflate (zip, gzip, zlib), 7z (lzma, lzma2), bzip2 (BTW), and all of the other popular algorithms and formats. Highly advise looking at this site to understand the types of compression you can use to either compress efficiently, faster, or just figure out how far you can go compressing data.
@firetruck988 Says:
File compression is especially useful on windows to bundle folders with 1000s of files, as without compression windows can have a rather hard time moving them all around. That's my use case, rather than saving space.
@ted_van_loon Says:
actually a friend of mine started about creating our own new internet, something I have been into for quite a while as well. so took that as a reason to start doing such things again since the last time I did something like that was when I just started to learn programming and such. right now I have much better logic designs and such for it, much more complete mostly, and also can use custom hardware and such, since we also wanted this new internet to not need to rely on the normal internet, which lead to getting even more interest into certain projects like meshtastic. the thing is networks like meshtastic and technology like LoRa can't yet really handle real internet use, or well it could but optimally would need something to bypas any form of spoofable or paid/licenced identifier so that you can always find and acces any site or person you seek to contact even if their location and IP and such would constantly change and at moments noone might know where they are. (should be fully decentralized and decentralizable with full user controll over themself.) next to that to allow a functional internet to work on well on such networks instead of only random short messages without clogging them even if many people use them a different internet culture is needed, so more aimed at actual use instead of abuse, but also compression, or a new type of compression. speciffically one easy one is a alternative to HTML, which allows open universal easy to use and even readable by a normal person compression method as well as easy software to read it. it works quite similar to a dictionairy compression algorythm, also just making it based on what is in it. but it just uses a header which contains all such things and special exceptions, then after that all which is needed is to just reffer to that, this allows people to even encode and decode it manually. also the main focus is just for text, but lang makeup things should also be a lot more compact since html actually really uses super long strings for even basic operations, this can be easily bypassed by using that header above, since that header not only holds the dictionary but also the exceptions. or even allowing to manually set instructions to something else, all in minimal characters. on a simple html page this can reduce the amount of data with around 10 times easily, on simple pages with a lot of text even more so, complex pages will be limited by the data things like images and video take up. but since the network is designed to work kind of like a online decetralized library, so more for communicating and finding actual info, most images needed will only need a few colours and a few pixels, so I designed 2 image formats, wich actually even already before compression can show complete images in barely no file size at all, the default is black and white only, since it is meant to be used kind to depict shapes and such, or for reffereneces. the black and white one is simple and would for example only need around 9 bytes to stora a image before compression actually 8 bytes if a exception is set up, the 9th byte is kind of the info byte in this case to support custom things. but it can also support shades and colours, still meant for when only a few are used since it is meant more for refference than for full images, for full high quality images normal formats would be better. also at some point for big pictures it might be better to eventually use AI to compress them so you can generate something more like SVG or a combination of VG and bitmap, as in that is would draw lines and shapes instead, I guess that would be the third format, even though that by default would work by just drawing the lines. so compression still is quite usefull up to today. actually it would be great if the normal internet would also use more compression since it would save tons of bandwith and also storage and potentially even help the environment, since such a web format as which I am somewhat developing(more in the random moments I feel like it, so barely anything but it is there), would allow it to be compressed once, or even be written already compressed since it is designed to be able to be human writable and human readable, even though with current AI someone might rapidly make a compression algorythm which actually uses AI to generate the perfect library and such. but since it is easy to decompress as well, and even more so for modern hardware, the reduction in the amount of data transferred actually saves far more power than decompressing that would be, next to that the reduced transaction time also means the screen has to be on for shorter amounts of time for the same operation. also designed a new search engine, which is both far more effient, uer controllable and gives results which no modern search engine can even dream of, even when they use AI with it they don't even get close to the usability of this search engine, since it actually uses the natural instinct of people and adopts itself to that so it can help people find what it truly needs, and it doesn't need to do so through datasets and algorythms, all is super leight weight and can easily be run locally as long as you have or have acces to some form of a index file or list. but intead of showing the user what it thinks the user wants to see. it uses a special "new" kind of UI which actually is more like a psychologic tric which helps people to figure out what they truly want and why, as well as to look deeper, but this search engine is so early stage that it really isn't ready yet at all.
@anon_y_mousse Says:
I remember back when bzip2 was new that I went and compressed nearly everything using it because it produced the smallest files of any compression software. I was still new to using Linux back then and had a lot of fun playing with that, so I look at it with fond memories even though it's no longer the best. Of course, now there's a bzip3 and I'm going to need to test that.
@DeepakThakur24 Says:
RIP to those who paid to use WinRAR 😂
@kunka592 Says:
I prefer to use WinRAR over 7-Zip because rar files allow you to open a random file in the archive much more quickly than a similar 7z file. I generally compress rar files with the older format - not RAR5, and with 3% recovery record. This ensures compatibility even if I want to open it with my older machines. You can even get a version of UnRAR for DOS that will work with this format.
@snoopdogg9490 Says:
He looks AI generated
@Jajaho2 Says:
Another great video, you made it very captivating.

More Programing Videos