Care to share some insights I’m missing, aside from CDNs?
For example, the image name… why do they even bother renaming it to that? Is it a random filename generated from the upload time? Is it a government secret hash that will burn your soul if you know why they generate the filenames the way they do? haha
I guess the same goes for sites like Flickr. They have very similar filenames. The directory structure is different (all numeric) and they are also hosted on a farm server… literally named farm8 on the account I’ve been looking at for the past two minutes.
It’s all well and good, but a few things I’m not sure about.
1. I’ve read in a few places that keeping the files limited to 1000 per directory is a good idea (the reasons differ here and there, but it’s the number most often stated). What keeps the file count under this 1000 limit in the previous algorithm? I’ve not actually seen any algorithm that enforces the limit; it just kinda depends on the hash to spread out the files.
2. How does the algorithm in the previous link ensure that the files are spread evenly?
3. Why does he use bytes as directory structure indicators, rather than just using the first few chars of the hash?
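To make the three questions concrete, here is a minimal sketch of how I understand this kind of scheme. It is not the code from the link: the MD5 hash, the two-level layout, and the names `shard_path` and `leaf_counts` are all my own assumptions for illustration.

```python
import hashlib
from collections import Counter


def shard_path(filename: str, levels: int = 2) -> str:
    """Build a directory path from the leading bytes of the filename's MD5 hash.

    Hypothetical helper: the hash choice and layout are assumptions.
    """
    digest = hashlib.md5(filename.encode("utf-8")).digest()
    # One byte per level gives 256 possible directories per level.
    # Writing each byte as two hex characters is the same as taking
    # the first 2 * levels characters of the hex digest.
    parts = [f"{b:02x}" for b in digest[:levels]]
    return "/".join(parts + [filename])


def leaf_counts(n_files: int, levels: int = 2) -> Counter:
    """Count how many made-up filenames land in each leaf directory."""
    counts = Counter()
    for i in range(n_files):
        path = shard_path(f"image_{i}.jpg", levels)
        leaf_dir = path.rsplit("/", 1)[0]
        counts[leaf_dir] += 1
    return counts


if __name__ == "__main__":
    counts = leaf_counts(100_000)
    # Number of leaf directories used, and the fullest one.
    print(len(counts), max(counts.values()))
```

If this sketch is right, nothing in it caps a directory at 1000 files. With two levels there are 256 * 256 = 65,536 leaf directories, so the average per directory is just the total file count divided by 65,536, and the spread is only as even as the hash output. It also suggests that "bytes" versus "first few chars of the hash" is mostly a matter of notation, since one byte is exactly two hex characters.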
Is all this worth it if you aren’t storing a million user images? It’s awesome to know either way.