Académique Documents
Professionnel Documents
Culture Documents
www.scantips.com
Web scantips.com
(.JPG file extension, pronounced Jay Peg). This is the right format for
those photo images which must be very small files, for example, for
web sites or for email. JPG is often used on digital camera memory
cards, but RAW or TIF format may be offered too, to avoid it. The JPG
file is wonderfully small, often compressed to perhaps only 1/10 of the
size of the original data, which is a good thing when modems are
involved. However, this fantastic compression efficiency comes with a
high price. JPG uses lossy compression (lossy meaning "with losses to
quality"). Lossy means that some image quality is lost when the JPG
data is compressed and saved, and this quality can never be
recovered.
JPG files don't work that way. JPG is a big exception. JPG
compression is not lossless. JPG compression is lossy. Lossy means
"with losses" to image quality. JPG compression has very high
efficiency (relatively tiny files) because it is intentionally designed to
be lossy, designed to give very small files without the requirement for
full recoverability. JPG modifies the image pixel data (color values) to
be more convenient for its compression method. Tiny detail that
doesn't compress well (minor color changes) can be ignored (not
retained). This allows amazing size reductions on the remainder, but
when we open the file and expand the data to access it again, it is no
longer the same data as before. This lost data is like lost purity or
integrity. It can vary in degree, it can be fairly good, but it is always
unrecoverable corruption of the data. This makes JPG be quite
different from all the other usual file format choices. This will sound
preachy, but if your use is critical, you need a really good reason to
use JPG.
Even worse, more quality is lost every time the JPG file is compressed
and saved again, so ever editing and saving a JPG image again is a
questionable decision. You should instead just discard the old JPG file
and start over from your archived lossless TIF master, saving that
change as the new JPG copy you need. JPG compression can be
selected to be better quality in a larger file, or to be lesser quality in a
smaller file. When you save a JPG file, your FILE - SAVE AS dialog box
should have an option for the degree of file compression.
Regardless of that selected image size in pixels, the camera menu will
also offer a smaller file size choice in bytes, related to quality, via JPG
file compression. This menu will offer a best quality setting which is
the largest file, and maybe intermediate sizes, and a smallest but
worst quality choice. My Nikon D70 offers three JPG file size choices of
Fine (about 1/4 size in bytes), Norm (about 1/8 size in bytes), or Basic
(about 1/16 size in bytes), comparing compressed file size to the
uncompressed size. The best (largest) JPG file size will still contain JPG
artifacts, but very mild, essentially undetectable, vastly better than the
smallest file choice. Even better, some cameras also offer a RAW or
TIF format to bypass JPG problems all together. These images may be
large, but memory cards are becoming less expensive ($100 for 1 GB),
and larger or multiple cards are by far the best quality solution.
With either scanner or camera images, individual image JPG file sizes
will vary a little, because detail in the individual image greatly affects
compressibility. Large featureless areas (skies, walls, etc.) compress
much better (smaller) than images containing much tiny detail all over
(a tree full of leaves). Therefore images of the same size in pixels and
using the same JPG quality setting, but with differing image content,
will vary a little in JPG file size, with extremes perhaps over a 2 to 1
range around the average size.
Since each image varies a little, the file size is only a crude indicator of
JPG quality, however it is a rough guide. For ordinary color images (24
bit RGB), the uncompressed image size when opened in memory is
always 3 bytes per pixel. For example, an image size of 3000x2000
pixels is 6 megapixels, and therefore by definition, when
uncompressed (when opened), this memory size is 3X that in bytes, or
18 MB. That is simply how large the 24 bit data is. The compressed
JPG file size will be smaller (same pixels, but fewer bytes). A High
quality JPG file size might be compressed to 50% to 25%
uncompressed size (bytes). A JPG file size only 10% of that image's
size in memory would the general ballpark for a fair tradeoff of quality
vs. file size for color images of web page quality (but not best quality).
The 10% size is not very precise, but of course only refers roughly to
the average image size, since each individual image varies a little.
Color compresses better than grayscale files, so grayscale doesn't
decrease as much. These are very rough guidelines, your image, your
photo program, your purpose, and your personal criteria or tolerance
will all be a little different.
JPEG Artifacts
I am sent JPG images with questions why they appear "bad" and very
"unclear", when the only problem is excessive JPG compression, too
much, or saved too many times, or both. Too little Quality or too much
Compression will affect JPG image quality visibly. It can be awful.
You can and should learn to recognize this bad effect yourself.
It's easy to recognize, almost trivial, after you've seen it once. Then
you will know if it is acceptable or not, and you will know that you
have choices.
If using a digital camera, shoot the same scene as the best file format
you have, a RAW or TIF file, or at least at the best JPG quality setting
(largest file). Also shoot exactly the same scene at the lowest JPG
quality setting (smallest file). If using a scanner, scan a new photo
image and save it as a TIF file. Then, after that, also save the same
image as a JPG file using more compression than usual, a lower JPG
quality setting this one time to help learn to see it. Then close that JPG
image, and reopen that file to be sure you are viewing the contents of
the file image you actually wrote (instead of the uncompressed image
on the screen that you thought you wrote).
Then examine both large and small file images side by side on the
same screen, by zooming in to about 4 times size (400%,
huge) on both. You will have to scroll around on them, but the 400%
is to help you learn to recognize the artifacts this first time. The
differences you see are the JPG artifacts of compression.
Here are samples below, zoomed large, but same size and side by side
for comparison, to better learn to recognize the JPG artifacts (that is a
cookie jar at left, with a little reflection on it)
Original TIF image ABOVE, zoomed to 300% size
Zooming to 300% size is not the normal thing to do, but it does help
to recognize these JPG artifacts the first time. After you learn what you
are looking for, then you can recognize them at 100% size too.
Note that most other types of image file compression (for example
PNG or TIF LZW) are lossless compression, meaning that there is
absolutely no loss of quality due to compression (zero loss), so that
then file quality is simply not an isssue at all, and the most critical user
need not ever worry about it. The TIF file above used LZW
compression. However lossless file compression is less effective,
meaning that it can not produce files so drastically small as JPG. The
lossless file size is closer to the actual size of the color data, perhaps
70% or 80% instead of 5% to 20%.
I want to call this JPG an extreme example, and it is poor, but it is not
extreme. You ought to see some of the images that people send me
asking why their images are so poor. This JPG was done in Photoshop,
and Adobe's lowest quality settings are conservative, and won't let us
make them as extreme and poor as some other programs will. The JPG
quality numbers like 8 or 9 are NOT absolute values, instead they are
relative to the JPG properties that each program chooses to
individually use.
JPG artifacts do vary, and this will be of more concern when you do
this same test on your own images. Once you realize what you are
looking for, then JPG artifacts are easy to see and recognize at 100%
size. Some people are more critical than others, asking "How good can
I make it?" instead of "How small can I make it"? Your priorities are
your own, but afer you are able to recognize JPG artifacts, you will be
able to judge how much of this you want in your own images.
Use a higher JPG Quality factor to minimize these effects (or don't use
JPG at all if maximum quality is important). Less JPG Quality is more
JPG compression, a smaller file, but worse artifacts. Normally you may
detect some artifacts even at high Quality factors, and you can learn
to recognize this easier if you zoom in to about 400% size. But the
image can still be very usable size if the compression is mild. Now you
know what to look for, and how to look for it, and how to judge if you
want it or not.
The JPG artifacts become part of the image data, and it cannot be
removed. Sharpening again after JPG compression (next time) will
emphasize these JPG artifacts, so be careful with that. Actually, very
slight intentional softening or blurring before JPG compression will help
minimize the effect of the JPG artifacts (and will slightly reduce JPG file
size too).
If you find that you must edit the JPG image and must save it as JPG
again, at least try to use the same program and same value of Quality
or Compression every time you save the file. Using different values will
use different parameters that will aggravate the damage due the lossy
compression. I am certainly not suggesting repeated saving of JPG
files with the same parameters is a good thing, but only that there are
even worse ways to do it.
IrfranView needs its free extra JPG plugin package to provide lossless
rotation, which is then found at menu Options - Lossless JPG
Operations when the JPG file is open.
Continued
Web scantips.com
JPG format has a magic status for us. JPG is wonderful when the
purpose is right, but we need to understand we pay a cost in quality. I
know you surely will consider using JPG for master copies, everyone
does at first, because the JPG file is so small, and the idea is frightfully
appealing. But it's a high price paid in lost quality, and you will
eventually come around to appreciate the quality of TIF files. I hope
that happens before you have damaged important images that you
cannot scan again. Some people argue that high quality JPG masters
are not so bad. That's OK with me, it's your file, but the file size is one
property of an image, and quality is another. You can choose either
way, and I hope you choose Highest Quality, at least for your master
copies. My goal is that you understand the effects of your choice in
regard to your valuable master copies.
If you simply must use JPG for your master copy (if file size is the
most important property of this image), then at least use the best
Quality you can. Aim for a average JPG file size at least as large as
25% of its true memory size. For web images, file size is indeed
important, and absolute quality is less important, and a JPG file size
10% of uncompressed size is often a good tradeoff of quality for size
for color images, but that is too extreme for any notion of a master
copy. Grayscale images do not compress as much as color at the same
settings. Grayscale is already 1/3 the size of color, and grayscale JPG
files won't compress as much.
Then keep a master copy that you never rewrite. For any purpose,
always start with a copy of that JPG master and never modify that JPG
master itself. Saving that JPG master image again causes more
artifacts, even more loss, permanent loss, so simply don't do it. If you
simply need to rotate to vertical, or to scale for printing, consider not
saving that simple change at all if it is trivial to do again. Save any
change to a different file, and never make your one master copy
worse. Or save the change to a TIF file, so as to not create more JPG
artifacts. You could of course just use a TIF master instead, and sleep
better at night. And be aware that lowering scan resolution to
reasonable values for the purpose is often the best size reduction you
can make.
Judge your own common sense periodically. Why can't we see that a
decent JPG image compressed to be only 25% of original size (1/4
size, still amazingly small, and still with mild artifacts), is more useful,
and a much better deal, than one at 5% size that is simply too awful
to use? I know, the macho types are saying "but I can get it down to
3%". Then go for it man! <grin>
For emailing and for web pages, JPG file format is the smallest by far,
and the best goal by far (for photo images). JPG is acceptable for such
read-only "viewing" use, and a JPG copy normally is used for such
purposes. JPG is NOT very acceptable for "editing" use, when you may
need to edit and save the image again (we always do). JPG is best
only for photographic images. For line art and graphic files
(characterized by containing graphic sharp edges and very few but
solid colors), then TIF LZW or PNG is good, or GIF files too, which have
historically been used for graphics on web pages.
The process of saving a JPG file to disk first compresses the image
data to make the file be much smaller. This JPG compression affects
the image quality, because JPG compression uses a very different
technique than does PKZIP or TIF LZW or PNG file compression (lossy
vs. lossless). JPG compression is extremely effective, because it does
not attempt to be 100% faithful to the original image data. If parts of
the image data are a problem for it, the JPG compression simply
modifies that data to make it conform better. That means when we
read that JPG file later, we don't get back the same original image
data (that is the meaning of lossy, losses of image quality). Also the
software opening that JPG image does a similar thing when it reads it.
It does not necessarily show exactly what is in the file, it can also take
similar liberties. Web browsers for example are not the best JPG image
viewers, but they are the fastest JPG viewers.
Most other file formats (say TIF or PNG) use lossless compression.
These files are larger than JPG because they use milder, fully
recoverable (lossless) compression to carefully preserve all of the
original image data. These file formats remain full quality at all times,
no matter how many times we might save them to a file, which is of
course exactly what is needed for a master copy.
The small JPG file size is great, but it has a big price of reduced
quality. There are proper times and places one would use it, and also
major reasons one would not. There's nothing wrong with creating a
JPG image using a moderate to high Quality factor to put a photo
image on a web page or to send it via email. It's the only practical
way. However there is an additional quality loss when we try to edit
and save that JPG file a second time, so JPG is usually inappropriate
for important archived master copies. The risk if you make this
mistake now is that you cannot undo it in the future, so now is the
best time to understand the situation.
Progressive JPG is a web option that quickly shows the entire image in
very low quality, and the quality fills in and improves as the file
download completes.
JPEG 2000
A new JPEG 2000 file format is available now in many newer program
versions, with various file extensions, normally .jp2 (or .jpx, with
option extensions). JPEG 2000 uses a wavelet compression method. It
has a lossless option (to perhaps 65% size). Otherwise it is lossy, size
comparable to regular JPG files, but different... slower then JPG, but
arguably perhaps better quality. Extremes of compression have few
detectable artifacts, however JPEG 2000 images become noticeably
soft with greater compression.
Note that web browsers do NOT support JPEG 2000 yet, and so
compatibility is still a very significant issue. Therefore it seems
important to stay with the original JPG format for now, at least if you
want others to be able to open your files.