stonesoflife: Computer Graphics and Applications (B.E. IT)

https://dl.dropbox.com/u/94729309/Computer%20Graphics%20and%20Applications.docx
file ကို ေဒါင္းျပီးၾကည့္ရင္ေတာ့ပံုေတြပါျမင္ရမယ္။

ဒါကေတာ့ B.E. IT မွာသင္ၾကားရမယ့္ Computer Graphics and Applications ပါ။ မွားတာရွိရင္ေျပာျပေပးပါ။ လုိတာရွိလည္းျဖည့္စြက္ ေပးေပါ့။ လုိတာေတာ့ရွိမွာအမွန္ဘဲ။

Computer Graphics and Applications

Chapter 1

Digital Image Representation

Monochrome image ဆိုတာ color တမ်ိဳးထဲပါတဲ့ image မ်ိဳးကိုေျပာတာ။ အဲဒီလ္ို image မ်ိဳးကို function တစ္ခုအေနနဲ႕ ေျပာမယ္ဆိုရင္ f(x,y) ေပါ့။ ဒီေနရမွာ x တို႕ y တို႕ဆိုတာက coordinate ေတြကိုေျပာတာပါ။ ဥပမာ f(10,15)=100 ဆိုပါေတာ့။ ဆိုလိုတာက x = 10, y = 15 မွာရွိေနတဲ့ point ရဲ႕ brightness သို႕မဟုတ္ gray level ဟာ 100 ရွိတယ္လုိ႕ေျပာတာပါ။ 0 ဆိုရင္ အဲဒီေနရာေလးမွာအမဲေရာင္ေပါ့။ 255 ဆိုရင္ေတာ့ အဲဒီေနရာေလးမွာ အျဖဴေရာင္ေပါ့။ 0 နဲ႕ 255 ၾကား တန္ဘိုးတစ္ခုခုဆိုရင္ေတာ့ အမဲနဲ႕အျဖဴၾကားကတစ္ခုခုေပါ့။ မွတ္ထားရမွာက value နည္းရင္ dark ျဖစ္ျပီး မ်ားရင္ light ျဖစ္ပါတယ္။ ဆိုလိုတာက light intensity နည္းရင္ေမွာင္တာေပါ့။ မ်ားရင္ေတာ့ light ျဖစ္တာေပါ့။

Digital image ဆိုတာ discretize လုပ္ထားတဲ့ image မ်ိဳးကိုေျပာတာ။ ဥပမာ ေျပာရရင္ digital device ေတြျဖစ္တဲ့ကြန္ပ်ဴတာတုိ႕၊ ဖုန္းတုိ႕၊ memory stick တုိ႕အစရွိတဲ့ေနရာေတြမွာ သိမ္းထားတဲ့ image ေတြကို digital image လုိ႕ေခၚတာေပါ့။ ဖလင္ေပၚမွာ၊ photo paper ေပၚမွာရွိေနတဲ့ပံုေတြကေတာ့ discretize လုပ္ထားတာမဟုတ္တဲ့အတြက္ digital image လုိ႕မေခၚႏုိင္ပါဘူး။ digital image တစ္ခုကို two dimensional matrix တစ္ခုလို႕ျမင္ၾကည့္ပါ။ အဲဒီ matrix ထဲမွာပါတဲ့ element ေတြကိုညြန္းဘုိ႕ဆို row number ေတြ column number ေတြနဲ႕ညြန္းရပါတယ္။ ပါ၀င္တဲ့ element ေတြက ေတာ့ အဲဒီ ပံုမွာ ရွိေနတဲ့ gray level value ေတြပါဘဲ။ အဲဒီ element ေတြကို image element, picture element, pixel element, pixel or peal လိုေခၚပါတယ္။ pixel လုိ႕အေခၚမ်ားပါတယ္။ pixel ဆိုတာ picture element ရဲ႕ေရွ႕၂လံုးစီကိုတြဲေခၚလိုက္တာပါဘဲ။ ဒါဆိုနားလည္ထားရမွာက digital image တစ္ခုကို pixel ေတြနဲ႕ဖြဲစည္းထားျပီး pixel ဆိုတာ light intensity ကိုေျပာတာပါ။

Digital image တစ္ခုရဲ႕ size ေတြ gray level ေတြကိုသတ္မွတ္တဲ့အခါမွာ 2 ရဲ႕ integer power နဲ႕သတ္မွတ္ေလ့ရွိပါတယ္။ ဥပမာ TV image တစ္ခုရဲ႕ size က 512 x 512 ဆိုပါေတာ့။ ဒီ image ဟာ row ေပါင္း 512 ခုရွိျပီး column ေပါင္း 512 ခုရွိတယ္ေပါ့။ ဒါဆို pixel ေပါင္း 512 x 512 = 262144 ရွိတာေပါ့။ image တစ္ခုရဲ႕ size ကိုေျပာတဲ့အခါ အဲဒီ image မွာပါတဲ့ pixel အေရအတြက္နဲ႕ေျပာပါတယ္။ ဒါေပမယ့္ pixel အေရအတြက္ကို တုိက္ရုိက္မေျပာဘဲ row x column နဲ႕ေျပာပါတယ္။ ဒီလုိသတ္မွတ္တဲ့ေနရာမွာလည္း ခုနကေျပာခဲ့သလို 2 ရဲ႕ integer power နဲ႕သတ္မွတ္ေလ့ရွိပါတယ္။ (128 x 128) or (256 x 128) အစရွိသျဖင့္ေပါ့။ gray level ကိုလည္း 128, 256 အစရွိသျဖင့္ 2 ရဲ႕ integer power နဲ႕ဘဲေျပာပါတယ္။

Element of digital image processing system

အဓိက သံုးမ်ိဳးခြဲၾကည့္ရေအာင္။

· Digitization

· Processing

· Display

Digitizer

Image တစ္ခုကို digital computer ထဲမွာသိမ္းဘို႕ဆိုရင္ ကြန္ပ်ဴတာက နားလည္တဲ့ numerical ပံုစံေျပာင္းေပးရပါတယ္။ အဲဒါကို digitization လုပ္တယ္လို႕ေခၚပါတယ္။ ေျပာင္းေပးတဲ့ေကာင္ကိုေတာ့ digitizer လို႕ေခၚတာေပါ့။ အသံုးမ်ားတဲ့ digitizer ေတြကေတာ့ microdensitometer, flying spot scanner, image dissector နဲ႕ TV camera digitizers ေတြပါ။ microdensitometer တို႕ flying spot scanner ေတြနဲ႕ digitize လုပ္မယ္ဆိုရင္ေတာ့ အဲဒီ image ဟာ transparency (ဥပမာ- negative film) or photograph ပံုစံနဲ႕ရွိေနရပါမယ္။ image dissector နဲ႕ TV camera ေတြကေတာ့ဒီလုိပံုစံမ်ိဳးရွိေနတဲ့ image ေတြသာမက natural image ေတြကိုပါ digitize လုပ္ေပးႏုိင္ပါတယ္။ ဒီ image ေတြမွာ detector ကို excite ျဖစ္ေစႏုိင္ေလာက္တဲ့ light intensity ေတာ့ရွိရမွာေပါ့။

Microdensitometer ကိုသံုးျပီး digitize လုပ္မယ္ဆို transparency or photograph ပုံစံနဲ႕ရွိေနတဲ့ image ကို flat bed ေပၚမွာတင္ သို႕မဟုတ္ drum မွာ ပတ္ထားရပါတယ္။ ျပီးရင္ အဲဒီ image ကို scan လုပ္ပါတယ္။ လုပ္တဲ့ ပံုစံကေတာ့ အဲဒီ image ေပၚကို အလင္းတန္းေတြက်ေရာက္ေစတာပါ။ ျပီးရင္ bed ကို translate လုပ္မယ္၊ drum ဆိုရင္လည္း rotate လုပ္မယ္။ လွည့္မယ္ဆိုပါေတာ့။ transparency image မ်ိဳးဆိုရင္ အလင္းတန္းေတြက film ကိုေဖာက္သြားမယ္။ photograph ဆိုရင္ေတာ့ image ရဲ႕ surface ေပၚအလင္းက်ေရာက္ျပီးအဲဒီကေနအလင္းျပန္လာမွာေပါ့။ ျပီးရင္ အဲဒီ beam (အလင္းတန္း)ေတြက photo detector ဆီေရာက္မယ္။ ေရာက္လာတဲ့ beam ရဲ႕ light intensity ေတြကို record လုပ္လုိက္တာပါဘဲ။ digital image ဆိုတာ intensity ရဲ႕ discrete value ေတြကိုသာယူပါတယ္။ microdensitometer ဆိုတာ ေႏွးေကြးတဲ့ device မ်ိဳးပါ။ ဒါေပမယ့္ position တိက်မႈအရွိဆံုး device ပါဘဲ။ mechanical translation နည္းနဲ႕လုပ္တာမုိ႕ ေႏွးတာပါ။ position ေတာ့တိက်တာေပါ့။

Flying spot scanner ရဲ႕အလုပ္လုပ္ပံုကလည္း photodetector ကိုလာထိတဲ့ bean ေတြကိုဘဲ focus လုပ္သြားတာပါ။ Microdensitometer ကေတာ့ image ကုိ bed ေပၚတင္ထားျပီး bed ကို translate လုပ္တာတုိ႕ drum မွာပတ္ထားျပီး drum ကို rotate လုပ္တာတုိ႕ရွိပါတယ္။ Flying spot scanner ကေတာ့ image ကိုမေရႊ႕ဘဲအေသထားပါတယ္။ light source က CRT ေပါ့။ ဒီမွာေတာ့ beam ေတြရဲ႕ movement ဟာ electronically သြားတာမုိ႕ scanning speed ကျမန္ပါတယ္။

Flying Spot Scanner

Image dissector နဲ႕ TV camera ေတြမွာေတာ့ image ကို photosensitive tube မွာတုိက္ရိုက္ focus လုပ္ပါတယ္။ ဒီ tube မွာ ရိုက္ခတ္လာတဲ့ light pattern အရ output ထုတ္ေပးပါတယ္။ Dissector ကေတာ့ electronic emission နည္းကိုသံုးပါတယ္။ General purpose TV image digitizer ေတြကေတာ့ vidicon tube ကိုသံုးပါတယ္။ photoconductivity ကို အေျခခံျပီးအလုပ္လုပ္ပါတယ္။

Standard vidicon digitizer ေတြက အျခား system ေတြနဲ႕စာရင္ တိက်မႈပိုင္းမွာအားနည္းပါတယ္။ ဒါေပမယ့္ သူ႕ရဲ႕ အားသာခ်က္ေတြေတာ့ရွိတယ္။ digitize လုပ္ေနတဲ့ image ကို TV monitor ေပၚမွာျမင္ေနႏုိင္ပါတယ္။

Image Processors

Image processing အတြက္အသံုးျပဳတဲ့ system ေတြကေတာ့ range အမ်ိဳးမ်ိဳးရွိပါတယ္။ special purpose application ေတြအတြက္အသံုးျပဳတဲ့ microprocessor device ေတြရွိသလို high resolution image ေတြကို ေဆာင္ရြက္ႏုိင္တဲ့ larger computer system ေတြလည္းရွိပါတယ္။

လူသံုးမ်ားတဲ့ storage media ၂ခုကေတာ့ magnetic tape နဲ႕ disk pack ပါ။ (ခုေတာ့သိတဲ့အတုိင္း SD card ေတြေရာ ဘာေတြေရာ အစံုရွိေနပါျပီ။ ခုဟာကေတာ့ စာအုပ္ထဲပါတဲ့အတုိင္းေျပာရတာပါ။) ဒီ device ေတြဟာ device တစ္ခုမွာ image ေပါင္းေျမာက္မ်ားစြာကိုသိမ္းဆည္းထားႏိုင္ပါတယ္။ ဘယ္ေလာက္ထိသိမ္းႏုိင္လည္းဆိုတာ ကေတာ့ အဲဒီ device ေတြ႕ရဲ႕ size နဲ႕ သိမ္းမယ့္ image ေတြရဲ႕ size ေပၚမူတည္တာေပါ့။ သာမန္အားျဖင့္ဆိုရင္ေတာ့ အျမဲတမ္းအတြက္သိမ္းထားဘုိ႕ဆိုရင္ magnetic tape ေတြကိုအသံုးျပဳပါတယ္။ process လုပ္ေနစဥ္ကာလအတြင္းမွာ image data ေတြကို computer memory နဲ႕ bulk storage media (hard disk ဆိုပါေတာ့) ၾကား transfer လုပ္ရာမွာ ျမန္ဆန္ေစဘို႕အတြက္ကေတာ့ disk ေပၚမွာ သိမ္းပါတယ္။

Image processing လုပ္မယ့္ computer sytem တစ္လံုးမွာ ဆိုရင္ 32000 ကေန 64000 word ထိရွိတဲ့ core memory ပါ minicomputer ရယ္ disk device ၂ခုရယ္ magnetic tape ရယ္ အျခား peripheral ေတြျဖစ္တဲ့ scope terminal, cassette recorder, line printer သို႕မဟုတ္ အျခား hard-copy output device ေတြရွိဘုိ႕လိုပါတယ္။ (ဒါလည္းစာအုပ္ထဲပါတဲ့အတုိင္းေျပာတာ။ ခုေတာ့ နဲနဲအေျပာင္းအလဲရွိတာေပါ့။)

Display Devices

Image processing system တစ္ခုမွာပါတဲ့ display unit ရဲ႕တာ၀န္က computer ထဲမွာ store လုပ္ထားတဲ့ numerical form ကေန လူေတြျမင္ႏုိင္တဲ့ ပံုစံျဖစ္အာင္ေျပာင္းေပးရတာပါ။ ဥပမာ အျဖဴေရာင္ pixel တစ္ခုနဲ႕ အမဲေရာင္ pixel တစ္ခုပါတဲ့ image တစ္ခုဆိုပါေတာ့။ ကြန္ပ်ဴတာထဲသိမ္းထားတဲ့အခါမွာ 0 နဲ႕ 255 ဆိုျပီးသိမ္းထားမွာပါ။ ဒါကိုကြန္ပ်ဴတာကနားလည္ေပမယ့္ လူကိုဒီအတုိင္းထုတ္ျပလို႕ မရပါဘူး။ ထုတ္ျပလိုက္ရင္ လူက 0 နဲ႕ 255 ဆိုတာကိုဘဲျမင္ျပီး ဘယ္လို interpret လုပ္ရမယ္မွန္းသိမွာမဟုတ္ဘူး။ လူနားလည္တဲ့ အျဖဴေရာင္နဲ႕ အမဲေရာင္ျဖစ္သြားေအာင္ display unit ကလုပ္ေပးရတာပါ။ display unit ေတြကေတာ့ CRT တုိ႕ TV system တုိ႕ printing device တုိ႕ပါဘဲ။

Chapter 2

Digital Image Fundamentals

ဒီ chapter ရဲ႕ ရည္ရြယ္ခ်က္ကေတာ့ စာဖတ္သူေတြကို image concepts ေတြနဲ႕မိတ္ဆက္ေပးဘုိ႕ပါ။ ဒီစာအုပ္မွာ သံုးမယ့္ notation ကိုလည္းေျပာသြားမွာေပါ့။ ပထမ section ကေတာ့ human visual system ရဲ႕ mechanics ကို အက်ဥ္းခ်ဳပ္ေျပာမွာပါ။ မ်က္လံုးထဲမွာ ပံုတစ္ခု ဘယ္လိုျဖစ္လာလဲဆိုတာမ်ိဳးေပါ့။ ျပီးေတာ့ brightness adaptation ရယ္ discrimination ရယ္နဲ႕ပတ္သက္ျပီး လူ႕မ်က္လံုးကဘယ္လုိစြမ္းေဆာင္ႏုိင္တယ္ဆိုတာေပါ့။ Section 2.2 မွာေတာ့ image model အေၾကာင္းေျပာပါမယ္။ uniform image sampling နဲ႕ gray-level quantization အေၾကာင္းကိုေတာ့ Section 2.3 မွာေျပာပါမယ္။ Section 2.4 မွာေတာ့ nonuniform sampling နဲ႕ quantization အေၾကာင္းေျပာပါမယ္။ ေနာက္ဆံုး section ျဖစ္တဲ့ Section 2.5 မွာေတာ့ photographic film နဲ႕ သူ႕ရဲ႕ အေရးၾကီးတဲ့ characteristic ေတြကို ေတြ႕ရမွာျဖစ္ပါတယ္။

2.1 Element of Visual Perception

2.1.1 Structure of the Human Eye

Human eye ရဲ႕ horizontal cross section (ေဘးတုိက္ျမင္ရတဲ့အေနအထား) ကို Figure 2.1 မွာၾကည့္ႏုိင္ပါတယ္။ မ်က္လံုးကေတာ့ သိတဲ့အတုိင္း လံုးလံုးၾကီးေပါ့။ သူ႕ရဲ႕ diameter က 20 mm ေလာက္ရွိတယ္။ သူ႕ကို အလႊာ ၃ခုနဲ႕ဖုံးထားတယ္။ အဲဒီအလႊာ ၃ခုကေတာ့ (၁) cornea and sclera outer cover (၂) choroid (၃) retina တို႕ေပါ့။ ပံုမွာလည္းၾကည့္ႏုိင္ပါတယ္။

Figure 2.1

ဒါကလူ႕မ်က္လံုးကိုေဘးတုိက္ျမင္ရတဲ့အေနအထားပါ။

cornea ဆိုတာကေတာ့ အလင္းေဖာက္ထြင္းႏုိင္တဲ့အစိတ္အပိုင္းတစ္ခုေပါ့။ မ်က္လံုးရဲ႕အတြင္းမ်က္ႏွာျပင္ကိုဖုံုးအုပ္ေပးထားတာပါ။ sclera ဆိုတာကေတာ့ Greek ဘာသာ skleros ကလာတာပါ။ မာေက်ာတယ္ဆိုတဲ့အဓိပါယ္ေပါ့။ လူ႕မ်က္လံုးမွာဆုိရင္ sclera ကအျဖဴေရာင္ျဖစ္ပါတယ္။ မာမမာေတာ့မသိဘူး။ သူကအလင္းမျဖတ္ႏုိင္တဲ့ membrane တစ္ခုေပါ့။ optic globe ရဲ႕က်န္တဲ့အပိုင္းေတြကိုဖံုးအုပ္ထားတယ္။ သူက cornea နဲ႕တစ္ဆက္ထဲပါ။ cornea ကေတာ့ အလင္းျဖတ္ႏုိင္တယ္လို႕ခုနေျပာျပီးျပီ။

choroid ကေတာ့ sclera ရဲ႕ေအာက္တည့္တည့္မွာရွိတယ္။ သူ႕မွာေသြးေၾကာမွ်င္ေတြရွိျပီး မ်က္လံုးအတြက္လုိအပ္တဲ့ nutrition ေတြေထာက္ပံ့ေပးေနတာပါ။ သူက pigmented ျဖစ္တယ္။ ဆိုလိုတာက အလင္းေတြရဲ႕အေရာင္ကိုေျပာင္းပစ္ႏုိင္တယ္ေပါ့။ ဒါေၾကာင့္ မ်က္လံုးထဲ၀င္လာတဲ့ extraneous light ေတြရဲ႕ပမာဏ နဲ႕ optical globe အတြင္းက backscatter ကိုေလွ်ာ့ခ်ေပးႏုိင္တယ္။ backscatter ဆိုတာကေတာ့ အလင္းျပန္တာကိုေျပာတယ္လို႕မွတ္သားမိတာဘဲ။ မ်က္လံုးရဲ႕အတြင္းအက်ဆံုးအပို္င္းမွာ ဆိုရင္ choroid ကို ၂ပိုင္းခြဲထားတယ္။ ciliary body နဲ႕ iris diaphragm ေပါ့။ မ်က္လံုးထဲ၀င္လာတဲ့ အလင္းရဲ႕ပမာဏကိုထိန္းခ်ဳပ္ဘုိ႕အတြက္ iris diaphragm ကလုပ္ေပးရတာပါ။ pupil (အမဲေရာင္အလံုး) ရဲ႕ diameter ကေတာ့ 2mm ကေန 8mm ထိရွိတတ္ပါတယ္။ iris ရဲ႕မ်က္ႏွာစာဘက္မွာ visible pigment ရွိျပီး ေနာက္ဘက္မွာေတာ့ black pigment ပါပါတယ္။

မ်က္လံုးရဲ႕အတြင္းအက်ဆံုး membrane ကေတာ့ retina (မ်က္ၾကည္လႊာ)ပါ။ မ်က္လံုးနဲ႕ object တစ္ခုကိုၾကည့္လုိက္တဲ့အခါ အဲဒီ object ကလာတဲ့ အလင္းတန္းဟာ retina ေပၚမွာပံုရိပ္ထင္ပါတယ္။ retina ရဲ႕ မ်က္ႏွာျပင္ေပၚမွာ discrete light receptor ေတြျပန္႕ႏွံ႕ေနပါတယ္။ receptor ၂မ်ိဳးရွိပါတယ္။ cones နဲ႕ rods ပါ။ မ်က္လံုးတစ္လံုးမွာ cone ေပါင္း six million ကေန seven million ထိရွိတတ္ပါတယ္။

cone ေတြဟာ retina ရဲ႕ central protion (fovea) မွာျပန္႕ႏွံ႕ျပီးရွိေနပါတယ္။ သူက color ကို sensitive ျဖစ္ပါတယ္။ cone ေတြေၾကာင့္သာ အေသးစိတ္ အႏုစိတ္ၾကည့္ျမင္ႏုိင္ၾကတာပါ။ ဘာေၾကာင့္လဲဆိုေတာ့ cone တစ္ခုခ်င္းစီဟာ nerve ေၾကာတစ္ခုစီနဲ႕ခ်ိတ္ဆက္ထားလုိ႕ပါ။ ကိုယ္ျမင္ခ်င္တဲ့ object တစ္ခုရဲ႕ image ကို fovea မွာထင္လာေစဘုိ႕အတြက္ muscle ေတြကိုသံုးျပီး eye ball ကို rotate လုပ္ေပးရပါတယ္။ (မ်က္လံုးကစားတယ္ေခၚမွာေပါ့။) cone vision ကို photopic or bright-light vision လို႕ေခၚပါတယ္။

Rod ရဲ႕အေရအတြက္ကေတာ့ cone ထက္ပိုမ်ားပါတယ္။ 75 to 150 million ေလာက္ရွိပါတယ္။ cone လုိဘဲ retina ရဲ႕မ်က္ႏွာျပင္မွာျပန္႕ႏွံ႕ျပီးရွိေနတာပါ။ rod ေတြကအေရအတြက္မ်ားေတာ့ သူျပန္႕ႏွံ႕ေနတဲ့ area ကလည္းက်ယ္ျပန္႕တာေပါ့။ ျပီးေတာ့ rods အေတာ္မ်ားမ်ားဟာ nerve ေၾကာတစ္ခုထဲကိုဘဲစုျပံဳခ်ိတ္ဆက္ထား ၾကပါတယ္။ ဒီအေၾကာင္း ၂ခုေၾကာင့္ ဒီ rod (receptor) ေတြက အေသးစိတ္ အႏုစိတ္ ခြဲျခားႏုိင္စြမ္းကိုေလ်ာ့က်ေစပါတယ္။ ဆိုလိုတာက rod ေတြဟာ overall picture ေလာက္သာေပးႏုိင္ပါတယ္။ color vision အတြက္လည္း rod ကလုပ္မေပးႏိုင္ပါဘူး။ low level illumination ေတြကိုသာ sensitive ျဖစ္ပါတယ္။ ဥပမာ ေန႕အလင္းေရာင္မွာ အေရာက္ေတာက္ေတာက္ပပ ျမင္ရတဲ့ ပစၥည္းတစ္ခုကို လေရာင္ေအာက္မွာ ၾကည့္တဲ့အခါ colorless ျဖစ္ေနတာေတြ႕ရပါမယ္။ ဘာေၾကာင့္လည္းဆိုေတာ့ လေရာင္ေအာက္မွာဆိုရင္ rod ေတြသာ stimulate ျဖစ္လို႕ပါ။ rod ေတြက color ကို sensitive မျဖစ္ဘူးေလ။ ဒါကို scotopic or dim-light vision လုိ႕ေခၚပါတယ္။

မွတ္ခ်က္။ ။ ကိုယ့္မ်က္စိတည့္တည့္မွာရွိေနတဲ့ object ေတြကိုအေသးစိတ္၊ အႏုစိတ္ျမင္ႏုိင္ပါတယ္။ object ေဘးကအရာေတြကိုေတာ့ အေသးစိတ္မျမင္ႏုိင္ဘဲ pattern တခုအျဖစ္သာျမင္ရတယ္ဆိုတာကိုလည္းလက္ေတြ႕သိႏုိင္ပါတယ္။

Len (မွန္ဘီလူး) ေတြကိုေတာ့ fibrous cell ေတြနဲ႕ျပဳလုပ္ထားပါတယ္။ ဒီ len ကို ciliary body မွာခ်ိတ္ဆြဲထားပါတယ္။ အမွ်င္ေတြနဲ႕ခ်ိတ္ဆြဲထားတာေပါ့။ len မွာ ေရဓါတ္ 60-70 %, အဆီ 6% ေလာက္ပါ၀င္ပါတယ္။ မ်က္လံုးထဲကအျခား tissue ေတြနဲ႕စာရင္ protein ဓါတ္အမ်ားဆံုးပါ၀င္တဲ့ အစိတ္အပိုင္းတစ္ခုေပါ့။ အသက္ၾကီးလာတာနဲ႕အမွ် len ေတြဟာ နဲနဲ ၀ါလာတတ္ပါတယ္။ len ဟာ visible light spectrum ရဲ႕ 8% ေလာက္ကို absorb လုပ္ႏိုင္ပါတယ္။ wavelength တိုရင္ ပိုျပီး absorb လုပ္ႏုိင္ပါတယ္။ len အတြင္းက protein ေတြဟာ infrared ေရာ ultraviolet light ေတြကိုပါ absorb လုပ္ပါတယ္။ ဒီလိုေရာင္ျခည္ေတြအမ်ားၾကီးၾကည့္မိရင္ေတာ့ မ်က္လံုးပ်က္ႏုိင္ပါတယ္။

2.1.2 Image Formation in the Eye

မ်က္လံုးထဲမွာ ပံုရိပ္တစ္ခုဘယ္လိုထင္လာသလဲဆိုတာ ၾကည့္ရေအာင္။ ျမတ္စြာဘုရားရဲ႕တရားေတာ္အရဆိုရင္ေတာ့ ရူပါရံု နဲ႕ စကၡဳပသာဒအၾကည္ဓါတ္ တုိ႕ထိေတြ႕မႈေၾကာင့္ ျမင္သိစိတ္ျဖစ္ေပၚလာတယ္လုိ႕ဆိုပါတယ္။ အခု Image formation in the eye ဆိုတာကိုဖတ္ျပီးရင္ေတာ့ လက္ေတြ႕သိႏိုင္ပါျပီ။

လူ႕မ်က္လံုးထဲမွာပါတဲ့ len နဲ႕ အျပင္မွာအသံုးျပဳေနၾကတဲ့ len ၾကားျခားနားခ်က္တစ္ခုကေတာ့ လူ႕မ်က္လံုးထဲက len က flexible ျဖစ္တာေပါ့။ အျပင္က len ကို သြားေကြးလုိက္လို႕ကေတာ့ ကြဲသြားမွာေပါ့။ Figure 2.1 မွာျပထားတဲ့အတုိင္း len ရဲ႕အတြင္းမ်က္ႏွာျပင္က အျပင္မ်က္ႏွာျပင္ထက္ပိုခံုးေနတယ္ဆိုတာေတြ႕ရမယ္။ len ရဲ႕ ပံုသ႑န္ကိုေတာ့ ciliary body မွာရွိတဲ့အမွ်င္ေတြက control လုပ္ေပးတယ္။ အေ၀းကပစၥည္းေတြကိုၾကည့္ခ်င္ရင္ muscle ေတြကေန len ကိုျပားသြားေစတယ္။ အနီးအနားက ပစၥည္းေတြကိုၾကည့္ဘို႕ဆိုရင္ေတာ့ ခံုးသြားေစတယ္။ (မ်က္လံုးကိုျပဴးလိုက္ရင္ len ကျပားသြားတယ္။ ၾကံဳ႕လိုက္ေတာ့ ခံုးသြားတယ္။) အနီးမႈံတဲ့သူေတြဆိုသိသာတယ္။ အနီးက object ေတြကိုၾကည့္တဲ့အခါ (မ်က္မွန္မပါဘဲစာဖတ္တဲ့အခါနဲ႕အပ္ေပါက္ထိုးတဲ့အခါေတြမွာ) မ်က္လံုးကိုၾကံဳ႕ျပီး ၾကည့္ေလ့ရွိတယ္။ ကိုယ္ကအနီးမႈန္သူမုိ႕လက္ေတြ႕စမ္းၾကည့္တယ္။ စာဖတ္တဲ့အခါ ခပ္၀ါး၀ါးျဖစ္ေနရင္ မ်က္လံုးကိုၾကံဳ႕ျပီးၾကည့္တဲ့အခါ ပိုထင္ရွားလာတာေတြ႕ရတယ္။ အေ၀းကိုျပဴးၾကည့္ရင္ပိုျမင္ရသလားဆိုတာကေတာ့အေ၀းမႈံသမားေတြစမ္းၾကည့္ၾကေပါ့။

Len ရဲ႕ focal center နဲ႕ retina ၾကားအကြာအေ၀းကေတာ့ 14 mm ကေန 17 mm ေလာက္ရွိမယ္။ မ်က္လံုးက 3m ထက္ေက်ာ္တဲ့အကြာအေ၀းက object တစ္ခုကိုၾကည့္လုိက္တဲ့အခါ len ရဲ႕ refractive power က lowest ျဖစ္ျပီး အရမ္းနီးတဲ့ object အတြက္ဆိုရင္ေတာ့ refractive power က အရမ္း strong ျဖစ္တယ္။ ျပင္ပက object တစ္ခု retina ေပၚမွာေပၚလာတဲ့အခါ retina image ရဲ႕ size ဘယ္ေလာက္ရွိတယ္ဆိုတာတြက္ႏုိင္တယ္။ Figure 2.2 ကိုၾကည့္ပါ။ လူတစ္ေယာက္က 100 m အကြာမွာရွိတဲ့ 15 m ျမင့္ေသာ သစ္ပင္တစ္ပင္ကိုၾကည့္တယ္ဆိုပါေတာ့။

Figure 2.2

အဲဒီသစ္ပင္ဟာ retina မွာ size ဘယ္ေလာက္နဲ႕ေပၚမလဲတြက္ၾကည့္ရေအာင္။ မသိကိန္းျဖစ္တဲ့ retina image ရဲ႕ size ကို x mm လုိ႕ထားရေအာင္။

(15/100) = (x/17) ေပါ့။

ဒါဆိုရင္ x = 2.55 mm

100 m အကြာမွာရွိတဲ့ 15 m ျမင့္တဲ့သစ္ပင္ဟာ မ်က္လံုးထဲမွာေတာ့ 2.55 mm သာရွိပါေတာ့တယ္။ အကြာအေ၀းကို 1000 m လို႕ထားၾကည့္ပါ။ retina image ဟာ 0.255 mm ျဖစ္သြားပါမယ္။ ဒါကိုၾကည့္ျခင္းအားျဖင့္ object ေတြဟာ ေ၀းသြားေလေလ retina image size ဟာ ေသးသြားေလေလေပါ့။ အရမ္းေ၀းသြားရင္ေတာ့ ျမင္ကုိမျမင္ရေတာ့ဘူးဆိုတဲ့အေျခအေနကိုေရာက္သြားပါတယ္။

အဲဒီ retina image ဟာ fovea မွာ လာေပၚတာပါ။ ျမင္သိမႈဘယ္လိုျဖစ္လာလည္းဆိုေတာ့ light receptors ေတြရဲ႕ excitation ကေနျဖစ္ေစပါတယ္။ သူတုိ႕က radiant energy ကေန electrical impulse အျဖစ္ကိုေျပာင္းေပးတာပါ။ ဒီ electrical impulse ေတြကိုမွ brain ကေနျပီး decode ျပန္ုလုပ္ယူပါတယ္။ အဲဒီမွာ “ျမင္တယ္” ဆိုတာျဖစ္လာတာဘဲ။

2.1.3 Brightness Adaptation and Discrimination

Digital image ေတြကို display လုပ္တဲ့အခါ discrete set of brightness point ေတြအျဖစ္နဲဲ႕ျပပါတယ္။ ဒါေၾကာင့္မ်က္လံုးရဲ႕ brightness level အမ်ိဳးမ်ိဳးကိုခြဲျခားႏုိင္စြမ္းဟာ image processing result ေတြကိုထုတ္တဲ့အခါမွာထည့္သြားစဥ္းစားရမယ့္အခ်က္တစ္ခုျဖစ္ပါတယ္။

Human visual system က adapt လုပ္ႏုိင္တဲ့ light intensity level ရဲ႕ range ကအေတာ္ေလးေတာ့မ်ားပါတယ္။ (scotopic (မွိန္) threshold ကေန glare (စူးရွေတာက္ပ) limit အထိ)

Subjective brightness (ဆိုလိုတာက Human visual system ကျမင္ႏိုင္တဲ့ brightness) ဆိုတာက မ်က္လံုးကို က်ေရာက္လာတဲ့ light intensity ရဲ႕ logarithmic function တစ္ခုဘဲျဖစ္ပါတယ္။ Figure 2.3 မွာၾကည္ပါ။ ပံုက light intensity နဲ႕ subjective brightness တုိ႕ရဲ႕ plot ပါ။ long solid curve က visual system က adapt လုပ္ႏုိင္တဲ့ intensity range ကိုျပတာပါ။ (scotopic threshold ကေန glare limit အထိရွိပါတယ္။)

2.1.3 Brightness Adaptation and Discrimination

Brightness Adaptation

          Digital image ေတြကို display လုပ္တဲ့အခါ discrete set of brightness point (intensity) ေတြအျဖစ္နဲဲ႕ျပပါတယ္။ ဒါေၾကာင့္မ်က္လံုးရဲ႕ brightness level အမ်ိဳးမ်ိဳးကိုခြဲျခားႏုိင္စြမ္းဟာ image processing result ေတြကိုထုတ္တဲ့အခါမွာထည့္သြားစဥ္းစားရမယ့္အခ်က္တစ္ခုျဖစ္ပါတယ္။

          Human visual system က adapt လုပ္ႏုိင္တဲ့ light intensity level ရဲ႕ range ကအေတာ္ေလးေတာ့မ်ားပါတယ္။ light intensity level ေပါင္း 10¹⁰ ေလာက္ကို adapt လုပ္ႏုိင္တာပါ။ (scotopic (မွိန္) threshold ကေန glare (စူးရွေတာက္ပ) limit အထိ) ပံုမွာဆိုရင္ -6 to 4 ေပါ့။

          Subjective brightness ဆိုုတာ မ်က္လံုးကို က်ေရာက္လာတဲ့ light intensity ရဲ႕ logarithmic function တစ္ခုဘဲျဖစ္ပါတယ္။ Figure 2.3 မွာၾကည္ပါ။ ပံုက light intensity နဲ႕ subjective brightness တုိ႕ရဲ႕ plot ပါ။ long solid curve က visual system က adapt လုပ္ႏုိင္တဲ့ intensity range ကိုျပတာပါ။ (scotopic threshold ကေန glare limit အထိရွိပါတယ္။ -6 to 4) scotopic ကေန photopic vision ကိုေျပာင္းသြားတာက -3 ကေန -1 ၾကားမွာပါ။ ပံုမွာ first line and second line ၾကားေပါ့။ photopic vision ခ်ည္းသက္သက္ရဲ႕ range ကေတာ့ 10⁶ ရွိပါတယ္ (ပံုမွာ second red line and third red line ၾကား)။

          Human visual system က အဲေလာက္မ်ားျပားတဲ့ range ကိုေတာ့ တျပိဳင္နက္ operate မလုပ္ႏုိင္ပါဘူး။ သူ႕ရဲ႕ overall sensitivity ကို အေျပာင္းအလည္းလုပ္ရပါတယ္။ အဲဒါကိုဘဲ brightness adaptation လုိ႕ေခၚတာပါ။ total adaptation range နဲ႕ႏႈိင္းစာမယ္ဆိုရင္ မ်က္လံုးက တျပိဳင္နက္ discriminate (ခြဲျခားသိျမင္ႏုိင္စြမ္း) လုပ္ႏုိင္တဲ့ intensity level ကအေတာ္ေလးနည္းပါတယ္။ human visual system ရဲ႕ current sensitivity level ကုိ brightness-adaptation level လုိ႕ေခၚပါတယ္။ ပံုမွာဆိုရင္ B_a ဆိုတဲ့ point မွာရွိတဲ့ brightness-adaptation level က (B_b to B_a) ျဖစ္ပါတယ္။

Figure 2.3

Discrimination

မ်က္လံုးရဲ႕ contrast sensitivity ကို တုိင္းတာဘုိ႕အတြက္ Brightness B ရွိတဲ့ မ်က္ႏွာျပင္တစ္ခုရဲ႕အလယ္မွာ brightness (I + I) ရွိတဲ့ စက္၀ုိင္းတစ္ခုကိုဆြဲျပီး ၾကည့္ခုိင္းရေအာင္။ I တန္ဘုိးကို 0 ကေနစျပီး စက္၀ုိင္းေလးကိုျမင္ေလာက္တဲ့အထိ တိုးသြားရေအာင္။ ျမင္သာတဲ့အထိဘဲေပါ့။ I/I ကို Weber ratio လုိ႕ေခၚပါတယ္။ I/I က 2% ေလာက္ျဖစ္ပါလိမ့္မယ္။ ဆိုလုိတာက background ရဲ႕ brightness 100 ျဖစ္ျပီးသူ႕ေပၚကစက္၀ိုင္းရဲ႕ brightness က 2 ေလာက္ဆိုရင္ လူက အဲဒီစက္၀ိုင္းကိုျမင္ပါျပီ။ စက္၀ိုင္းကို background ကေနခြဲျပီးျမင္ႏုိင္ျပီေပါ့။

Figure 2.4

တကယ္လုိ႕ I/I ရဲ႕တန္ဘိုးက နည္းတယ္ဆိုရင္ intensity နည္းနည္းေလးေျပာင္းသြားတာကိုေတာင္ သိႏုိင္တယ္။ ခြဲျခားႏုိင္တယ္။ brightness discrimination ေကာင္းတယ္လုိ႕သတ္မွတ္ႏုိင္ပါတယ္။ I/I ရဲ႕တန္ဘိုးမ်ားရင္ေတာ့ poor brightness discrimination လုိ႕သတ္မွတ္ရမွာေပါ့။ average ကေတာ့ 2 ေလာက္ရွိၾကပါတယ္။ ဒါက မ်က္လံုးနဲ႕ခ်ည္းဆိုင္တာမဟုတ္ဘူး။ background illumination နဲ႕လည္းဆိုင္ပါတယ္။ background က low level illumination ျဖစ္ရင္ brightness discrimination မေကာင္းဘူး။ higher level illumination background ဆိုရင္ေတာ့ brightness discrimination ပိုေကာင္းလာပါတယ္။

Figure 2.4 က constant brightness background အတြက္ျဖစ္ျပီး Figure 2.5 မွာေတာ့ background ရဲ႕ brightness ကိုအမ်ိဳးမ်ိဳးေျပာင္းျပီးတြက္ထားတဲ့ contrast sensitivity ကိုျပထားပါတယ္။ Figure 2.5 ကိုေတာ့ စာအုပ္ထဲသာျပန္ၾကည့္ၾကပါေတာ့။

Mach Bands

Figure 2.6 (a) မွာျပထားတဲ့ပံုကိုၾကည့္မယ္ဆိုရင္ (ပံုကိုစာအုပ္ထဲကဘဲၾကည့္ပါေတာ့) intensity value ကိုတျဖည္းျဖည္းမသိမသာတိုးထားတဲ့ပံုပါ။ ဘယ္ကေနညာကိုတုိးထားတာပါ။ ဒါေပမယ့္ မ်က္လံုးကဒါကို ဘယ္ေနရာမွာအေျပာင္းအလည္းျဖစ္သြားတယ္ဆိုတာ ခြဲျခားျပီးသိႏုိင္ပါတယ္။ B နဲ႕အမွတ္အသားျပထားတဲ့ေနရာမွာ Brighter stripe ကိုျမင္ျပီး D နဲ႕ျပထားတဲ့ေနရာမွာ Darker stripe ကိုျမင္ႏုိင္ပါတယ္။ အဲဒီ stripe ေတြကို Mach Bands လို႕ေခၚတာပါ။ 1865 မွာ ဒါကိုစျပီးေဖာ္ထုတ္ခဲ့တဲ့ Ernst Mach ရဲ႕အမည္ကိုေပးထားတာပါ။ Figure 2.6 မွာေတာ့ band ေတြအမ်ားၾကီးေတြ႕ပါလိမ့္မယ္။

Figure 2.6 (b)
2.2 An Image Model

Image ကို f(x,y) ဆိုတဲ့ two-dimensional light intensity function နဲ႕ေဖာ္ျပတယ္ဆိုတာေျပာျပျပီးပါျပီ။ spatial coordinates (x,y) မွာရွိတဲ့ f ရဲ႕ value or amplitude ဆိုတာဟာ အဲဒီ point မွာရွိတဲ့ image ရဲ႕ brightness or intensity ကိုေျပာတာပါ။ light ဆိုတာ energy တစ္ခုျဖစ္တဲ့အတြက္ f(x,y) ဟာ nonzero ျဖစ္ရမယ္။ finite ျဖစ္ရပါမယ္။

0<f(x,y)<∞ (2.2.1)

Image ကို object ေတြကေန ျပန္လာတဲ့ အလင္းေတြနဲ႕ဖဲြ႕စည္းထားတာပါ။ f(x,y) မွာ component ၂ခုပါ၀င္ပါတယ္။ object ေပၚက်ေရာက္တဲ့အလင္းရဲ႕ ပမာဏ (illumination) ရယ္ အဲဒီ object ကျပန္လာတဲ့ အလင္း (reflectance) ရယ္ေပါ့။ illumination ကို i(x,y), reflectance ကို r(x,y) လုိ႕သတ္မွတ္တယ္။ ဒါေၾကာင့္ f(x,y) ဆိုတာဟာ

f(x,y)=i(x,y)r(x,y) (2.2.2)

where 0<i(x,y)<∞ (2.2.3) and

0<r(x,y)<1 (2.2.4)

reflectance ဟာ 0 နဲ႕ 1 ၾကားရွိျပီး total absorption ဆိုရင္ 0 ေပါ့။ ဆိုလိုတာက object ေပၚအလင္းက်ေရာက္ျပီး လံုး၀အလင္းမျပန္တဲ့ object မ်ိဳးဆို r(x,y)=0 ေပါ့။ အလင္းအရမ္းျပန္တဲ့ဟာမ်ိဳးဆိုရင္ေတာ့ r(x,y)=1 ေပါ့။ မွန္လိုဟာမ်ိဳးဆိုသစ္သားထက္ အလင္းျပန္တာမ်ားတာေပါ့။ အမဲေရာင္ ကတၱီပါစရဲ႕ reflectance r(x,y) ဟာ 0.01 ေလာက္ျဖစ္ပါတယ္။ steel အတြက္ကေတာ့ 0.65, အျဖဴေရာင္နံရံဆိုရင္ 0.8, ေငြသတၱဳျပားကေတာ့ 0.9, snow ကေတာ့ 0.93 ပါ။ ဒါေၾကာင့္ အမဲေရာင္ပစၥည္းေတြကရွာရခက္ျပီး အေရာင္ေတာက္တဲ့ ပစၥည္းေတြကရွာရလြယ္ပါတယ္။ reflectance မ်ားလို႕ပါ။

i(x,y) ကိုေတာ့ object ေတြရဲ႕ characteristics နဲ႕သတ္မွတ္ပါတယ္။ ေနသာတဲ့ေန႕မ်ိဳးမွာဆိုရင္ ေနကထုတ္လႊတ္လိုက္တဲ့ illumination ဟာ 9000 foot-candles သာသာရွိပါတယ္။ တိမ္ထူတဲ့ေန႕ေတြမွာဆိုရင္ေတာ့ 1000 foot-candles ေလာက္ထိေလ်ာ့သြားတတ္ပါတယ္။ တိမ္ကင္းစင္တဲ့ လျပည့္ညေတြမွာဆိုရင္ေတာ့ illumination ဟာ 100 foot-candles ေလာက္ရွိပါတယ္။

Coordinate (x,y) မွာရွိတဲ့ monochrome image f ရဲ႕ intensity ကို အဲဒီ point မွာရွိတဲ့ image ရဲ႕ light ရဲ႕ gray level (l) လုိ႕ေခၚၾကရေအာင္။ l ရဲ႕ value ဟာ

L_min≤ l ≤ L_max (2.2.5)

သီိအိုရီအရဆိုရင္ L_minရဲ႕ တန္ဘိုးဟာ positive ဘဲျဖစ္ရမယ္။ L_max ကလည္း finite ျဖစ္ရပါမယ္။ လက္ေတြ႕မွာေတာ့

L_min = i_min r_min

L_max = i_max r_max

Indoor image processing application ေတြအတြက္ဆိုရင္ L_min ≈ 0.005 ျဖစ္ျပီး L_max≈ 100 ေလာက္ရွိပါမယ္။ L_minနဲ႕ L_maxၾကားက interval ကိုေတာ့ gray scale လို႕ေခၚပါတယ္။ ဒီ interval ကို interval [0,L] အျဖစ္ ေျပာင္းျပီးသံုးေလ့ရွိပါတယ္။ အမဲေရာင္အတြက္ဆို l = 0 ေပါ့။ l = L ဆိုရင္ေတာ့ အျဖဴေရာင္ေပါ့။ ၾကားက တန္ဘုိးေတြကေတာ့ black နဲ႕ white ၾကားက shaded gray ေတြေပါ့။

stonesoflife

Thursday, December 27, 2012

Computer Graphics and Applications (B.E. IT)

1 comment:

က်မရဲ႕အျခားဆိုက္မ်ားသုိ႕ဒီေနရာကေနအလြယ္တကူသြားႏုိင္ပါတယ္။

ဒီ Page ကိုLike လုပ္ႏုိင္ပါတယ္။

က်မနဲ႕ဆက္သြယ္ႏုိင္ပါတယ္။

လာလည္သူအားလံုးကိုေက်းဇူးတင္ပါတယ္။

ေျပာခ်င္တာေတြရွိရင္ဒီေနရာကေနေျပာခဲ့ႏိုင္ပါတယ္။