Computer Science Thesis

Graduation Project model-EN-2021/0 Process Management Documents Cover-2021.doc

2021 Graduation Project
Process Management Documents

Topic: Image Processing and File Manipulation using VB.NET (IAV File Converter)

School: Information & Electronic Engineering

Major: Computer Science & Technology

Class: CST 1xx

Student ID: 9xxxxxxxxxx

Name: xxxxxxxx xxxxxxxx

Advisor: xxxx xxxxxxxxxxx

May 2021

Zhejiang University of Science & Technology

Graduation Project

Process Management Documents

Summary

1. Assignment Book
2. Literature Review
3. Thesis Proposal
4. Thesis Proposal Defense Notes
5. Work Record Sheet
6. Work Instruction Sheet
7. Advisor’s Evaluation Sheet
8. Academic Teacher’s Evaluation Sheet
9. Graduation Project Defense Notes
10. Evaluation sheet of Defense Committee
11. Mid-term Check Sheet

Graduation Project model-EN-2021/1 Assignment Book-2021.doc

2021 Graduation Project (Assignment Book) Zhejiang University of Science & Technology

Zhejiang University of Science & Technology

Graduation Project (Thesis)

Assignment Book

(2021)

Topic: Image Processing and File Manipulation using VB.NET (IAV File Converter)

School: Information & Electronic Engineering

Major: Computer Science & Technology

Class: CST xxx

Student ID: xxxxxxxxxxx

Name: xxxxxxxxxxxxxxxxx

Advisor: xxxxxxxxxxxxxxx

Release Date:

1 Main Tasks and Targets

Design and implement an image, audio and video (IAV) converter using VB.NET. The main tasks of the system are to perform operations such as filtering and transformation on digital images using computer-based algorithms, and to extract text from images using Optical Character Recognition (OCR). The system focuses on image processing; as a bonus, it also provides audio and video conversion. The sole target of this system is to show that VB.NET is a language in which image processing can be further researched and advanced.

2 Main Contents and Basic Requirements

1. Image filtering and transformation: complete the algorithms involved in these processes.
2. OCR: design and use the functions associated with the recognition.
3. Audio and video conversion: link FFmpeg to Visual Studio and create algorithms to enable conversions.
4. Finish the framework and function design of the Converter.
5. Design the function modules and draw the corresponding data flow diagrams.
6. Design the user interface and application flow, draw the program flowchart and write the code.
7. Debug and test the software.

3 Work Schedule

Time | Work Contents
Jan. 1-Mar. 6, 2021 | Collect literature materials; complete Literature Review and Thesis Proposal
Jan. 8-Mar. 13, 2021 | Finish Requirements Analysis and General System Design
Mar. 14-Apr. 17, 2021 | Finish Designing and Coding of Various Modules
Apr. 18-30, 2021 | System Debugging and Testing
May 2-13, 2021 | Write Graduation Thesis
May 14-20, 2021 | Examine and Evaluate Graduation Design (Thesis)
May 21-27, 2021 | Graduation Project Defense

4 Main Reference Literatures

1. Jain, Fundamentals of Digital Image Processing, Prentice-Hall Inc., 1982.
2. Gonzalez, R.C. and Woods, R.E., Digital Image Processing (3rd Edition), Prentice-Hall, Inc., 2007.
3. Castleman, K.R., Digital Image Processing. Second ed. 1996, Englewood Cliffs, New Jersey: Prentice-Hall.
4. Russ, J.C., The Image Processing Handbook. Second ed. 1995, Boca Raton, Florida: CRC Press.
Advisor:
Date:
Dean:

Date:


Graduation Project model-EN-2021/2 Literature Review 2021.doc

2021 Graduation Project (Literature Review) Zhejiang University of Science & Technology

Zhejiang University of Science & Technology

Graduation Project (Thesis)

Literature Review

(2021)

Topic: Image Processing and File Manipulation using VB.NET (IAV File Converter)

School: Information & Electronic Engineering

Major: Computer Science & Technology

Class: CST xxx

Student ID: xxxxxxxxxxx

Name: xxxxxxxxxxxxxxxxxx

Advisor: xxxxxxxxxxxxxxxxx

Finish Date: xx

Literature Review for Image Processing and File Manipulation using vb.net(IAV File Converter)

1 Introduction

In this literature review, we study and analyze the IAV File Converter, which is designed to read text in a picture using Optical Character Recognition (OCR) and present the result to the user, to filter images and convert them to different formats and, as a bonus, to convert audio and video to different formats. This project is mainly related to image processing. Section 1.1 introduces image processing, and Section 1.2 presents image processing methods. Section 2 looks at the study status and trends of the IAV File Converter: Section 2.1 presents the system structure and functions, Section 2.2 describes the current status, and Section 2.3 covers developing trends. Section 3 summarizes the review.

1.1 Image Processing

Image processing consists of converting an image into digital form and then performing operations on it, such as extracting its content or the information in it. It is also used for object recognition. A digital image is an array of square picture elements, or pixels, arranged in columns and rows. There are colour images, grayscale images and binary images. Colour images can be converted to grayscale to facilitate the extraction of some information within the image. A grayscale image is typically an 8-bit image, in which each pixel has an assigned intensity between 0 (black) and 255 (white). A binary image is an image in which pixels can have only two values: black (0) or white (1). The most common image formats are GIF, JPEG, TIFF, PNG, PS, and PSD.
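The pixel model described above can be illustrated with a minimal sketch. It is written in Python for brevity and is not the project's VB.NET code. The luminance weights (0.299, 0.587, 0.114) are the commonly used ITU-R BT.601 values, and the threshold of 128 and the function names are assumptions chosen for this example.

```python
# Illustrative sketch only: a tiny image held as rows of (R, G, B) pixels.
# The weights are the common BT.601 luminance weights; the threshold of 128
# is an arbitrary assumption, not a value taken from the project.

def to_grayscale(image):
    """Map each (R, G, B) pixel to a single intensity in the range 0..255."""
    return [
        [round(0.299 * r + 0.587 * g + 0.114 * b) for (r, g, b) in row]
        for row in image
    ]


def to_binary(gray, threshold=128):
    """Map each intensity to 0 (black) or 1 (white)."""
    return [[1 if value >= threshold else 0 for value in row] for row in gray]


# A 2 x 2 colour image: black, white, pure red, pure blue.
image = [
    [(0, 0, 0), (255, 255, 255)],
    [(255, 0, 0), (0, 0, 255)],
]

gray = to_grayscale(image)   # [[0, 255], [76, 29]]
binary = to_binary(gray)     # [[0, 1], [0, 0]]
```

Pure red and pure blue both fall below the threshold, which shows why the choice of threshold matters when a colour image is reduced to two values.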

1.2 Image processing methods

There are two types of methods used for image processing:

– Analog image processing or visual techniques of image processing: used for printouts and photographs.

– Digital image processing: processing digital images by using a computer. This technique includes three phases: pre-processing, enhancement and display, and information extraction.

Let us briefly define each of those phases:

* Image pre-processing or image restoration consists of correcting the image from different errors, noise and geometric distortions.

* Image enhancement improves the visual aspect of the image, after the correction of errors, to facilitate the perception or interpretability of information in the image.

* Information extraction utilizes the computer’s decision-making capability to identify and extract specific pieces of information or pixels.

The image processing techniques used in the system support text extraction from images and image filtering; audio and video conversion is provided to users as a bonus.

2 Study Status and Trends of IAV File Converter

Digital image processing is an active field because it improves pictorial information for human interpretation and supports the processing of image data for storage, transmission, and representation for machine perception. Image processing is a technique for enhancing raw images received from cameras and sensors placed on satellites, space probes and aircraft, or pictures taken in day-to-day life, for various applications. The field has improved significantly in recent times and has extended to various areas of science and technology. Image processing mainly deals with image acquisition, image enhancement, image segmentation, feature extraction and image classification.

The main purpose of the system is to show how VB.NET, compared with other programming languages, can contribute to development in this field and in related areas.

2.1 Structure and Functions

The system has the following structure and functions:

1. Image Filter

The filter section offers the user three (3) options: grayscale, invert and black/white.

2. Image Converter

The image converter supports five (5) formats: Joint Photographic Experts Group (JPEG), Bitmap (BMP), Portable Network Graphics (PNG), Graphics Interchange Format (GIF) and Tagged Image File Format (TIFF). A user can convert an image to any of these formats.

3. Optical Character Recognition (OCR)

This feature allows a user to read and extract text from an image using Emgu CV.

4. Audio Converter

This feature enables audio conversion to WAV, MP3 and WMA using the FFmpeg package as an extension.

5. Video Converter

The video converter changes video formats to FLV, MP4, MPEG, MOV, AVI and WMV. It also enables the user to convert music videos to MP3 and WAV.

Terminologies

Optical Character Recognition (OCR):

This project uses this feature, extracting text from an image, allowing the user to copy the text from a text box.
OCR is a technology that recognizes text within a digital image. It is commonly used to recognize text in scanned documents, but it serves many other purposes as well.

OCR software processes a digital image by locating and recognizing characters, such as letters, numbers, and symbols. Some OCR software will simply export the text, while other programs can convert the characters to editable text directly in the image.

RGB

The RGB color model is an additive color model in which red, green and blue light are added together in various ways to reproduce a broad array of colors. The name of the model comes from the initials of the three additive primary colors, red, green, and blue.

The secondary colors of RGB – cyan, magenta, and yellow – are formed by mixing two of the primary colors (red, green or blue) and excluding the third color. Red and green combine to make yellow, green and blue to make cyan, and blue and red form magenta. The combination of red, green, and blue in full intensity makes white.
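The additive mixing described above can be checked with a short sketch. It is written in Python for illustration only; the `mix` helper and the 0..255 channel range are assumptions of this example rather than part of the project.

```python
# Illustrative sketch: additive mixing of RGB primaries, one channel at a time.
# Channels are assumed to be integers in the range 0..255.

RED = (255, 0, 0)
GREEN = (0, 255, 0)
BLUE = (0, 0, 255)


def mix(*colors):
    """Add colours channel by channel, clamping each channel at 255."""
    return tuple(min(255, sum(color[i] for color in colors)) for i in range(3))


yellow = mix(RED, GREEN)        # (255, 255, 0)
cyan = mix(GREEN, BLUE)         # (0, 255, 255)
magenta = mix(BLUE, RED)        # (255, 0, 255)
white = mix(RED, GREEN, BLUE)   # (255, 255, 255)
```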
Image formats

· JPEG:

JPEG is a commonly used method of lossy compression for digital images, particularly for those images produced by digital photography. The degree of compression can be adjusted, allowing a selectable tradeoff between storage size and image quality. JPEG typically achieves 10:1 compression with little perceptible loss in image quality. JPEG compression is used in a number of image file formats.
· BMP:

Bitmap (BMP) is an image file format that can be used to create and store computer graphics. A bitmap file displays small dots in a pattern that, when viewed from afar, creates an overall image. A bitmap image is a grid made of rows and columns in which each cell is given a value that fills it in or leaves it blank, thus creating an image out of the data.

· Portable Network Graphics(PNG):

Portable Network Graphics (PNG) is a raster graphics image format used on websites and Web applications. It is an open, bitmapped graphical file format built on lossless data compression. It was developed as a replacement for the Graphics Interchange Format (GIF) and is published as a World Wide Web Consortium (W3C) Recommendation.
· Graphics Interchange Format (GIF):

GIF is an image file format commonly used for images on the web and sprites in software programs. GIFs store image data using indexed color, meaning a standard GIF image can include a maximum of 256 colors. A GIF file can store more than 256 colors only by combining several image blocks, each with its own color table.
· Tagged Image File Format(TIFF):

Tagged Image File Format (TIFF) is a standard file format that is largely used in the publishing and printing industry. The extensible nature of this format allows storage of multiple bitmap images with different pixel depths, which makes it advantageous for image storage needs. When stored uncompressed or with lossless compression, it introduces no compression artifacts, so the format is preferred over others for archiving intermediate files.
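To put the 10:1 JPEG figure quoted above in concrete terms, the following Python sketch compares the size of uncompressed pixel data with a rough JPEG estimate. The 1920 x 1080 resolution, the 24 bits per pixel and the function names are assumptions chosen for the example; real file sizes also depend on headers, metadata and image content.

```python
# Illustrative arithmetic only: raw pixel data versus a rough 10:1 JPEG estimate.

def raw_size_bytes(width, height, bits_per_pixel=24):
    """Size of uncompressed pixel data, ignoring file headers and padding."""
    return width * height * bits_per_pixel // 8


def estimated_jpeg_bytes(raw_bytes, ratio=10):
    """Rough estimate using the typical 10:1 ratio quoted in the text."""
    return raw_bytes // ratio


raw = raw_size_bytes(1920, 1080)     # 6,220,800 bytes (about 6.2 MB)
jpeg = estimated_jpeg_bytes(raw)     # 622,080 bytes (about 0.6 MB)
```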

Video Formats

· FLV

An FLV file is a flash-compatible video file exported by the Flash Video Exporter plug-in (included with Adobe Animate) or other program with FLV file support. It consists of a short header, interleaved audio, video, and metadata packets. The file stores audio and video data in a similar format used by standard Flash (.SWF) files.
· MP4

An MP4 file is a multimedia file commonly used to store a movie or video clip, but may also contain subtitles or images. MP4 is short for MPEG-4 Part 14, which is a container format based on the QuickTime File Format (QTFF) used by .MOV and .QT files. MP4 files use MPEG-4 compression, a standard developed by the Moving Picture Experts Group (MPEG).
· MPEG

MPEG is a popular video format standardized by the Moving Picture Experts Group (MPEG). It is compressed using MPEG-1 or MPEG-2 compression and is often used for creating movies that are distributed over the Internet. MPEG movies are typically saved with an .MPG extension.
· MOV

A MOV file is a common multimedia container file format developed by Apple. MOV files commonly use the MPEG-4 codec for compression. MOV files may contain video, audio, timecode, and text tracks, which can be used to store and synchronize different types of media.
· AVI

An AVI file is a file saved in a multimedia container format created by Microsoft. It stores video and audio data that may be encoded in a variety of codecs such as DivX and XviD.
· WMV

A WMV file is a video file based on the Microsoft Advanced Systems Format (ASF) container format and compressed with Windows Media compression. It contains video encoded with one of Microsoft’s Windows Media Video (WMV) proprietary codecs and is similar to an .ASF file.

Audio Formats

· WAV

A WAV file is an audio file that uses a standard digital audio file format for storing waveform data. It allows audio recordings to be saved with different sampling rates and bitrates; recordings are often saved in a 44.1 kHz, 16-bit, stereo format, which is the standard format used for CD audio. WAV is short for “Waveform Audio File Format” and is pronounced “wave”.
· MP3

An MP3 file is an audio file saved in a compressed audio format developed by the Moving Picture Experts Group (MPEG) that uses “Layer 3” audio compression. It is commonly used to store music and audiobooks with near-CD quality sound (stereo, 16-bit) and roughly 1/10 the size of a .WAV or .AIF file.
· WMA

A WMA file is an audio file saved in the Advanced Systems Format (ASF) proprietary format developed by Microsoft. It contains Windows Media Audio and metadata objects such as the title, artist, album, and genre of the track. WMA files are similar to .MP3 files and primarily used for streaming music from the web.
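The CD-audio figures quoted for WAV, and the rough 1/10 size quoted for MP3, can be worked through with a short Python sketch. The function names and the one-minute duration are assumptions of this example, and the MP3 figure is only the approximate ratio stated above, not a measured result.

```python
# Illustrative arithmetic only: uncompressed PCM audio as stored in a WAV file.

def pcm_bit_rate(sample_rate_hz, bits_per_sample, channels):
    """Bits per second of uncompressed PCM audio."""
    return sample_rate_hz * bits_per_sample * channels


def pcm_size_bytes(seconds, sample_rate_hz, bits_per_sample, channels):
    """Size of the audio data in bytes, ignoring the WAV header."""
    return pcm_bit_rate(sample_rate_hz, bits_per_sample, channels) * seconds // 8


cd_rate = pcm_bit_rate(44100, 16, 2)            # 1,411,200 bits per second
one_minute = pcm_size_bytes(60, 44100, 16, 2)   # 10,584,000 bytes (about 10.6 MB)
mp3_estimate = one_minute // 10                 # 1,058,400 bytes at roughly 1/10 the size
```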

Packages Used

· ffmpeg

FFmpeg is a command-line tool that is part of an open source multimedia framework also named FFmpeg. It converts between many audio and video formats quickly, can capture media from live audio and video sources, and produces high-quality results when converting and manipulating multimedia.
· Emgu CV

Emgu CV is a cross-platform .NET wrapper for the OpenCV image processing library. It allows OpenCV functions to be called from .NET-compatible languages such as C#, VB, VC++ and IronPython. The wrapper can be compiled in Mono and run on Windows, Linux, Mac OS X, iPhone, iPad and Android devices.
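As a minimal sketch of how an application might hand a conversion to the FFmpeg command-line tool, the Python fragment below only builds the argument list; it does not run FFmpeg and is not the project's VB.NET code. The helper name `build_ffmpeg_command`, the format sets and the assumption that an `ffmpeg` executable is on the PATH are all hypothetical. The `-i` (input file) and `-y` (overwrite output without asking) options are standard FFmpeg options, and FFmpeg chooses the output format from the output file's extension.

```python
# Illustrative sketch only: build, but do not run, an FFmpeg command line.
from pathlib import Path

# Target formats taken from the audio and video converter descriptions.
AUDIO_TARGETS = {"wav", "mp3", "wma"}
VIDEO_TARGETS = {"flv", "mp4", "mpeg", "mov", "avi", "wmv"}


def build_ffmpeg_command(source, target_format, ffmpeg="ffmpeg"):
    """Return the argument list for converting `source` to `target_format`."""
    fmt = target_format.lower().lstrip(".")
    if fmt not in AUDIO_TARGETS | VIDEO_TARGETS:
        raise ValueError(f"unsupported target format: {target_format}")
    output = Path(source).with_suffix("." + fmt)
    if output == Path(source):
        raise ValueError("source is already in the target format")
    # -y overwrites an existing output file; -i names the input file.
    return [ffmpeg, "-y", "-i", str(source), str(output)]


command = build_ffmpeg_command("song.wav", "MP3")
# ['ffmpeg', '-y', '-i', 'song.wav', 'song.mp3']
```

To run the conversion, the list could be passed to `subprocess.run(command, check=True)`. Passing the arguments as a list rather than a single string avoids quoting problems with file names that contain spaces.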

2.2 Current Status

A feasibility study was carried out to assess the practicality of the proposed project. Three (3) distinct but interrelated areas were taken into consideration.

1. Technical Feasibility:

Most systems related to this project offer either OCR alone or image, audio and video conversion alone. This system has the advantage of combining all of these features, which makes it more efficient for its users. The technical feasibility study was carried out to determine whether the relevant technology is stable and established, and to ascertain that the technology chosen for software development has a large number of users who can be consulted when problems arise or improvements are required. With these features combined, a large group of users is expected. The system is therefore considered technically feasible: it is efficient, produces output quickly, accepts large amounts of data in a limited time and is easy to operate.

2. Operational Feasibility:

This study assessed the extent to which the software performs the series of steps needed to solve business problems and meet user requirements. Operational feasibility depends on human resources and involves judging whether the software will work once it is developed and remain operative once it is installed. Operational feasibility also covers the following tasks:

· Determines whether the problems anticipated in user requirements are of high priority

· Determines whether the solution suggested by the software development team is acceptable

· Analyzes whether users will adapt to new software

3. Economic Feasibility:

This system is economically feasible in the sense that the combined cost of the hardware, the software and the installation is less than the cost of other systems with separate functions. The total is therefore unlikely to be a large sum in the long run.

2.3 Developing Trends

Emerging Trends in Image Processing, Computer Vision, and Pattern Recognition, edited by Leonidas Deligiannidis and Hamid Arabnia, discusses the latest trends in imaging science, which at its core consists of three intertwined computer science fields: image processing, computer vision, and pattern recognition. Interest in each of these fields has been renewed by Big Data and data analytics initiatives, with applications as diverse as computational biology, biometrics, biomedical imaging, robotics, security, and knowledge engineering. The book provides an introduction to image processing with low-level processing techniques, computer vision fundamentals with applied examples, and pattern recognition algorithms and methodologies of value to the image processing and computer vision research communities. Drawing on the knowledge of recognized experts with years of practical experience, it covers:

– Many perspectives of image processing, from fundamental mathematical theory and sampling to image representation and reconstruction, filtering in the spatial and frequency domains, geometrical transformations, and image restoration and segmentation.

– Key application techniques in computer vision, from fundamentals to mid-level and high-level processing, including camera networks and vision, image feature extraction, face and gesture recognition, and biometric authentication.

– Pattern recognition algorithms and methodologies, including supervised and unsupervised classification algorithms, ensemble learning algorithms, and parsing algorithms.

– How to use image processing and visualization to analyze big data.

3 Summary

We can therefore say that the Phone Reader project is not a brand new topic: related works have been implemented since 2008 by major companies, whose software is expensive. A few other systems have been implemented by researchers outside large companies, but they are not widely used. Nevertheless, applications designed for visually impaired users can also serve non-native speakers when they include a translation function, and illiterate users when they generate audible results. Furthermore, camera phones often take low-quality images that are hard for an OCR engine to process efficiently. This is why, to implement this Phone Reader application, we will have to work mainly on the image processing aspect, so that the OCR engine can easily recognize characters in the processed image. The TTS engine will then be able to give more accurate results.

References

1. M Sonka, V Hlavac , RBDM Ceng, Image Processing, Analysis and Machine Vision, Journal of Electronic Imaging, 2008.
2. Castleman, K.R., Digital Image Processing. Second ed. 1996, Englewood Cliffs, New Jersey: Prentice-Hall.
3. Russ, J.C., The Image Processing Handbook. Second ed. 1995, Boca Raton, Florida: CRC Press.
4. Stockham, T.G., Image Processing in the Context of a Visual Model. Proc. IEEE, 1972. 60: p. 828 – 842.
5. Jain, Fundamentals of Digital Image Processing, Prentice-Hall Inc., 1982.
6. Gonzalez, R.C. and Woods, R.E., Digital Image Processing (3rd Edition), Prentice-Hall, Inc., 2007.
7. Groen, F.C.A., R.J. Ekkers, and R. De Vries, Image processing with personal computers. Signal Processing, 1988. 15: p. 279-291.
8. Kuwahara, M., et al., Processing of RI-angiocardiographic images, in Digital Processing of Biomedical Images, K. Preston and M. Onoe, Editors. 1976, Plenum Press: New York. p. 187-203.
9. Van Vliet, L.J., Grey-scale measurements in multi-dimensional digitized images, PhD Thesis: Delft University of Technology, 1993.
10. Serra, J., Image Analysis and Mathematical Morphology. 1982, London: Academic Press.

11. Young, I.T., et al., A new implementation for the binary and Minkowski operators. Computer Graphics and Image Processing, 1981. 17(3): p. 189-210.

Comments:

Instructor:

Date:


Graduation Project model-EN-2021/3 Thesis Proposal 2021.doc

2021 Graduation Project (Thesis Proposal) Zhejiang University of Science & Technology

Zhejiang University of Science & Technology

Graduation Project (Thesis)

Thesis Proposal

(2021)

Topic: Image Processing and File Manipulation using VB.NET (IAV File Converter)

School: Information & Electronic Engineering

Major: Computer Science & Technology

Class: CST xxx

Student ID: xxxxxxxxxxxx

Name: xxxxxxxxxxxxxxxxx

Advisor: xxxxxxxxxxxxxxx

Opening Date:

1 Background and Significance of Selected Topic

Image processing is, in general, the processing of images using mathematical operations and any form of signal processing for which the input is an image, a series of images, or a video, such as a photograph or video frame. The output of image processing may be either an image or a set of characteristics or parameters related to the image. It is among the most rapidly growing technologies today, with applications in various aspects of business. Image processing also forms a core research area within the computer science and engineering disciplines.

Digital image processing is the process of performing operations on digital images using computer-based algorithms to alter their features for enhancement or for other effects. Through image processing, essential information can be extracted from digital images. It is an important area of research in computer science. The techniques involved in image processing include transformation, classification, pattern recognition, filtering, image restoration and various other processes.

Image processing basically includes the following three steps:

1. Importing the image with an optical scanner or by digital photography.
2. Analyzing and manipulating the image, which includes data compression, image enhancement and spotting patterns that are not visible to the human eye, as in satellite photographs.
3. Output, the last stage, in which the result can be an altered image or a report based on image analysis.

Main purpose of Image Processing

The purposes of image processing fall into five groups:

1. Visualization – Observe objects that are not visible.
2. Image sharpening and restoration – Create a better image.
3. Image retrieval – Seek the image of interest.
4. Measurement of pattern – Measure various objects in an image.
5. Image recognition – Distinguish the objects in an image.

Applications of Image Processing

Following are the main applications of Image Processing:

· UV Imaging, Gamma Ray Imaging and CT scan in medical field

· Transmission and encoding

· Robot Vision

· Color Processing

· Pattern Recognition

· Video Processing

Significance

Because the system can extract text from images, convert audio and video to various formats and filter images, it will help both private and business users.

2 Basic Study Content and Main Problems to be Solved

Constraints
1. Accessibility: this software will be accessible to all Windows users.

2. Limitations: strictly for Windows users; minimum 2 GB RAM.

3 Study Methods and Technical Route

Requirements

The first step is to find out what is needed to make this system different from related systems, to outline the new system and to determine its objectives. Most other systems offer only one of the features this system has, which makes this system more complete and reliable than its competition. During this phase, data is collected; once the structured result of the research is established, the next phase begins.

Analysis

Building on the requirements phase, this phase analyzes how the system should be built, considering all aspects, and produces the requirements that must be met. It involves gathering, analyzing and validating the information collected, evaluating the alternatives and prioritizing the requirements. A System Requirement Specification (SRS) is prepared at the end of this phase; it specifies the software, hardware and functional requirements of the system.

Design

During this phase, the layout of the system is defined, together with a description of the data to be input and output. It includes the design of the application, user interfaces and system interfaces. The SRS document is transformed into a logical structure that contains a detailed and complete set of specifications that can be implemented in a programming language. The proposed design is then reviewed to ensure that the final design meets the requirements stated in the SRS document. Finally, a design document is prepared for use in the following phases.

Implementation

This is the phase in which the computer-based system is constructed from the specifications prepared in the design phase. The design is implemented as source code, and all the modules are combined in a training environment in which errors and defects are detected. A test report listing the errors is prepared following a test plan that covers test-related tasks such as test case generation, testing criteria and resource allocation for testing. Finally, the system is integrated into its environment and installed.

Testing

During system testing, the system is used experimentally to ensure that the software does not fail; in other words, that it runs according to its specifications and in the way users expect. Special test data is input for processing, and the results are examined.

Maintenance

The purpose of maintenance is to keep the new system up to standard. It covers implementing the changes that the software undergoes over time and any new requirements that arise after deployment. It also includes handling residual errors and resolving any issues that remain in the system after the testing phase.

4 System Functions

The functions of the system are as follows:

· The OCR feature should first scan the image and then display the extracted text in the text box.

· The image filter feature should first read an image from computer storage and then apply whichever of the 3 filters the user selects. It should also be able to save the filtered image directly to computer storage.

· The image converter should convert images read from the computer into the 5 different formats available.

· The audio and video converters should be able to read audio and video files, convert them to the formats available and then save the converted files.
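The filter selection described in the list above can be sketched as a lookup from the filter name chosen by the user to a per-pixel function. The sketch is in Python for illustration and is not the project's VB.NET implementation; the function names, the BT.601 luminance weights and the black/white threshold of 128 are assumptions of this example.

```python
# Illustrative sketch only: three per-pixel filters selected by name.
# Pixels are (R, G, B) tuples with channels in the range 0..255.

def grayscale(pixel):
    """Replace all three channels with the pixel's weighted intensity."""
    r, g, b = pixel
    value = round(0.299 * r + 0.587 * g + 0.114 * b)
    return (value, value, value)


def invert(pixel):
    """Replace each channel c with 255 - c."""
    return tuple(255 - channel for channel in pixel)


def black_white(pixel, threshold=128):
    """Map the pixel to pure white or pure black by its intensity."""
    value = grayscale(pixel)[0]
    return (255, 255, 255) if value >= threshold else (0, 0, 0)


FILTERS = {"grayscale": grayscale, "invert": invert, "black/white": black_white}


def apply_filter(image, name):
    """Apply the named filter to every pixel and return a new image."""
    pixel_filter = FILTERS[name]
    return [[pixel_filter(pixel) for pixel in row] for row in image]


sample = [[(255, 0, 0), (200, 220, 240)]]
gray_image = apply_filter(sample, "grayscale")   # [[(76, 76, 76), (216, 216, 216)]]
inverted = apply_filter(sample, "invert")        # [[(0, 255, 255), (55, 35, 15)]]
bw_image = apply_filter(sample, "black/white")   # [[(0, 0, 0), (255, 255, 255)]]
```

Keeping the filters in a dictionary means a further filter can be added without changing `apply_filter`, and the original image is left untouched so that it can still be saved or filtered again.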

5 Overall Study Arrangement and Schedule

Time | Work Contents
Jan. 1-Mar. 6, 2021 | Collect literature materials; complete Literature Review and Thesis Proposal
Jan. 8-Mar. 13, 2021 | Finish Requirements Analysis and General System Design
Mar. 14-Apr. 17, 2021 | Finish Designing and Coding of Various Modules
Apr. 18-30, 2021 | System Debugging and Testing
May 2-13, 2021 | Write Graduation Thesis
May 14-20, 2021 | Examine and Evaluate Graduation Design (Thesis)
May 21-27, 2021 | Graduation Project Defense

6 Main Reference Literatures

1. M Sonka, V Hlavac , RBDM Ceng, Image Processing, Analysis and Machine Vision, Journal of Electronic Imaging, 2008.
2. Castleman, K.R., Digital Image Processing. Second ed. 1996, Englewood Cliffs, New Jersey: Prentice-Hall.
3. Russ, J.C., The Image Processing Handbook. Second ed. 1995, Boca Raton, Florida: CRC Press.
4. Stockham, T.G., Image Processing in the Context of a Visual Model. Proc. IEEE, 1972. 60: p. 828 – 842.
5. Jain, Fundamentals of Digital Image Processing, Prentice-Hall Inc., 1982.
6. Gonzalez, R.C. and Woods, R.E., Digital Image Processing (3rd Edition), Prentice-Hall, Inc., 2007.
7. Groen, F.C.A., R.J. Ekkers, and R. De Vries, Image processing with personal computers. Signal Processing, 1988. 15: p. 279-291.
8. Kuwahara, M., et al., Processing of RI-angiocardiographic images, in Digital Processing of Biomedical Images, K. Preston and M. Onoe, Editors. 1976, Plenum Press: New York. p. 187-203.
9. Van Vliet, L.J., Grey-scale measurements in multi-dimensional digitized images, PhD Thesis: Delft University of Technology, 1993.
10. Serra, J., Image Analysis and Mathematical Morphology. 1982, London: Academic Press.

11. Young, I.T., et al., A new implementation for the binary and Minkowski operators. Computer Graphics and Image Processing, 1981. 17(3): p. 189-210.

Examining Opinion:

Instructor:

Date:


Graduation Project model-EN-2021/4 Process Management Documents-4~11-EN.doc

Name:

Sex:

Major: Computer Science and Technology

Grade: 17

Advisor Name:

Advisor Title:

Subject: Computer

Research Direction: Computer

Thesis Topic: Overtime Management System

Time of Defense: May 27, 2021

Place of Defense: C1-538

Advisor preliminary assessment opinions

Signature of Advisor:

Date: 2021, month x, day x

Panel Comments

Examination results
Signature of Group Leader:

Date (year, month, day):

Zhejiang University of Science and Technology Graduation Project (Thesis) Proposal Defense Report (Thesis Proposal Defense Notes)
Note: the assessment result is one of two grades: pass or fail.

Graduation Project (Thesis) Work Record Sheet

University: School of Information and Electronic Engineering

Major: Computer Science and Technology

Class: Computer Science (International Students) 131

Name: Xxxx Xxxx

Date | Graduation Design (Thesis) Main Link | Main Work Content | Signature of Instructor
2021.1.10 | Selected Topic | Discuss, study and finalize the topic of the graduation design with the instructor. |
2021.3.10 | Data Collection | Consult relevant literature; fully demonstrate the rationality, feasibility and practicability of the topic. |
2021.3.18 | Opening Report | According to the design goals and specific requirements of the selected topic, determine the research methods and technical route. |
2021.5.5 | First Draft | On the basis of the completed software system development, write the first draft of the graduation thesis. |
2021.5.11 | Second Draft | Following the instructor's review, correct the deficiencies of the software system and the thesis in accordance with the recommendations. |
2021.5.16 | Final Draft | Check the operation of the whole software system, and improve the graduation thesis and related materials. |
2021.5.27 | Defense | Complete the graduation project defense. |

Graduation Project (Thesis) Work Instruction Sheet

School: School of Information and Electronic Engineering
Major: Computer Science and Technology
Class: Computer Science (International Students) 131

Name: Xxxx Xxxx

Advisor: XXX

Plan Process Table

Date | Work Content | Completion
2021.1.14-2021.3.10 | Literature review and report | Complete
2021.3.11-2021.3.31 | Requirement analysis and project module design | Complete
2021.4.1-2021.4.27 | Project module function algorithm design | Complete
2021.4.28-2021.5.17 | Project module coding | Complete
2021.4.18-2021.5.10 | Project module debugging, testing and improvement | Complete
2021.5.11-2021.5.20 | Writing graduation thesis | Complete

Instructor's Checks

Date | Checking Content | Instructor Signature
2021.3.10 | Check completion of the report and literature review |
2021.3.31 | Check the requirement analysis and project module design; put forward improvement suggestions |
2021.4.15 | Check the completion of the project modules and solve technical problems |
2021.4.25 | Check project module debugging and testing; put forward improvement suggestions |
2021.5.9 | Check the progress of thesis writing and propose amendments |

Student Leave Record

Date | Reason for Leave | Instructor Signature

Graduation Project (Thesis) Advisor’s Evaluation Sheet

Graduation Project (Thesis) Topic: Image Processing and File Manipulation using vb.net (IAV File Converter)

School: School of Information and Electronic Engineering

Major: Computer Science and Technology

Name: Xxxx Xxxx

Student No.: 9120212009

Advisor’s Comments:

Whether or not you agree:

Proposed grade(Percentage system):

Advisor’s Signature:

2021.5.xx

Graduation Project (Thesis) Academic Teacher’s Evaluation Sheet

Graduation Project (Thesis) Topic: Image Processing and File Manipulation using vb.net (IAV File Converter)

School: School of Information and Electronic Engineering

Major: Computer Science and Technology

Name: Xxxx Xxxx

Student No.: 9120212009

Advisor’s Comments:

Whether or not you agree:

Proposed grade(percentage system):

Advisor’s signature:

Date (year / month / day):

Graduation Project (Thesis) Defense Notes

Name: Xxxx

Sex:

Major: Computer Science

Class: Computer Science (International Students) 131

Advisor’s Name:

Title: Associate professor

Subject: Computer Science

Research Direction: Computer Science

Thesis Topic: Image Processing and File Manipulation using vb.net (IAV File Converter)

Defense Team Members:

Time of Defense: 2021.5.27

Place of Defense: C1-538

Note-taker:

Key points of the student's statement:

Questions raised by teachers and key points of the student's answers:

Team Leader Signature:

2021.5.27

Note: The defense time should be moderate (about 15 minutes for the student's report and 15 minutes for questions and answers), and the defense record should be accurate and complete (more than 100 words).

Graduation Project (Thesis) Evaluation Sheet of Defense Committee

Graduation Project (Thesis) Topic: Image Processing and File Manipulation using vb.net (IAV File Converter)

Name: Xxxx Xxxx

Student No.: 913xxxxxxx

Major: Computer Science and Technology

Class: Computer Science (International Students) 131

Advisor’s Comments:

Grade

Group Leader Signature:

2021.5.27

Comprehensive comments of defense committee:

Grade

Chairman of Defense Committee Signature:

Dean Signature:

Academic Seal:

Date (year / month / day):

Note: “Grade” is one of five levels: excellent, good, medium, pass and unqualified.

ZHEJIANG UNIVERSITY OF SCIENCE AND TECHNOLOGY

Student Graduation Project (Thesis) Scoring Standard

Defense Team Scoring Criteria:

No. | Main content of defense examination | Standard Score
1 | Clear thinking and clear expression. | 5
2 | The discussion is correct, thorough and logical. | 10
3 | Questions are answered with a theoretical basis and clear basic concepts; the main questions are answered accurately and in depth. | 15
4 | Questions are answered quickly, with understanding or original opinions. | 10

Note: 1. Each item in the table should be graded, with the score measured to 0.5 points.
2. Each school develops its own defense team grading table.

Grading criteria of the College Defence Committee:

Serial No. | Item | Main content of assessment | Standard Score
1 | Mastery and application of basic knowledge and skills | Ability to apply basic theories and specialized knowledge; to read and analyze foreign and Chinese materials; to conduct independent experiments, designs and scientific research | 20
2 | Proposal report, design (thesis), instructions and translation work | The applicability and advanced nature of the subject, the feasibility of the guiding ideology, the correctness of the theoretical basis, the rationality of the scheme, the correctness of calculation and experimental results; quality of drawings and works, written instructions and translation documentation | 30
3 | Regular work performance | Attendance, attitude to work and study, and efficiency | 10
4 | Panel defense | (according to the defense team scoring criteria) | 40

Note: Each item in the table should be graded, with the score measured to 0.5 points.

Zhejiang University of Science and Technology Graduation Project (Thesis) Mid-term Check Sheet
School: School of Information and Electronic Engineering    Class: Computer Science (International Students) 131

Student Name: Xxxx Xxxx    Advisor: XXXXX

Graduation Project (Thesis) Topic: Image Processing and File Manipulation using vb.net (IAV File Converter)

1. Topic

(1) Type of topic: academic paper ( ); engineering design (√); experimental ( ); theoretical calculation ( ); other ( ).

(2) Integration of the topic with the major: good ( ), fairly good (√);

(3) Difficulty of the topic: fairly difficult (√);

2. Literature Review and Literature Summary

(1) Literature review and information summary: good (√), fairly good ( );

Literature reviewed: 12 items;

(2) Relevance of the foreign-language literature to the subject: good (√), fairly good ( );

(3) Written preparation of the literature review: good (√), fairly good ( );

3. Opening report: good (√), fairly good ( );
4. Progress of the Graduation Project (Thesis)

(1) Proceeding according to the original plan: yes (√);

(2) Proportion of the total task completed: 50%.

(3) Progress of the graduation internship report or on-site investigation report: good (√), fairly good ( );

(4) Reporting periodic work to the instructor: good ( ), fairly good (√);

5. Work Attitude and Discipline of the Student in the Earlier Stage

(1) Instructor's evaluation of the student's attitude towards work: serious (√);

(2) Often works at the design site (laboratory, design institute, machine room, etc.): yes (√);

Average working time: 8 hours per day, 5 days per week;

(3) Absence without leave; number of days off: 0;

Main reason:

6. Overall evaluation: good (√), fairly good ( );

7. Comments and suggestions for the next stage of work: complete the project module coding, debugging and testing phase quickly.

Examiner:

2021.4.15

Graduation Project model-EN-2021/Thesis Cover-2021.doc

2021 Graduation Project

Topic Image Processing and File Manipulation using vb.net(IAV File Converter)

School
Information & Electronic Engineering

Major

Computer Science & Technology

Class

CST xxx

Student ID xxxxxxxxxxx

Name

xxxxxxxxxxxxxxxxx

Advisor xxxxxxxxxxxxxxx

Finish Date May xxx 2021

May 2021

Zhejiang University of Science & Technology


Graduation Project model-EN-2021/Thesis Project 2021.doc

2021 Graduation Project (Thesis) School of Information and Electronic Engineering ZUST

Abstract

The IAV File Converter is a small project developed in Visual Basic .NET (VB.NET). The project focuses on image processing and provides related features to its users: Optical Character Recognition (OCR), image filtering based on RGB colors, and image transformation, with audio and video conversion as a bonus.
VB.NET is being overtaken by newer languages, so this project also shows that .NET can still do what other languages are doing.
The main aim of this project is to extract important data from images. Using the extracted information, a machine can describe, interpret and understand a scene. The main point of image processing is to modify images in a desired manner; in other words, image processing is the altering and analyzing of the pictorial information in images. We come across image processing in daily life: the best example is the human brain, which processes the many images sensed by the eyes in very little time. Many techniques already exist for extracting information from images. The IAV File Converter is a collection of functions that extends the capability of most existing systems.

Keywords: OCR, filtering, transformation, RGB.

CONTENTS

Abstract
Chapter 1 Introduction
1.1 Background
1.2 Main Research
1.3 Tasks and Targets
1.4 Definitions and Terminologies used
Chapter 2 System Analysis
2.1 Objectives
2.2 Existing Systems
2.3 Requirement Definition
2.4 Proposed System
Chapter 3 System Design
3.1 Overview
3.2 System Architecture
3.3 Data Flow Diagrams
3.4 Flow Charts
3.5 UI Design
Chapter 4 System Implementation
4.1 Algorithms Design
4.2 Coding
4.2.1 Programming Language
4.2.2 Source Code
4.3 Screenshots and Functions
Chapter 5 System Test
5.1 Objectives
5.2 Pointers
5.3 Future Developments
Chapter 6 Conclusion
6.1 How far the targets were met
6.2 Overall Conclusion
Acknowledgements
References

Chapter 1 Introduction

1.1 Background

Image processing basically includes the following three steps:
· Importing the image via image acquisition tools;
· Analyzing and manipulating the image;
· Producing the output, which can be an altered image or a report based on the image analysis.
There are two types of image processing methods: analogue and digital. Analogue image processing is used for hard copies such as printouts and photographs; image analysts apply various fundamentals of interpretation when using these visual techniques. Digital image processing techniques manipulate digital images by computer. The three general phases that all types of data undergo in digital processing are pre-processing, enhancement and display, and information extraction.
Existing systems tend to focus on one area of image processing, such as image transformation, conversion or OCR. What if a single system supported several of those features and also covered other fields such as audio and video?
Most such systems are developed in Android, Java or PHP. Is there another language with comparable stability, and must every system keep to a single focus?

Advancing with new ideas is the way forward, and a new system that combines these functions contributes to that.

1.2 Main Research

Image processing is a wide field, so during the development of this project the main research focused on extracting text from images and on image filtering and transformation. Most systems offer only one of these features, so combining them in one application is worthwhile.

For grayscale conversion, an application such as Photoshop uses luma, an algorithm in which each of the R, G and B values is multiplied by a decimal weight. This project uses a different approach, the average method, so called because it simply averages the R, G and B values to obtain a gray level.
The project also has black/white and invert filters. The black/white filter builds on the same average of the RGB values, while the invert filter works directly on the individual RGB values.
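
The difference between the two grayscale approaches can be sketched for a single pixel as follows. This is a minimal illustration, not the project's source: the module and function names are hypothetical, and the weights 0.299, 0.587 and 0.114 are the common Rec. 601 luma coefficients, assumed here only for comparison (the text does not state which weights Photoshop uses).

```vbnet
Imports System

Module GrayscaleSketch
    ' Average method described in the text: (R + G + B) / 3.
    ' Integer division (\) keeps the result in the range 0..255.
    Function AverageGray(r As Integer, g As Integer, b As Integer) As Integer
        Return (r + g + b) \ 3
    End Function

    ' Weighted (luma-style) gray level.
    ' Assumption: Rec. 601 weights, used here only for comparison.
    Function LumaGray(r As Integer, g As Integer, b As Integer) As Integer
        Return CInt(Math.Round(0.299 * r + 0.587 * g + 0.114 * b))
    End Function

    Sub Main()
        ' The average method gives pure green and pure blue the same gray level.
        Console.WriteLine(AverageGray(0, 255, 0)) ' 85
        Console.WriteLine(AverageGray(0, 0, 255)) ' 85
        ' The weighted method renders green much lighter than blue.
        Console.WriteLine(LumaGray(0, 255, 0))    ' 150
        Console.WriteLine(LumaGray(0, 0, 255))    ' 29
    End Sub
End Module
```

The average method needs only integer arithmetic, which is the simplicity the project aims for; the weighted method trades that simplicity for gray levels that follow perceived brightness more closely.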

1.3 Tasks and Targets

The targets of this system are to:
· Use a different approach to filtering.
· Educate on some of the image processing techniques carried out.

· Show how VB.NET can contribute to the growth of this sector.

· Help users with their processing tasks through the features it offers.

The tasks the system should carry out are to:

· Extract text from images through the use of OCR.
· Convert images to different formats (PNG, BMP, JPEG, GIF, TIFF).
· Convert audio files to other formats, namely MP3, WAV and WMA.
· Convert videos to different formats, namely MOV, AVI, MP4, WMV, MPEG and FLV.
Note: Audio and video conversions are done through the ffmpeg package, which is called from VB.NET as an external program.

1.4 Definitions and Terminologies used

Optical Character Recognition (OCR):

This project uses OCR to extract text from an image and lets the user copy the text from a text box.

OCR is a technology that recognizes text within a digital image. It is commonly used to recognize text in scanned documents, but it serves many other purposes as well.

OCR software processes a digital image by locating and recognizing characters, such as letters, numbers, and symbols. Some OCR software will simply export the text, while other programs can convert the characters to editable text directly in the image.

RGB

The RGB color model is an additive color model in which red, green and blue light are added together in various ways to reproduce a broad array of colors. The name of the model comes from the initials of the three additive primary colors, red, green, and blue.

The secondary colors of RGB (cyan, magenta and yellow) are formed by mixing two of the primary colors (red, green or blue) and excluding the third. Red and green combine to make yellow, green and blue make cyan, and blue and red form magenta. The combination of red, green and blue at full intensity makes white, as shown in Figure 1.0.

As mentioned earlier, this project works with RGB colors and uses simple algorithms to filter images into three (3) different forms, namely grayscale, black/white and inverted color. Later we will see how the colors and algorithms are implemented.
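
The three filters named above can be sketched as per-pixel functions on System.Drawing.Color. This is an illustrative sketch rather than the project's code: the module and function names are hypothetical, and the threshold of 128 is the one given later in Section 4.1.

```vbnet
Imports System
Imports System.Drawing

Module PixelFilterSketch
    ' Average of the three channels; CInt avoids Byte overflow when adding.
    Function AverageOf(c As Color) As Integer
        Return (CInt(c.R) + CInt(c.G) + CInt(c.B)) \ 3
    End Function

    ' Grayscale: every channel is replaced by the average of R, G and B.
    Function ToGray(c As Color) As Color
        Dim grey As Integer = AverageOf(c)
        Return Color.FromArgb(grey, grey, grey)
    End Function

    ' Black/white: pixels whose average is below 128 become black, others white.
    Function ToBlackWhite(c As Color) As Color
        If AverageOf(c) < 128 Then
            Return Color.FromArgb(0, 0, 0)
        End If
        Return Color.FromArgb(255, 255, 255)
    End Function

    ' Invert: each channel is subtracted from 255.
    Function Invert(c As Color) As Color
        Return Color.FromArgb(255 - c.R, 255 - c.G, 255 - c.B)
    End Function

    Sub Main()
        ' Inverting a primary gives the secondary made of the other two: red -> cyan.
        Dim cyan As Color = Invert(Color.FromArgb(255, 0, 0))
        Console.WriteLine("{0}, {1}, {2}", cyan.R, cyan.G, cyan.B) ' 0, 255, 255
    End Sub
End Module
```

The inversion example ties back to the color model: subtracting pure red from white leaves green and blue at full intensity, which is cyan.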

Image formats

· JPEG:

JPEG is a commonly used method of lossy compression for digital images, particularly those produced by digital photography. The degree of compression can be adjusted, allowing a selectable tradeoff between storage size and image quality. JPEG typically achieves 10:1 compression with little perceptible loss in image quality. JPEG compression is used in a number of image file formats.
· BMP:

Bitmap (BMP) is an image file format that can be used to create and store computer graphics. A bitmap file displays small dots in a pattern that, when viewed from afar, creates an overall image. A bitmap image is a grid of rows and columns in which each cell is given a value that fills it in or leaves it blank, thus creating an image out of the data.
· Portable Network Graphics (PNG):

Portable Network Graphics (PNG) is a raster graphics image format widely used on websites and in web applications. It is an open, bitmapped file format built on lossless data compression. It was developed as a replacement for the Graphics Interchange Format (GIF) and is published as a World Wide Web Consortium (W3C) Recommendation.
· Graphics Interchange Format (GIF):

GIF is an image file format commonly used for images on the web and sprites in software programs. GIFs store image data using indexed color, meaning a standard GIF image can include a maximum of 256 colors. A GIF file can store more than 256 colors only by combining several image blocks, each with its own palette.
· Tagged Image File Format (TIFF):

Tagged Image File Format (TIFF) is a standard file format that is largely used in the publishing and printing industry. The extensible nature of this format allows storage of multiple bitmap images with different pixel depths, which makes it advantageous for image storage needs. Because it can store images without introducing compression artifacts, the format is preferred over others for archiving intermediate files.
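
Converting between the five image formats above can be sketched with the ImageFormat class from System.Drawing.Imaging. The sketch below is a minimal illustration under stated assumptions: the module, function names and file paths are hypothetical and are not taken from the project's source.

```vbnet
Imports System
Imports System.Drawing
Imports System.Drawing.Imaging
Imports System.IO

Module ImageConvertSketch
    ' Maps a format name to the matching ImageFormat value.
    Function FormatFromName(name As String) As ImageFormat
        Select Case name.Trim().ToUpperInvariant()
            Case "PNG"
                Return ImageFormat.Png
            Case "BMP"
                Return ImageFormat.Bmp
            Case "JPEG", "JPG"
                Return ImageFormat.Jpeg
            Case "GIF"
                Return ImageFormat.Gif
            Case "TIFF", "TIF"
                Return ImageFormat.Tiff
            Case Else
                Throw New ArgumentException("Unsupported format: " & name)
        End Select
    End Function

    ' Loads the source image and saves it again in the requested format.
    Sub ConvertImage(sourcePath As String, targetPath As String, formatName As String)
        Using img As Image = Image.FromFile(sourcePath)
            img.Save(targetPath, FormatFromName(formatName))
        End Using
    End Sub

    Sub Main()
        ' "photo.jpg" and "photo.png" are placeholder paths.
        If File.Exists("photo.jpg") Then
            ConvertImage("photo.jpg", "photo.png", "PNG")
        End If
    End Sub
End Module
```

Saving a JPEG source as PNG or TIFF does not restore detail already lost to JPEG compression; it only prevents further loss.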

Video Formats

· FLV

An FLV file is a flash-compatible video file exported by the Flash Video Exporter plug-in (included with Adobe Animate) or other program with FLV file support. It consists of a short header, interleaved audio, video, and metadata packets. The file stores audio and video data in a similar format used by standard Flash (.SWF) files.
· MP4

An MP4 file is a multimedia file commonly used to store a movie or video clip, but may also contain subtitles or images. MP4 is short for MPEG-4 Part 14, which is a container format based on the QuickTime File Format (QTFF) used by .MOV and .QT files. MP4 files use MPEG-4 compression, a standard developed by the Moving Picture Experts Group (MPEG).
· MPEG

MPEG is a popular video format standardized by the Moving Picture Experts Group (MPEG). It is compressed using MPEG-1 or MPEG-2 compression and is often used for movies that are distributed over the Internet. MPEG movies are typically saved with an .MPG extension.
· MOV

A MOV file is a common multimedia container file format developed by Apple. MOV files commonly use the MPEG-4 codec for compression. MOV files may contain video, audio, timecode, and text tracks, which can be used to store and synchronize different types of media.
· AVI

An AVI file is a file saved in a multimedia container format created by Microsoft. It stores video and audio data that may be encoded in a variety of codecs such as DivX and XviD.
· WMV

A WMV file is a video file based on the Microsoft Advanced Systems Format (ASF) container format and compressed with Windows Media compression. It contains video encoded with one of Microsoft’s Windows Media Video (WMV) proprietary codecs and is similar to an .ASF file.

Audio Formats

· WAV

A WAV file is an audio file that uses a standard digital audio file format for storing waveform data. It allows audio recordings to be saved with different sampling rates and bitrates, and is often saved in a 44.1 kHz, 16-bit, stereo format, which is the standard format used for CD audio. WAV is short for “Waveform Audio File Format” and is pronounced “wave”.
· MP3

An MP3 file is an audio file saved in a compressed audio format developed by the Moving Picture Experts Group (MPEG) that uses “Layer 3” audio compression. It is commonly used to store music and audiobooks with near-CD quality sound (stereo, 16-bit) and roughly 1/10 the size of a .WAV or .AIF file.
· WMA

A WMA file is an audio file saved in the Advanced Systems Format (ASF) proprietary format developed by Microsoft. It contains Windows Media Audio and metadata objects such as the title, artist, album, and genre of the track. WMA files are similar to .MP3 files and primarily used for streaming music from the web.

Packages Used

· ffmpeg

FFmpeg is a command-line tool that is part of an open source multimedia framework also named FFmpeg. It converts between many audio and video formats quickly, can capture media from live audio and video sources, and gives high quality results when converting and manipulating multimedia.
· Emgu CV

Emgu CV is a cross-platform .NET wrapper for the OpenCV image processing library. It allows OpenCV functions to be called from .NET compatible languages such as C#, VB, VC++ and IronPython. The wrapper can be compiled in Mono and run on Windows, Linux, Mac OS X, iPhone, iPad and Android devices.
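
For the FFmpeg entry above, calling the command-line tool from VB.NET can be sketched with System.Diagnostics.Process. This is a hedged illustration, not the project's StartConversion() function: the module, function names and all paths are hypothetical. Only the standard ffmpeg options -y (overwrite the output file) and -i (input file) are used, and FFmpeg picks the output format from the output file extension.

```vbnet
Imports System
Imports System.Diagnostics
Imports System.IO

Module FfmpegSketch
    ' Builds the argument string: -y -i "<input>" "<output>".
    Function BuildArguments(inputPath As String, outputPath As String) As String
        Return String.Format("-y -i ""{0}"" ""{1}""", inputPath, outputPath)
    End Function

    ' Runs ffmpeg.exe, waits for it to finish and returns its exit code (0 means success).
    Function RunFfmpeg(ffmpegExe As String, inputPath As String, outputPath As String) As Integer
        Dim info As New ProcessStartInfo(ffmpegExe, BuildArguments(inputPath, outputPath))
        info.UseShellExecute = False
        info.CreateNoWindow = True
        Using proc As Process = Process.Start(info)
            proc.WaitForExit()
            Return proc.ExitCode
        End Using
    End Function

    Sub Main()
        ' All three paths are placeholders.
        Dim exe As String = "C:\tools\ffmpeg.exe"
        If File.Exists(exe) AndAlso File.Exists("song.wav") Then
            Console.WriteLine("ffmpeg exit code: " & RunFfmpeg(exe, "song.wav", "song.mp3"))
        End If
    End Sub
End Module
```

Quoting both paths in the argument string keeps file names that contain spaces from being split into separate arguments.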

Chapter 2 System Analysis

2.1 Objectives

· Describe the requirements of the new system

· Establish the requirements for the new system

2.2 Existing Systems

As mentioned earlier, research on related systems showed that Photoshop, for example, uses an algorithm called luma, in which the RGB values are multiplied by decimal weights. Some may find this a difficult algorithm.

OCR is also a fast-growing field of image processing. Most systems use it to extract text from images, convert images to document formats and so on, but many of them do not let users copy the extracted text, and those that do are expensive.

Other systems have only one feature, such as video conversion, audio conversion, OCR or image filtering, because these features belong to different fields. Users find themselves buying each piece of software separately, which costs a great deal of money, and they also have to maintain, upgrade and update each one.

2.3 Requirements Definition

A good system needs to meet its users' requirements. Given the shortcomings of the existing systems described above:

· Users need a system that removes the need to buy several expensive individual programs
· They need a stable system
· A system that supports many features
· A system that is worth their time, money and business
· A system that can work offline

2.4 Proposed System

IAV File Converter is the proposed system that will meet the users' requirements. It is developed in VB.NET and runs on Windows; since a large number of these users are Windows users, the system can reach many of them. The system therefore has to:
· Work offline

· Be Windows compatible

· Encompass audio, video and image conversion

· Support image filtering

· Extract editable text from images

· Be a stable system

· Be user friendly

· Use a simple filter algorithm for developers

Chapter 3 System Design

3.1 System Overview

Users in this field have found it limiting to work with systems that offer only one specific feature. An OCR tool, for example, focuses on a single area: extracting text and converting images to text.
IAV File Converter is a system designed to help users with various features. It provides the following capabilities:
· The OCR feature should first scan the image and then display the text in the textbox.

· The image filter feature should first read an image from the computer and then apply whichever of the 3 filters the user selects. It should also be able to save the filtered image directly to the computer.

· The image converter should convert images read from the computer into the 5 available formats.

· The audio and video converters should read audio and video files, convert them to the available formats and then save the converted files.
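
The read, filter and save sequence described for the image filter can be sketched with System.Drawing.Bitmap. This is a minimal sketch that assumes the average grayscale method; the module name, function name and file paths are hypothetical and do not come from the project's source.

```vbnet
Imports System
Imports System.Drawing
Imports System.IO

Module FilterPipelineSketch
    ' Returns a grayscale copy of the bitmap using the average method.
    Function ApplyGrayscale(source As Bitmap) As Bitmap
        Dim result As New Bitmap(source.Width, source.Height)
        For y As Integer = 0 To source.Height - 1
            For x As Integer = 0 To source.Width - 1
                Dim c As Color = source.GetPixel(x, y)
                ' CInt avoids Byte overflow when the three channels are added.
                Dim grey As Integer = (CInt(c.R) + CInt(c.G) + CInt(c.B)) \ 3
                result.SetPixel(x, y, Color.FromArgb(grey, grey, grey))
            Next
        Next
        Return result
    End Function

    Sub Main()
        ' "input.png" and "output.png" are placeholder paths.
        If File.Exists("input.png") Then
            Using original As New Bitmap("input.png")
                Using filtered As Bitmap = ApplyGrayscale(original)
                    filtered.Save("output.png")
                End Using
            End Using
        End If
    End Sub
End Module
```

GetPixel and SetPixel are easy to follow but slow on large images; Bitmap.LockBits is the usual faster alternative when processing time matters.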

3.2 System Architecture

The IAV File Converter requires the following environment:

Operating system: Windows 7,8,10

RAM: 4GB(minimum)

.NET Framework 4.7.1 or higher

Processor 1GHz

Disk space(Minimum): 32bit- 4.5GB, 64bit-4.5GB

3.3 Data Flow Diagram

Figure 1.1 shows the dataflow diagram for Optical Character Recognition. The user points the OCR at the image; it scans the image and outputs the text.

Figure 1.2 shows the dataflow diagram for the image filter feature. The user selects an image and chooses a filter, and the filtered image is saved.

Figure 1.3 shows the dataflow diagram for the image converter. The user chooses an image to convert and a format, and the converter converts the image.

Figure 1.4 shows the dataflow diagram for the audio converter. The user selects an audio file and a format, and the converter converts the file.

Figure 1.5 shows the dataflow diagram for the video converter. The user selects a video file and a format, and the converter converts it to the desired format.

3.4 Flow Charts

Figure 2.1 shows the flowchart of the OCR. While on the main page, click on image and then click OCR. The OCR feature will load and next the user points the scanner to the image and presses the scan button. The text is then displayed.

Figure 2.2 shows the flowchart of the image filter. While on the main page, click on image and then click Filter. The filter feature will load and next the user selects an image and a filter. The filtered image is then displayed and the user can save it.

Figure 2.3 shows the flowchart of the image converter. While on the main page, click on Image and then click Converter. The converter loads and the user selects an image and a format; the user then clicks Convert and saves the converted image.

Figure 2.4 shows the flowchart of the audio converter. On the main page, click on Audio and then click Converter. The converter loads and the user selects an audio file, then chooses a save location and a format; the user clicks Start and the audio is saved.

Figure 2.5 shows the flowchart of the video converter. On the main page, click on Video and then click Converter. The converter loads and the user selects a video file, then chooses a save location and a format; the user clicks Start and the video is saved.
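
In the audio and video flows, the selected file, the save location and the chosen format together determine where the converted file is written. One way to combine them is sketched below with System.IO.Path; the module and function names are hypothetical, and the project may build its output path differently.

```vbnet
Imports System
Imports System.IO

Module OutputPathSketch
    ' Combines the save folder, the source file name and the chosen format
    ' into the path of the converted file.
    Function BuildOutputPath(sourceFile As String, saveFolder As String, formatName As String) As String
        Dim baseName As String = Path.GetFileNameWithoutExtension(sourceFile)
        Dim extension As String = "." & formatName.Trim().ToLowerInvariant()
        Return Path.Combine(saveFolder, baseName & extension)
    End Function

    Sub Main()
        ' Placeholder paths; on Windows this prints D:\converted\clip.mp4
        Console.WriteLine(BuildOutputPath("C:\media\clip.avi", "D:\converted", "MP4"))
    End Sub
End Module
```

Keeping this step in a small function of its own lets the audio and video converters share it.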

3.5 UI Design

Chapter 4 System Implementation

4.1 Algorithms Design

Image Filter

· Grayscale: an averaging method is used. Because the image is an RGB image, the red, green and blue values of each pixel are added together and the sum is divided by 3 to obtain the gray level.
grey = (red + green + blue) / 3

· Black/white: averaging is used here as well. The difference is that, after the gray level is computed, the pixel is set to either black or white using the decimal codes black (0, 0, 0) and white (255, 255, 255).

grey = (red + green + blue) / 3

If grey < 128 Then
    Set color (0, 0, 0)
Else
    Set color (255, 255, 255)
End If

· Invert color: this is achieved by subtracting the red, green and blue components from 255. In other words:

new_red = 255 - old_red
new_green = 255 - old_green
new_blue = 255 - old_blue

Hence the algorithm (the grey value is computed as in the other filters but is not used by the inversion itself):

grey = (red + green + blue) / 3
set color (255 - red, 255 - green, 255 - blue)

4.2 Coding

4.2.1 Programming Language

As mentioned earlier, the system was developed in Visual Basic .NET using Microsoft Visual Studio 2010. VB.NET is a simple, modern, object-oriented programming language developed by Microsoft to combine the power of the .NET Framework and the common language runtime with the productivity benefits that are the hallmark of Visual Basic.

4.2.2 Source Code

Below is the VB.NET source code for the various features the system has to offer.

Figure 4.0 shows the source code of the Optical Character Recognition (OCR) feature. This feature allows a user to read and extract text from an image using Emgu CV. Emgu CV was added to the Visual Studio project to enable the extraction of text from an image.

Figure 4.1 is the grayscale code used in the system. The method used for grayscale simply averages the values: (R + G + B) / 3.

Figure 4.2 is an RGB chart showing the basic colors used in this project (black, white and grey) and their decimal codes.

Figure 4.3 shows the five formats an image can be converted to. The class ImageFormat specifies the format of the image. The listed formats are:
· Png
· Jpeg
· Bmp
· Tiff
· Gif

Figure 4.4 shows the function StartConversion(), which encompasses the algorithm used by the ffmpeg package. In order for that to work, the ffmpeg.exe file had to be called using a string that holds the location where it is stored on the computer.

Figure 4.5 shows the different formats an audio file can be stored in.

Figure 4.6 shows the function StartConversion(), which encompasses the algorithm used by the ffmpeg package.
In order for that to work, the ffmpeg.exe file had to be called using a string that holds the location where it is stored on the computer.

Figure 4.7 shows the different formats the video files can be converted to.

4.3 Screenshots

Let’s take a look at the system screenshots illustrating how the system works.

The system starts up with a splash screen showing the name of the application, as shown in Figure 5.0.

Then comes the main page of the application. From there, a user has options to choose from. Under the Image button is a dropdown list containing Image Filter, Converter and OCR, as shown in Figure 5.1.

· Image Filter

1. The user loads an image from the computer by pressing Browse; the image then appears in the ‘Original Image’ section, as shown in Figure 5.2.
2. The user can then choose a filter by pressing one of the available filters on the bottom right. With Grayscale, for example, as shown in Figure 5.3, the filtered image is displayed in the ‘Image Result’ section.
3. The Invert filter is shown in Figure 5.4.

· Image Converter

1. The user selects an image by clicking the Browse button (Figure 5.5).
2. The image is in JPEG format, as shown in Figure 5.6.
3. After conversion, Figure 5.7 shows the converted image in PNG format.

· OCR

1. The user loads the OCR and points it at the image containing text (Figure 5.8).
2. The Scan button is pressed, as shown in Figure 5.9, and the text is extracted and displayed.

Chapter 5 System Test

5.1 Objectives

· Execute the program with the intent of finding errors
· Uncover undiscovered errors
· Make sure the program is running well

Software testing involves executing an implementation of the software with test data and examining the outputs of the software and its operational behavior to check that it is performing as required. Testing is a dynamic technique of verification and validation because it works with an executable representation of the system.
Three main types of testing were carried out: unit, integration and system testing.

Unit Testing

Unit testing focuses verification effort on the smallest unit of software design: the software component or module. Using the component-level design description as a guide, important control paths are tested to uncover errors within the boundary of the module. The relative complexity of the tests and of the errors they uncover is limited by the constrained scope established for unit testing. The unit test is white-box oriented, and the step can be conducted in parallel for multiple components.

Modules to be tested:
· OCR
· Image Filter
· Image Converter
· Audio Converter
· Video Converter

OCR

No | Test Case Description | Input | Expected Behavior | Observed Behavior | Test Result
1 | Is the OCR scanner transparent so that it can read the image? | Image | Scanner should be transparent | Scanner is transparent | Success
2 | Is text extracted from the image displayed? | Image | Text should be displayed in the textbox | Text is displayed | Success
3 | Is the text editable? | Image | Text should be editable | Text is editable | Success

Image Filter

No | Test Case Description | Input | Expected Behavior | Observed Behavior | Test Result
1 | Is the selected image displayed? | Image | Selected image should be displayed | Image correctly displayed | Success
2 | Are the filter buttons working? | Button pressed | Image should be filtered | Image filtered | Success
3 | Does the progress bar show filter progress? | Button pressed | Progress bar should show filter progress | Progress shown | Success

Image Converter

No | Test Case Description | Input | Expected Behavior | Observed Behavior | Test Result
1 | Does the picture box display the image? | Image | Image is displayed | Image correctly displayed | Success
2 | Are the image formats listed? | Combo box pressed | Image formats should be listed | Formats are correctly listed | Success
3 | Is the Convert button working? | Button pressed | Image should be converted and saved | Image converted and saved | Success

Audio Converter

No | Test Case Description | Input | Expected Behavior | Observed Behavior | Test Result
1 | Does the textbox show the selected audio file? | Audio | Correct audio file displayed | Audio correctly displayed | Success
2 | Does the save textbox display the correct location? | Audio | Location should be displayed in the textbox | Location is displayed | Success
3 | Are the audio formats listed? | Combo box pressed | Formats should be listed | Formats correctly listed | Success

Video Converter

No | Test Case Description | Input | Expected Behavior | Observed Behavior | Test Result
1 | Does the textbox show the selected video file? | Video | Correct video file displayed | Video correctly displayed | Success
2 | Does the save textbox display the correct location? | Video | Location should be displayed in the textbox | Location is displayed | Success
3 | Are the video formats listed? | Combo box pressed | Formats should be listed | Formats correctly listed | Success
4 | Does the Start button work? | Button pressed | Conversion should start | Conversion in progress | Success
5 | Does the progress bar display progression? | Button pressed | Progress bar should show conversion progress | Progress shown | Success

Integration Testing

Integration testing tests the interfaces between the modules of the software application. The different modules are first tested individually and then combined to make a system; testing the interfaces between these small units or modules is integration testing.

No | Test Case Description | Input | Expected Behavior | Observed Behavior | Test Result
1 | Is the uploaded image, audio or video file correctly displayed? | Image/audio/video | User should be able to see the correct file selected | Files are correctly displayed | Success
2 | Is text from the OCR displayed? | Image | Text should be displayed in the textbox | Text is displayed | Success
3 | Are files correctly saved? | Image/video/audio | Files should be saved correctly | Files are saved correctly | Success

System Testing

Objectives
· To check whether the software system is built according to the customer needs written in the Software Requirements Specification, that is, whether it meets both the functional and non-functional design requirements of the system.
· To find defects or bugs in all the interfaces as well as in the whole system.
· The system test cases are written from an end-to-end use perspective.

No | Test Case Description | Input | Expected Behavior | Observed Behavior | Test Result
1 | Can the user see text extracted from an image? | Image from computer memory | OCR should display the text | Text is displayed only if the OCR can read the text | Success
2 | Can the user add images? | Image from computer memory | Images should be loaded | Images are displayed when the user selects an image | Success
3 | Can the user filter an image? | Image | Images should be filtered | Images are filtered when the user chooses a filter | Success
4 | Can the user add an audio file or video file? | Audio file/video file | User should be able to add an audio or video file | Audio and video files are added when the user selects them from computer memory | Success
5 | Can the user save a converted image, audio or video file? | Image/audio/video | User should be able to save converted files | Converted files are saved when the user chooses a location | Success

5.2 Pointers
· Large images selected for conversion or filtering may take longer to process than small ones.
· The OCR feature may sometimes read text incorrectly, so make sure images are clear and use a readable font.
· Large audio and video files will usually take longer to convert.
· The system is designed to be user friendly.
· Exit from any page is possible.
· Enjoy the system features.
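The first pointer above follows from the way a per-pixel filter works: every pixel is visited once, so the work grows with width times height. The following is a minimal sketch only, assuming that the average method for grayscale means taking the mean of the red, green and blue values of a pixel. The names AverageGray and PixelVisits are hypothetical helpers introduced for illustration and are not taken from the IAV source code.

```vbnet
Imports System

Module AverageGrayscaleSketch
    ' Hypothetical helper (assumption): grey level of one pixel computed as
    ' the integer mean of its red, green and blue components.
    Function AverageGray(r As Integer, g As Integer, b As Integer) As Integer
        Return (r + g + b) \ 3
    End Function

    ' Hypothetical helper: number of pixels a single filter pass must visit.
    ' Long is used so that very large images do not overflow Integer.
    Function PixelVisits(width As Integer, height As Integer) As Long
        Return CLng(width) * CLng(height)
    End Function

    Sub Main()
        ' One pixel with R=90, G=120, B=150 becomes grey level 120.
        Console.WriteLine(AverageGray(90, 120, 150))

        ' A 640 x 480 image needs 307200 pixel visits,
        ' a 4000 x 3000 image needs 12000000.
        Console.WriteLine(PixelVisits(640, 480))
        Console.WriteLine(PixelVisits(4000, 3000))
    End Sub
End Module
```

Under these assumptions the larger image needs roughly 39 times as many pixel operations as the smaller one, which is consistent with the longer processing time noted for large images.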
5.3 Future Developments
· More image filters could be added for greater variety.
· A more advanced OCR could be developed for better accuracy.
· An online version of the system could be created to reach non-Windows users.

Chapter 6 Conclusion

6.1 Conclusion

6.1.1 How far the targets were met

The initial targets were to:
· Use a different approach to filtering.
· Educate on some of the image processing techniques carried out.
· Show how VB.NET can contribute to the growth of this sector.
· Help users with their processing tasks through the features the system offers.

With regard to these targets, a different approach to filtering, based on the average method, was written and used to perform the necessary filter operations.

6.1.2 Overall Conclusion

Overall, the project met the users' requirements: the system works offline, is Windows compatible, covers audio, video and image conversion, supports image filtering, extracts editable text from images, is stable and user friendly, and uses a filter algorithm that is simple for developers to follow. Users would be pleased to have a system that not only focuses on images but also offers audio and video conversion as a bonus. This can save them time, since only one system has to be maintained, save them money, since they invest in a single system, and make their personal and business lives easier. The project can educate on some of the image processing techniques used and illustrates how VB.NET can contribute as a programming language to the growth of the field discussed.

Acknowledgement

I would like to convey my thanks to everyone who helped me complete this project. Firstly, I acknowledge my supervisor, Mr XXX Xxxxxxx, who has also been my teacher since my first year at Zhejiang University of Science and Technology. He is firm and motivates his students in every way possible to see them succeed, all qualities of a good teacher and all the more reason why I chose him as my supervisor.
He is the one who taught me VB.NET, the very language I used to complete this project. Secondly, I would like to thank my family, especially my grandma, mom, uncles and brother, for their constant and unrelenting support towards my education and for their impartial love for me. Lastly, I thank my classmates and friends, with whom I have had a great long run. I also acknowledge my department for providing the courses and a great atmosphere that contributed to my completion.

References

1. Rao, K. Overview of image processing. Readings in Image Processing (25-26 September 2004), p. 1-7.
2. Jain, Fundamentals of Digital Image Processing. Prentice-Hall Inc., 1982.
3. RC Gonzalez, RE Woods, Digital Image Processing (3rd Edition). Prentice-Hall, Inc., 2007.
4. M Sonka, V Hlavac, RBDM Ceng, Image Processing, Analysis and Machine Vision. Journal of Electronic Imaging, 2008.
5. Dudgeon, D.E. and R.M. Mersereau, Multidimensional Digital Signal Processing. 1984, Englewood Cliffs, New Jersey: Prentice-Hall.
6. Castleman, K.R., Digital Image Processing. Second ed. 1996, Englewood Cliffs, New Jersey: Prentice-Hall.
7. Russ, J.C., The Image Processing Handbook. Second ed. 1995, Boca Raton, Florida: CRC Press.
8. Giardina, C.R. and E.R. Dougherty, Morphological Methods in Image and Signal Processing. 1988, Englewood Cliffs, New Jersey: Prentice-Hall. 321.
9. Freeman, H., Boundary encoding and processing, in Picture Processing and Psychopictorics, B.S. Lipkin and A. Rosenfeld, Editors. 1970, Academic Press: New York. p. 241-266.
10. Stockham, T.G., Image Processing in the Context of a Visual Model. Proc. IEEE, 1972. 60: p. 828-842. Groen, F.C.A., R.J. Ekkers, and R. De Vries, Image processing with personal computers. Signal Processing, 1988. 15: p. 279-291.
11. Verbeek, P.W., H.A. Vrooman, and L.J. Van Vliet, Low-Level Image Processing by Max-Min Filters. Signal Processing, 1988. 15: p. 249-258.
12. Young, I.T. and L.J. Van Vliet, Recursive Implementation of the Gaussian Filter. Signal Processing, 1995. 44(2): p. 139-151.
13. Kuwahara, M., et al., Processing of RI-angiocardiographic images, in Digital Processing of Biomedical Images, K. Preston and M. Onoe, Editors. 1976, Plenum Press: New York. p. 187-203.
14. Van Vliet, L.J., Grey-scale measurements in multi-dimensional digitized images, PhD Thesis: Delft University of Technology, 1993.
15. Serra, J., Image Analysis and Mathematical Morphology. 1982, London: Academic Press.
16. Young, I.T., et al., A new implementation for the binary and Minkowski operators. Computer Graphics and Image Processing, 1981. 17(3): p. 189-210.

Figure 1.0 RGB colors including the secondary colors, namely cyan, magenta and yellow
Figure 1.1 OCR Context DFD (the user scans an image; the OCR outputs text)
Figure 1.2 Image Filter Context DFD (the user supplies an image and a filter; the filterer returns the filtered image)
Figure 1.3 Image Converter Context DFD (the user supplies an image and a format; the converter returns the converted image)
Figure 1.4 Audio Converter DFD (the user supplies an audio file and a format; the converter returns the converted audio)
Figure 1.5 Video Converter DFD (the user supplies a video file and a format; the converter returns the converted video)

Figure 2.1 OCR Flowchart: Start; Main Page: click on Image; click on OCR; point to the image and press Scan; print output (scanned text); Stop.
Figure 2.2 Image Filter Flowchart: Start; Main Page: click on Image; click on Filter; click Browse to select an image; choose a filter; filtered image; save image; Stop.
Figure 2.3 Image Converter Flowchart: Start; Main Page: click on Image; click on Converter; click Browse to select an image; choose a format; click Convert; converted image; Stop.
Figure 2.4 Audio Converter Flowchart: Start; Main Page: click on Audio; click on Converter; click Open to select an audio file; click Save to name the file and choose its location; choose a format; click Start; converted audio; Stop.
Figure 2.5 Video Converter Flowchart: Start; Main Page: click on Video; click on Converter; click Open to select a video; click Save to name the file and choose its location; choose a format; click Start; converted video/audio; Stop.

Figure 3.0 OCR user interface
· Picture box: enables the scanning of images through a transparency property. The user points this at the image.
· Scan button: the button the user clicks to scan the image.
· Rich textbox: displays the text extracted from the image.

Figure 3.1 Image Filter user interface
· Picture box with a default image the user can see.
· Picture box that displays the filtered image.
· Filter buttons the user can click.

Figure 3.2 Image Converter user interface
· Browse button to choose an image and Convert button to convert it.
· Combo box containing the available formats.
· Picture box displaying the image chosen by the user.

Figure 3.3 Audio Converter user interface
· Start button to start the conversion and Stop button to cancel it.
· Textbox displaying the conversion progress.
· Textbox showing whether the file is converting or not.
· Combo box with the available formats.

Figure 3.4 Video Converter user interface
· Start button to start the conversion and Stop button to cancel it.
· Textbox displaying the conversion progress.
· Textbox showing whether the file is converting or not.
· Combo box with the available formats.

Figure 4.0 OCR main source code
Figure 4.1 Grayscale algorithm
Figure 4.2 RGB chart
Figure 4.3 Image formats in code
Figure 4.4 Implementation of the ffmpeg package for audio conversion
Figure 4.5 Audio formats in code
Figure 4.6 ffmpeg video algorithm
Figure 4.7 Video formats in code

Figure 5.0 IAV splash screen
Figure 5.1 IAV main page
Figure 5.2 Displaying the original image
Figure 5.3 Displaying the grayscale result
Figure 5.4 Invert result
Figure 5.5 Image Converter page
Figure 5.6 JPEG image chosen
Figure 5.7 JPEG to PNG result
Figure 5.8 Image to be scanned
Figure 5.9 Extracted text from image
