Vous êtes sur la page 1sur 4

International Journal of Computer Information Systems, Vol. 3, No.

6, 2011

Stroke Segmentation in Form Processor


Radhika.R.1,
1

PG Scholar,Dept of Computer Science and Engg, Adhiyamaan College of Engg,Hosur


radhikaravik@gmail.com

Ramya Dorai.D.2,
2

Assistant Professor,Dept of Computer Science and Engg, Adhiyamaan College of Engg,Hosur


dramyadorai@yahoo.com

Abstract: An inertial-measurementunit-based pen (IMUPEN)[1] and its associated trajectory reconstruction algorithm[1] for motion trajectory reconstruction and handwritten digit recognition applications[3].During writing or drawing movements, the inertial signals[1] generated for the movements are transmitted to a computer via the wireless module.The structure and content of this article is based on the existing process, which focuses on the capabilities and features needed by the users. In the course of our lives we fill out hundreds of forms - application forms, questionnaires, insurance claims, etc. At the same time computers have become indispensable for collecting and managing information, making the task of extracting data from filled form even more pressing. This article provides a comprehensive architectural overview of the pen base form processing through handwriting input device namely e_WriteMate(electronic_WriteMate) and segmenting the strokes. It provides the conceptual level architecture of the grouping, stroke registration and field extraction system. Stroke segmentation should be done for each form.Grouping has successfully validated the effectiveness of the e_WriteMate. Index Terms: FP, e-WriteMate, Strokes, Field Extraction

1.1 About e-WriteMate This e-WriteMate is a portable handwriting capture device based on natural handwriting as an input. Attach plain paper of any kind and use the electronic pen to capture, store and share handwritten data.Uses standard paper-no special paper required. The on-board flash memory can store up to 200 A4 Pages of handwritten forms.Rechargeable battery and LCD which indicates product status. a. These qualitative systems requirements include: 1. Usability 2. Reliability 3. Performance 4. Supportability 1.2 About e_Pen An electronic pen and a receiving unit, which captures & accumulates the data with storage capacity of upto 50 A4 pages . The Electronic Pen includes miniature IR and ultrasound transmitters. The Electronic Pen has same shape and feels as any other ordinary pen and can be easily placed in any person blouse pocket. The Receiving unit consists of two ultrasound receivers, an IR (infra-red) receiver and a processing unit that processes the signal, calculates the position of the transmitter and interfaces with the connected computing device. Also with storage capacity of 2 MB anstoring technology as strokes capture.

1.

INTRODUCTION

This document provides a high level overview and explains the whole process of Form Processor. It explains how a user will be able to swap his handwritten notes to digital data and it includes the underlying architectural process. The document provides a high-level description of the goals of the architecture, the use cases support by the system and architectural styles and components that have been selected to best achieve the use cases. This framework then allows for the development of the design criteria and documents that define the technical and domain standards in detail.

December Issue

Page 96 of 99

ISSN 2229 5208

2. BASIC BLOCKS OF FORM PROCESSOR About Form Processor Form Processor is a Windows .NET application that demonstrates form Processing Capabilities to HWR and create form Template sets which can be processed against those handwritten form sets to extract form data.Form Processor recognizes natural handwriting and converts it into digital information.Workflow is considerably improved as data are immediately available in XML/EXCEL format for further use in back-end applications.

International Journal of Computer Information Systems, Vol. 3, No. 6, 2011 Solution :The system me be fully FP compliant and thus can be deploy onto any Windows server 3.4Security: Description : Authentication and authorization mechanisms Solution :FP native security mechanisms will be Reused. Form processor has three basic components those have to execute during the processing. Remaining blocks has to customize has per the customer requirements. Those components are 1. 2. 3. 4. 5. 6. 7. Pre-processing Grouping Note registration Recognition Field Extraction[3]-Positioning the fields Post-processing Data Storage 3. CAPTURING UNIT The capturing unit provides the user filled data. On which our system will perform operation and provide form details.We are using handwriting input device(e_WriteMate) to do the data collection. But this block has to be independent of the device type.

Strokes Capturing Unit FP

Form details Storage

1. 2.

Size and Performance Time to process and Recognition: User tolerable timing.

Quality As far as FP is concerned, the following quality goals have been identified: Constraints Recognition Translation module results depend on the recognition accuracy. Grouping Constants Noise datas from the device Performance While Processing with huge amount of data. 3.1 Scalability: Description : Systems reaction when user demands increase Solution : FP application should support several workload management 3.2 Reliability, Availability: Description : Transparent failover mechanism Solution :: FP application in server model should supports load balancing 3.3Portability: Description : Ability to be reused in another environment 3.

Fig :e_WriteMate

4. STROKES Collection of X Y Coordinates in a Particular Pen Down, Pen Move and PenUp event obtained from the e-Pen.

December Issue

Page 97 of 99

ISSN 2229 5208

4.1 Stroke Segmentation This performance is related to all remaining modules will increase. In current phase we are supporting only for skew correction and registration. Skew(position) might be occurred due to improper placement of paper or/and it may be because of improper clipping of device to paper.Stroke segmentation is to separate a stroke trajectory cut off from the whole pen-tip movement by handwriting motion[3]. Since the developed pen system can be used for on a surface and in the air, the system should be able to distinguish the meaningful movements for writing a stroke and meaningless movements before and after writing the stroke.

International Journal of Computer Information Systems, Vol. 3, No. 6, 2011 4.2. Zero velocity compensation (ZVC): To correct inherent errors in inertial sensors used in IMUs[1], we have two choices, in principle. One method of eliminating these errors is to be aided by secondary sensors, such as GPS and optical sensors, usually along with a Kalman filter to minimize errors [6]. The other one is to use external measurement which relates to the trajectory parameters, such as zero velocity updates (ZUPTs), coordinate updates (CUPTs), and orientation updates [7]. Our approach corresponds to the second one.Utilizing the known information such as velocity,position, and orientation at a known time instance for correcting errors, these update methods are useful when no secondary sensors are available. Especially, ZUPTs can be obtained by just stopping the IMU and updating the current velocity to zero. It updates the velocity and position only at time instances when stopping the IMU.For the developed pen system, ZUPTs can be a good choice since stopping is necessary for stroke segmentation[2], too. An improved method based on ZUPTs, but only applicable to offline purpose, has been proposed in [8]. We will call the method Zero Velocity Compensation (ZVC), compared to ZUPTs. ZVC is applied to a time interval where the start and final values of velocities are zeros. 5. FORM PROCESSING: This is the core component in the system it behave has brain of the system. Generally, making the system to think and work by having unknown input (e.g. strokes) to achieve our goals is called Artificial Intelligence.It is science and engineering of making intelligence machine, especially intelligent system programs. So, it is related to the similar task of using computer to understand human intelligence and to reach our goals. 6. FORM DETAILS: Form Details is a powerful yet easy to use application that allows anyone to create professionally looking formswithin just a few minutes.Maintaininput boxes, text areas, labels, isolated fields, lines and much more, all with the click of a button. You can also add images, adjust colors, fonts, borders, and highlights. The program features a very simple and intuitive user interface which does not need any technical knowledge to be operated and makes this task a breeze.All thats left is to upload your Form and Form filled files in the Form Processor.

Note info

Stroke to word conversion

Word info

Word to Line conversion

Skew and translation detection

Fig: Stroke conversion block diagram


In other words, the beginning and the end of the stroke is to be found. Since it is difficult to discriminate if a trajectory segment is a part of a stroke or a meaningless movement, our approach is to put intentional stopping by a user at the beginning and the end of every stroke. Stopping is also used in the compensation of the integration error as key information in the next subsection.Detection[5] of IMU stopping seems to be easy but, in real application, it includes some obstacles such as hand trembles. To cope with hand trembles during pauses, one can simply predefine a threshold of the acceleration measurement and a holding time to keep the measurement being less than the threshold.If, however, a user turns the developed pen system to a different direction, the acceleration measured gets close to zero in a moment. At this moment, if the acceleration keeps being less than the threshold during a predefined holding time, a misdetection occurs. Thus, one cannot assert that the acceleration is almost zero only during a pause. Instead, we can assert that the acceleration is almost constant during a pause. After collection of X Y Coordinates in a Particular Pen Down, Pen Move and Pen Up event obtained from the e-Pen into the e_WriteMate then the actual process starts. The e_WriteMate device should be connected to the system and download the X Y Coordinates(i.e stroke values) and register the stroke values and then reconstruct the words by the following method named asZero Velocity Compensation(ZVC)[2].

December Issue

Page 98 of 99

ISSN 2229 5208

7.STORAGE It is the final stage, here all the forms will be stored with full details after reconstructed the strokes by using grouping, stroke segmentation[2] and field extraction that is positioning the fields. 8. DISCUSSION Actually, IMUPEN, is used only in front of the system.Instead of IMUPEN we are using e_WriteMate and e_pen to capture the stroke values and convert it to the accurate word. Stroke segmentation,Field extraction,Grouping plays an major role here. Forms should be preprocessed in cpu in the form of template. Stroke values should be downloaded into the system by connecting e_WriteMate which is captured. Positioning the fields into a corresponding fields in a form, is an challenging work to be done. This is the main advantage of this work. MATLAB is used to implement this work. e_WriteMate(electronic_WriteMate) and e_pen is really portable comparing IMUPEN[1]. Carried anywhere without any external references and used for user at any time in offline mode too. REFERENCES [1]. Jeen-Shing Wang, Member, IEEE, Yu-Liang Hsu, and JiunNan Liu,An Inertial-Measurement-Unit-Based Pen Witha Trajectory Reconstruction Algorithm and Its Applications, IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, VOL. 57, NO. 10, OCTOBER 2010. [2] W. C. Bang, W. Chang, K. H. Kang, E. S. Choi, A. Potanin, and D. Y. Kim, Self-contained spatial input device for wearable computers, in Proc. IEEE Int. Conf. Wearable Comput., 2003, pp. 2634. [3] S. D. Choi, A. S. Lee, and S. Y. Lee, On-line handwritten character recognition with 3D accelerometer, in Proc. IEEE Int. Conf. Inf. Acquisition,2006, pp. 845850. [4] Z. Dong, G. Zhang, C. C. Tsang, G. Shi, W. J. Li, P. H. W. Leong, and M. Y. Wong, IMU-based handwriting recognition calibration by optical tracking, in Proc. IEEE Int. Conf. Robot. Biomimetics, 2007,pp. 382387. [5] C. C. Tsang, P. H.W. Leong, G. Zhang, C. F. Chung, Z. Dong, G. Shi, and W. J. Li, Handwriting tracking based on coupled IMU/Electromagnetic resonance motion detection, in Proc. IEEE Int. Conf. Robot. Biomimetics, 2007, pp. 377381

International Journal of Computer Information Systems, Vol. 3, No. 6, 2011 [6] J. Skaloud, K. P. Schwarz, Application of inertial technology to underground positioning: The Soudan mine shaft survey, ZeitschriftfrVermessungswesen, vol. 8, pp. 292299, 2000. [7] J. R. Huddle, Trends in inertial systems technology for high accuracy AUV navigation, in Proc. Workshop Autonomous Underwater Vehicles, Aug. 2021, 1998, pp. 6373. [8] M. Frank, Position refinement algorithm, U.S. Patent 6 292 751, Sep. 18, 2001.

AUTHORS PROFILE Radhika.R received the B.Tech degree in Information Technology from the University of Anna, Chennai, India, in 2006 and pursuing M.E degree in Computer Science Engineering from the Anna University. She worked as an Assistant Professor with the Department of Computer Science Engineering, CREC, JNTU University, Tirupathi, Andhra Pradesh, India. Her research interests include artificial intelligence, operating system, software testing tools. Has published papers in various International Journals and International Conferences. Ramya Dorai.D received her M.E degree from Anna University, Chennai, India in 2006. This author is currently working as Assistant Professor at Adhiyamaan College of Engineering, Hosur, Tamil Nadu, India. Is Life member of ISTE Chapter. Has published papers in various National and International Journals , and International Conferences.

December Issue

Page 99 of 99

ISSN 2229 5208

Vous aimerez peut-être aussi