Patent References
Text processing method and apparatus
Computer method for automatic extraction of commonly specified information from business correspondence
Method and apparatus for producing an abstract of a document
Digital computing apparatus for preparing document text
Patent #: 5257186

Inventors:
Assignee:
Application No. 085385, filed on 07/02/1993
US Classes: 715/531 (Text), 704/2 (Translation machine), 704/9 (Natural language)
Examiners: Primary: Hayes, Gail O.; Assistant: Shingala, Gita D.
Attorney, Agent or Firm:
International Classes: G06F 007/38, G06F 007/6

Abstract
A summary is automatically formed by selecting regions of a document. Each selected region includes at least two members of a seed list. The seed list is formed from a predetermined number of the most frequently occurring complex expressions in the document that are not on a stop list. If the summary is too long, the region-selection process is performed on the summary to produce a shorter summary. This region-selection process is repeated until a summary is produced having a desired length. Each time the region-selection process is repeated, the seed list members are added to the stop list and the complexity level used to identify frequently occurring expressions is reduced.
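
The loop described in the abstract can be sketched in Python as follows. This is a minimal illustration under several assumptions that the abstract does not state: a "region" is taken to be a sentence, the "complexity level" of an expression is taken to be its word count, and an expression is skipped when it, or any word in it, is on the stop list. The function names, the default stop list, and the parameters `max_regions`, `n_words`, and `seed_count` are hypothetical.

```python
import re
from collections import Counter

# Hypothetical starting stop list; the abstract does not specify its contents.
DEFAULT_STOP = frozenset({"a", "an", "the", "of", "is", "in", "to", "and"})

def split_regions(text):
    """Assumption: a region is one sentence."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]

def expressions(region, n_words):
    """All runs of n_words consecutive words (assumed complexity = word count)."""
    words = re.findall(r"[a-z0-9]+", region.lower())
    return [" ".join(words[i:i + n_words])
            for i in range(len(words) - n_words + 1)]

def build_seed_list(regions, n_words, stop_list, seed_count):
    """The seed_count most frequent expressions that are not on the stop list."""
    counts = Counter()
    for region in regions:
        for expr in expressions(region, n_words):
            if expr in stop_list or any(w in stop_list for w in expr.split()):
                continue
            counts[expr] += 1
    return [expr for expr, _ in counts.most_common(seed_count)]

def select_regions(regions, seeds, n_words):
    """Keep regions that contain at least two distinct seed list members."""
    seed_set = set(seeds)
    return [r for r in regions
            if len(seed_set & set(expressions(r, n_words))) >= 2]

def summarize(text, max_regions, n_words=2, seed_count=3, stop_list=DEFAULT_STOP):
    stop = set(stop_list)
    regions = split_regions(text)
    while len(regions) > max_regions:
        seeds = build_seed_list(regions, n_words, stop, seed_count)
        selected = select_regions(regions, seeds, n_words)
        if not selected:
            break  # nothing qualifies; return the last non-empty summary
        regions = selected
        stop.update(seeds)             # seed members join the stop list
        n_words = max(1, n_words - 1)  # reduce the complexity level
    return regions
```

For example, `summarize(text, max_regions=3, n_words=1, seed_count=2)` keeps only the sentences that mention both of the two most frequent non-stop words. Note that this sketch can return more than `max_regions` sentences when no region contains two seed members; how that case should be handled is not covered by the abstract.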