Sentence Analyzer - enabling business and enterprise applications to handle sentences and text
Sentences can be compared and evaluated. Your desktop Find/Search compares characters. Try it now with whole sentences, compared by form, by content, or by both.
Comparing sentences
A sentence fingerprint, by itself, has limited use. More is achieved when sentences are "compared" to each other, or when a large target paragraph is compared against a few desired sentences to find the best matches.
Sentence comparison is somewhat subjective. Comparison weightages are usage specific. For example, one can decide that "man bites dog" is the opposite of, similar to, or identical to "dog bites man". It depends on whether one takes a pure content view, a content + structure view, or a pure structure view.
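The difference between the views can be illustrated with a rough sketch. It is not the toolkit's method: the function names `content_score` and `position_score` are made up for this illustration, and plain word matching stands in for any grammar rules. A pure structure view would need part-of-speech information and is left out here.

```python
from collections import Counter

def tokens(sentence):
    """Lowercase the sentence, drop end marks, and split it into words."""
    return sentence.lower().strip(" .!?").split()

def content_score(a, b):
    """Pure content view: share of words in common, ignoring word order."""
    ca, cb = Counter(tokens(a)), Counter(tokens(b))
    shared = sum((ca & cb).values())   # words present in both sentences
    total = sum((ca | cb).values())    # words present in either sentence
    return shared / total if total else 1.0

def position_score(a, b):
    """Content + structure view (crude): share of positions holding the same word."""
    ta, tb = tokens(a), tokens(b)
    longest = max(len(ta), len(tb))
    if longest == 0:
        return 1.0
    same = sum(1 for x, y in zip(ta, tb) if x == y)
    return same / longest

# Same words, different order: identical by content, mostly different by position.
print(content_score("man bites dog", "dog bites man"))
print(position_score("man bites dog", "dog bites man"))
```

Under these assumptions the two sentences are identical by content, yet only the middle word agrees by position, which is one way the same pair can be judged "identical" or "nearly opposite".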
In this toolkit, only the content+structure and the pure structure modes are used, implying that the machine does not really "understand" anything :-). It just does dumb mechanical comparisons, within some rules of grammar and using two or three distinct methods. It may need to be customized, because a "generic" comparison is necessarily somewhat fallible.
Having said that, even this "limited because generic" comparison tool can work in business situations. For example, the comparison number from "man bites apple" is consistent and logical. Please try each sample below (A, B, C, D) and switch between the different flavors of content+structure. Explore the "Compare less tightly" option and the others. For questions: kinshuk_in @ yahoo dot. com
Note: The RESULTS shown are the human-readable equivalent of Java/C# objects, which carry much additional information (word meanings, group codes such as colors and flavors, and so on) intended for further analytical or statistical treatment. This demo, created with no NLP APIs, stresses that with text it is better to first maximize grammar-based processing and to use statistical or mathematical methods much later.
Sentences must be separated from each other by an end mark (. or ! or ?) followed by one space.
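As a minimal sketch of this input rule, the splitter below cuts the text after each end mark that is followed by a single space. The function name `split_sentences` is hypothetical and this is not the demo's own code. Note that a rule this simple also splits after abbreviations such as "Dr. Smith".

```python
import re

def split_sentences(text):
    """Split text at a space that directly follows '.', '!' or '?'."""
    # The lookbehind keeps the end mark attached to its sentence.
    parts = re.split(r"(?<=[.!?]) ", text.strip())
    return [p for p in parts if p]

print(split_sentences("Man bites dog. Dog bites man! Who bit whom?"))
```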
Comparing sentences demo: compare by content + structure, or by structure alone. (Click a sample, A, B, C or D, then Compare.)
Enter ONE benchmark sentence. This will be the basis for the comparison.
Enter a TARGET sentence. This is what you want compared against the benchmark.
Summary results: The two sentences had a 79.6 percent correlation factor.
Best matching content + structure after a find/collect operation
All sorted matches (best finds on top, degrading towards the end; very bad matches are ignored)
For single sentence comparisons, a list of the things compared (obvious non-occurrences indicate bad grammar rules or dictionary issues)
- More features and pluses of the demo (see also home/basic analyzer):
- Designed for semi-structured sentence comparisons, but with some homework it does nicely on unstructured content too.
- Comparison modes, weightages and many comparison parameters can be varied.
- Each comparison option is arrived at in a different way. Choosing the best fit, or a combination, is left to end-use applications.
- Each comparison yields a single number. This is intended to even out and simplify things for business apps.
- Special versions could have arrays and matrices of many comparison numbers.
- Caution and warning: Please do not assume that any "meaningful" comparison is being done!
- Nevertheless, scope exists for improving the meaning-oriented side of things.
- For unstructured target inputs, scope exists to mechanically formulate a set of benchmarks that compares with the desired input text.
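How variable weightages can reduce a comparison to one number, and how sorted matches with a cutoff can be produced, is sketched below under loud assumptions. The names `combined_score`, `rank_matches`, `content_weight` and `cutoff` are invented for this illustration, and Python's standard `difflib.SequenceMatcher` stands in for the toolkit's grammar-based structure comparison, which it does not reproduce.

```python
from difflib import SequenceMatcher

def words(sentence):
    """Lowercase the sentence, drop end marks, and split it into words."""
    return sentence.lower().strip(" .!?").split()

def combined_score(benchmark, target, content_weight=0.5):
    """Blend a content score and a word-order score into one percentage."""
    b, t = words(benchmark), words(target)
    if not b and not t:
        return 100.0
    # Content part: fraction of distinct words the two sentences share.
    sb, st = set(b), set(t)
    content = len(sb & st) / len(sb | st)
    # Order part: order-sensitive similarity of the two word sequences.
    order = SequenceMatcher(None, b, t).ratio()
    blended = content_weight * content + (1 - content_weight) * order
    return round(100 * blended, 1)

def rank_matches(benchmark, targets, cutoff=30.0, content_weight=0.5):
    """Score every target, drop very bad matches, and put the best on top."""
    scored = [(combined_score(benchmark, t, content_weight), t) for t in targets]
    kept = [pair for pair in scored if pair[0] >= cutoff]
    return sorted(kept, key=lambda pair: pair[0], reverse=True)

for score, sentence in rank_matches(
    "man bites apple",
    ["man bites apple", "man eats apple", "the weather is fine today"],
):
    print(score, sentence)
```

Moving `content_weight` towards 1.0 approximates a pure content view, and moving it towards 0.0 lets word order dominate. An end-use application would pick the blend and the cutoff that suit its own data.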
More reading for those interested...
1. Things that are not obvious from the demo
2. Business products and possibilities
3. So what!! Universal grammar has been in use for decades now...
4. The inevitable comparisons to what already exists out there
5. What is a sentence, to future application builders?
6. The genesis and design principles story
7. Arbitrary listing of business usages
8. Extensions, additions, customizations possible in the toolkit
9. Important: benefits of combining with the document extraction tool at text2data.net
If your software never makes a mistake, then what you really possess is an adding machine. Creativity is only possible when two intelligences compete.
Contact at: kinshuk_in @ yahoo dot. com