Sentence Analyzer - enabling business and enterprise applications to handle sentences and text

Find, search, sort and filter.    A Find/Search utility now done with sentences. Use it for local, enterprise or web documents.

Sentences, after analysis, become amenable to more intelligent search operations.   This helps build useful and practical search tools, involving not just the words, but a complex rich with content and meaning and structure. A large mass of target sentences can be filtered/sorted using some desired "benchmarks".
The finding is not "textually rigid". Human-friendly allowances and structural flexibility becomes possible. Similar sentences get collected and stacked, minor differences in content and structure notwithstanding. The match is as near in meaning and intention as can be managed from a machine.
Its practical usages are many. Say, "highlighting" a document with some specific desired sentences standing out. Or, retrieving text from the web, extracting a specific theme. Some homework is needed to create just the right set of benchmarks, but that can be mechanized.
In the boxes below, please enter 1-2 desired benchmarks, and a bunch of 20-30 target sentences (takes a while). Try to use sentences which have structural similarity, that is a limitation of this demo. Also explore the "Compare less tightly" and other options. For questions : kinshuk_in @ yahoo dot. com
Note : The RESULTS shown are the human readable equivalent of Java/C# objects, and they have lots of additional information, like word meanings, group codes like colors and flavors etc., intended for further analytical/statistical treatment. This demo (created with no NLP APIs), stresses that with text, it is better to first maximize grammar based processing, and use statistics/math methods much later.
Sentences must be separated from each other by an ending period (. or ! or ?) and one space. Skip the descriptive stuff and go directly to demo
Find/filter sentences demo : find sentences by your benchmarks ( Click more samples : A, B, C, D then Find/Collect)
Enter ONE benchmark sentence. This will be the basis for a finding operation.
Enter multiple TARGET sentences. This is what you want searched using the benchmark.
(Please scroll down for the RESULTS)     
Summary results : Searched and sorted among 24 target sentences. Highest co-relation 129. percent.
Best matching content + structure after a find/collect operation
All sorted matches (Best finds on top, degrades towards the end, and very bad matches ignored)
Options for comparing, or finding and selecting (i.e. change default settings)
Compare less tightly Use frequently, if find results are not OK
More structural than content  and 
Tree depth (deeper into structure)
  • Features, tall claims, and things to note in this demo :
  • Most sentences will be analyzed and compared in this toolkit. That is a plus over many POS taggers. Most sentences will yield a pattern.
  • Do NOT assume that the "benchmarks" are being searched by human meaning. It is mostly structural, with a peppering of the content.
  • In unstructured things like hotel reviews, a lot of prior homework using similar reviews is needed to create effective benchmarks/finders, needs a tool (not in this demo).
But it is not exact ? : Only adding machines can be exact, and the "panda eats, shoots and leaves" ! Google is successful because it exceeds being an adding machine, becomes a "probably" with increasing exactness. So is high-frequency stock trading. This toolkit can be quite exact in many situations, especially when the sentences involved are structurally similar.
Can more than one benchmark be used ? : Yes, multiple benchmark sentences that adequately represent different variations of the desired text can be used. The finding process becomes slower.
More view/try pages here.
Generic use cases
Back to home/basic analyzer
Comparing sentences, several modes
Find/search/sort/filter
Crunching a big text
Wildcard usages in pure structure mode
Business use cases
Handling Notes section in annual reports
Crunching of a Presidential speech
Executive profiles
Project statuses
USPTO events alerter
Customer reviews
Back to home/basic analyzer

More reading for those interested...

1. Things that are not obvious from the demo
2. Business products and possibilities
3. So what !! Universal grammar has been in use for decades now ...
4. The inevitable comparisons, to what already exists out there.
5. What is a sentence, to future application builders ?
6. The genesis and design principles story
7. Arbitrary listing of business usages
8. Extensions, additions, customizations possible in the toolkit
9. Important : Combining with the document extraction tool at text2data.net, benefits
In another 10 years, our present word-based search engines will be laughed at.
Contact at : kinshuk_in @ yahoo dot. com