Pieces of my NLU world are coming together. Especially around this idea of a TChat that supports the '+' operator and the SetData() method. The TChat is supposed to be a minimal "text in/text out" entity that uses a topic specific tree and a set of topic specific narrative patterns. The TChat extracts information from the text and formulates a response. The '+' operator allows creating more complex TChats from simpler ones. It is more than likely that binding the input and the output within the same object is not quite right; but never mind. What is important is this whole business of "what is the data?". If you can answer that you have a start.
The TChat construct gets close to the heart of an important truth about geometric objects (or for that matter any data type that can be diagrammed or formulated into DB schema) - which is that a geometric object is comprised of a few visual perceptions and a lot of language constructs. For example Euclid's 'triangle' relies on intuitions of points, lines, and angles - which are glued together with words. What does a TChat have to do with this? It is about constructing more complex objects from simpler ones using '+', in a way that mirrors something of what builds understanding of the geometric object. There would be a tree of words related to triangles, and a collection of things that are said using the vocabulary of this tree.
The exciting corollary is that to design a language understanding system about a geometric object is simply a matter of writing down a definition for the geometric object. Quite literally, the language understanding [software] has the same organization as the geometric object because that object is linguistic organized in the first place! This is a huge simplification in the software design process.
So there. When you get harmony between your implementation of a topic and the natural way of thinking about it (sometimes called "business logic") you are on the roll.
But that is not all. Over the last couple days, a colleague named Serge Gomert from Belgium has been showing me what he has learned about NLU with Alexa and Microsoft LUIS. I just took a closer look and darned if Microsoft isn't headed in the right direction. But the good news is that they are still way behind Narwhal. And the last excitement is this: I can get a Javascript widget from Microsoft, designed to embed in a web page, and I am up and running with their voice-to-text accessible to me - where I can do my own NLU and, in particular, do TChats. Those Microsoft suckers may have given me a platform to leapfrog them.
Makes me want to quit my job and focus on creating a TChat wizard.
Parenthetically, the reason I believe LUIS is way behind Narwhal is for reasons big and small. The first is that their 'intent' construct seems to require MxN rather than M+N variations when there are M choices for one part and N choices for a second part of a two part intent. Further, although they have a score, it is not signed - which to me is another way of saying that they do not have proper support for sentiment. Still further, they do not support narratives like relation, event, sequence, or becoming. Still further, they do not allow any form of narrative nesting. So they are at the bottom of the ladder. I am three rungs up, waiting for someone smart to figure out the "Golden Algorithm".
So isn't there a commercial opportunity there? I mean if I could insert myself between Microsoft and the public, I might be able to afford a bigger retirement home but, more importantly, be able to continue feeling useful for a few more moments.
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment