We have all seen failures captured in freetext. The problem is so common that this information is typically used only to review the shift and is then stored, never to be used again, even though it is the most important field: the one that holds the failure modes!
Back in the 60s, NASA had the reverse problem: they wanted an efficient way to enter human commands into a computer to control the spacecraft. They settled on an elegant way of coding things using a VERB-NOUN format: OPEN-VALVE, SWITCHON-LIGHT and so on.
Failure data usually mentions a component and its status in one form or another. One way to encode the freetext into usable data is to take a similar approach.
Using an online Part-of-Speech (POS) tagger, one can code the words into three categories: nouns, verbs and adjectives. Adjectives are included to capture STATUS words such as loose, damaged and broken. These really come in handy when looking for typical bad actors.
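To make the idea concrete, here is a minimal sketch in Python. It is not a real POS tagger: the small `LEXICON` dictionary is a hypothetical stand-in for whatever tagger you use, and the sample notes, the `encode` and `bad_actors` helpers and the NOUN-ADJ pairing rule are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical mini-lexicon standing in for the output of a real POS tagger.
LEXICON = {
    "pump": "NOUN", "valve": "NOUN", "seal": "NOUN", "bolt": "NOUN",
    "replaced": "VERB", "tightened": "VERB", "tripped": "VERB",
    "loose": "ADJ", "damaged": "ADJ", "broken": "ADJ",
}

def encode(note):
    """Group the recognised words of one freetext note by category."""
    coded = {"NOUN": [], "VERB": [], "ADJ": []}
    for word in re.findall(r"[a-z]+", note.lower()):
        tag = LEXICON.get(word)
        if tag:  # words outside the lexicon are simply ignored
            coded[tag].append(word.upper())
    return coded

def bad_actors(notes):
    """Count NOUN-ADJ (component-status) pairs across all notes."""
    counts = Counter()
    for note in notes:
        coded = encode(note)
        # Crude assumption: every noun in a note pairs with every adjective.
        for noun in coded["NOUN"]:
            for adj in coded["ADJ"]:
                counts[f"{noun}-{adj}"] += 1
    return counts

# Made-up shift notes, for illustration only.
notes = [
    "Found the pump seal damaged, replaced it",
    "Bolt loose on valve, tightened",
    "pump seal damaged again",
]

for pair, n in bad_actors(notes).most_common():
    print(pair, n)
```

Pairing every noun with every adjective in a note is deliberately simple and will over-count in longer notes; a real tagger with dependency information would link each status word to the right component.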




