Self-Service Data Analytics and Governance for Managers. Nathan E. Myers. Читать онлайн. Newlib. NEWLIB.NET

Автор: Nathan E. Myers
Издательство: John Wiley & Sons Limited
Серия:
Жанр произведения: Корпоративная культура
Год издания: 0
isbn: 9781119773306
Скачать книгу
hour and a good jog later, the author then launched his thermostat app to turn down the AC, before getting in the car to begin the commute. Once in the car, the same phone displayed a message indicating that there was light traffic to the train station, a journey which was predicted to take 8 minutes. A prompt came up to allow for quick directions to the train station, which was hastily declined. Afterall, the authors are brilliant not only with data analytics and governance books, but with getting themselves to the train station, even without the benefit of GPS-assisted instructions.

      It is easy to see machine learning in action, in just the first 90 minutes of the day. It is not exactly clear which of the author's observed activities or features were used to trigger his phone to make suggestions, but it is clear that routine daily actions observed and logged over time had served as training data and had ultimately resulted in a number of predictions about subsequent activities or labels. Clearly, the phone knew the time the alarm was set for, at what time the commute begins, and where your author's car is left for the day.

      There is a scale of maturity for machine learning capabilities, beginning with descriptive analytics to look at what has happened in the past with data aggregation and mining, moving forward to diagnostic analytics, to understand the drivers of the target outcomes. Moving further along the continuum, we get to predictive analytics to help us to project from past observations what will happen in the future, based on statistical forecasting models, and on to prescriptive analytics, which uses optimization and simulation algorithms to advise on possible outcomes and to determine what actions should be taken. In the phone example above, the machine learning model is far out to the right, even approaching prescriptive analytics. The model was able to predict the next actions and to prescribe what to do about them – launch the driving directions app, as you are in the car and headed to the train station!

      Optical Character Recognition/Intelligent Character Recognition

      Optical character recognition (OCR) is a means of using software to convert images of typed, handwritten, or printed text into machine-encoded text, from a number of formats – scanned documents, photos of documents, or even from subtitles, captions, or text superimposed on an image. As a practical matter, OCR is often used to digitally capture books or other documents with consistent and universally recognizable fonts. OCR is often a component of document management software (DMS) that can be used to go paperless. Many readers may use DMS programs to allow them to take a snapshot of transaction receipts with their phones, and the software will capture and categories transactional details like the items purchased and the vendor, directly from the image. Other images that we work with can be tailored to our needs better with OCR. Common examples include Adobe Acrobat document images, which are common for locking down documents into a stable, read-only format, prior to distribution. Using OCR capabilities can allow for more flexible digital archiving, can make them searchable, and can even allow users to copy and paste from the created body of machine-encoded text, once it has been extracted from the image.

      Natural Language Processing

      Natural language processing (NLP) is a dimension of artificial intelligence that enlists linguistics and computer science to improve how computers can capture, analyze, and process large volumes of human language data. Efforts in this area have centered around speech recognition, language translation, natural language understanding, and natural language generation. This is perhaps the area of artificial intelligence which has been in existence the longest, spanning decades of work in the field.

      In our discussion of robotic process automation (RPA) in a previous section of this chapter, we described in some detail how Chat Bots can multiply the efforts of existing customer service staff by engaging users to extract common demands and then locating appropriate information to provide in response. Chat Bots leverage NLP to “understand” those demands. Other popular assistants in use today are reliant on NLP to enable commands to be invoked.

      For many of us, informal conversational English can be quite different from explicit computer language demands. “Hey Siri, can you place a call to order pizza from that place on Main Street we ordered from last week?” is a natural language request that needs to be classified, or broken down, to answer such questions as Q1: “What does the user want?” – A1: The user would like me to place a call, or at least to locate a phone number. Q2: “Place a call to (or get a phone number for) whom?” – A2: A pizza restaurant that is in the user's call history, with an address on Main Street, Q3: “If my classification fails, or any of my actions do not appear suitable, relevant, or of value, is there any other related information I can provide to better assist the user?” – A3: Provide pizza options from nearby restaurants.

      In the above example, getting assistance from a finely tuned NLP model, to process informal vernacular in posts, to glean meaning from content, to perform classification to determine whether posts are appropriate, and finally to flag as exceptions any posts deemed inconsistent with usage guidelines, could be a very valuable tool to have at your disposal. Equipping an NLP model to extract, structure, and classify messages, all while continually improving the outcomes, is integral to allowing humans to interface with computers on our own terms.

      Self-Service Data Analytics