ISBN-13: 9783836494519 / Angielski / Miękka / 2008 / 52 str.
The electronic age has increased the reach of human capabilities so much that the expectations about what constitutes an appropriate empirical linguistic analysis should change. A hundred years ago, linguistics was largely an empirical manual process that produced information intended for humans.Today, the world is different as inexpensive computing power and the prevalence of information in electronic format encourages that information is processed through an automated and scalable means that creates results that are readily consumable and understandable by computers. Creating sustainable and usable observations is possible through a standards-based approach that meets long term persistence and usability goals. This book presents a scalable architecture for creating linguistic observations in the form of string frequencies measurements and instantiates those measurements in a machine-readable standards-based format called Resource Description Framework (RDF). The metadata this effort creates consists of approximately half a billion logical assertions (over 40 gigabytes) and is available by contacting the author Ronald P. Reck by email at rreck@rrecktek.com.