<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Business Intelligence &#8211; Megaputer Intelligence</title>
	<atom:link href="https://www.megaputer.com/tag/business-intelligence/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.megaputer.com</link>
	<description>Your Knowledge Partner</description>
	<lastBuildDate>Tue, 24 Mar 2026 00:02:52 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=5.0.22</generator>

<image>
	<url>https://www.megaputer.com/wp-content/uploads/favicon.png</url>
	<title>Business Intelligence &#8211; Megaputer Intelligence</title>
	<link>https://www.megaputer.com</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Query languages—the Swiss army knife of information extraction</title>
		<link>https://www.megaputer.com/query-languages-the-swiss-army-knife-of-information-extraction/</link>
		<pubDate>Tue, 06 Feb 2024 05:19:41 +0000</pubDate>
		<dc:creator><![CDATA[Echo Lu]]></dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[Computational Linguistics]]></category>
		<category><![CDATA[Data Analytics]]></category>
		<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Entity Extraction]]></category>
		<category><![CDATA[Fuzzy Matching]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Morphology]]></category>
		<category><![CDATA[Pattern Definition Language]]></category>
		<category><![CDATA[Text Analytics]]></category>

		<guid isPermaLink="false">https://www.megaputer.com/?p=35231</guid>
		<description><![CDATA[<p>Text mining, the art of extracting information from text, requires the formulation of efficient queries that retrieve information based on user input. To do this, the user requires a language for writing queries. For the most basic use cases, the language operators could be regex or string search. But while regex and string search are...</p>
<p>The post <a rel="nofollow" href="https://www.megaputer.com/query-languages-the-swiss-army-knife-of-information-extraction/">Query languages—the Swiss army knife of information extraction</a> appeared first on <a rel="nofollow" href="https://www.megaputer.com">Megaputer Intelligence</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p><span style="font-weight: 400;">Text mining, the art of extracting information from text, requires the formulation of efficient queries that retrieve information based on user input. To do this, the user requires a language for writing queries. For the most basic use cases, the language operators could be regex or string search. But while regex and string search are indispensable for text mining, their utility hits a hard ceiling when semantic or meaningful search is required. They cannot, for example, capture complex entities such as human names, corporations, and drugs. For handling tasks like these, we need a more powerful query language that has semantic understanding, such as Megaputer’s PDL.</span></p>
<p><span style="font-weight: 400;">So, what is PDL, and how does it achieve semantic understanding while regex and string search do not? Let’s take a look at an example to find out.</span></p>
<p><span style="font-weight: 400;">First of all, PDL does not just search for the literal form of the word in the query: instead, it automatically extends its search to all morphological forms of the word. For example, when searching for the word “company” in financial news articles, the PDL query will not only find “company”, but also “companies,” the plural form. This feature often comes in handy, especially when the search involves a verb. Suppose that you are interested in extracting </span><i><span style="font-weight: 400;">what the CEOs said. </span></i><span style="font-weight: 400;">With regex or other substring search, you will need to list all possible verb forms such as “say,” “saying,” “says,” and “said.” With PDL, simply entering “say” in the query will automatically fetch all possible verb forms. This behavior can also be turned off by enclosing the word in the </span><i><span style="font-weight: 400;">form </span></i><span style="font-weight: 400;">function</span><span style="font-weight: 400;">,</span><span style="font-weight: 400;"> which will then restrict the search to the literal form of the word, such as in the example below.</span></p>
<p><img class="wp-image-35249 aligncenter" src="https://www.megaputer.com/wp-content/uploads/comparison_pdl-1.png" alt="" width="800" height="497" /><br />
<!-- <img class="wp-image-35232 aligncenter" src="https://www.megaputer.com/wp-content/uploads/pdl-image-1-300x186.jpg" alt="" width="710" height="440" srcset="https://www.megaputer.com/wp-content/uploads/pdl-image-1-300x186.jpg 300w, https://www.megaputer.com/wp-content/uploads/pdl-image-1-1024x636.jpg 1024w, https://www.megaputer.com/wp-content/uploads/pdl-image-1-768x477.jpg 768w, https://www.megaputer.com/wp-content/uploads/pdl-image-1-644x400.jpg 644w, https://www.megaputer.com/wp-content/uploads/pdl-image-1-600x373.jpg 600w" sizes="(max-width: 710px) 100vw, 710px" /> --></p>
<p><span style="font-weight: 400;">Another notable feature of the PDL language is its capability for users to tailor the scope of their searches using a range of built-in functions. Returning to the previous example, you may not wish to confine your search exclusively to the specific verb “say,” but rather include other synonymous verbs like “tell” or “mention.” Achieving this is straightforward with PDL – users can invoke the </span><i><span style="font-weight: 400;">synonym</span></i> <span style="font-weight: 400;">function with the verb &#8220;say,&#8221; as demonstrated in (a) below. As the subsequent results table (b) illustrates, the captured text now includes various speech verbs such as “tell,” “emphasize,” and “claim,” in addition to the word “say,” capturing them in all possible verb forms. For additional flexibility, the user can also create and modify synonym dictionaries.</span></p>
<p><img class="wp-image-35250 aligncenter" src="https://www.megaputer.com/wp-content/uploads/comparison_pdl-2.png" alt="" width="800" height="497" /><br />
<!-- <img class="wp-image-35235 aligncenter" src="https://www.megaputer.com/wp-content/uploads/pdl-image-2-300x269.jpg" alt="" width="737" height="661" srcset="https://www.megaputer.com/wp-content/uploads/pdl-image-2-300x269.jpg 300w, https://www.megaputer.com/wp-content/uploads/pdl-image-2-1024x919.jpg 1024w, https://www.megaputer.com/wp-content/uploads/pdl-image-2-768x689.jpg 768w, https://www.megaputer.com/wp-content/uploads/pdl-image-2-446x400.jpg 446w, https://www.megaputer.com/wp-content/uploads/pdl-image-2-600x538.jpg 600w" sizes="(max-width: 737px) 100vw, 737px" /> --></p>
<p><span style="font-weight: 400;">The PDL language offers various modes of information extraction, including proximity search (e.g., finding words A and B within a sentence, or within a 3-words range), syntactic relation (e.g., finding word A that is the subject or object of B), semantic relation (e.g., finding words that are synonyms/antonyms to word A), access to dictionaries and ontologies, and more. This language is expressive enough to capture complex patterns, and yet relatively easy to use, </span><span style="font-weight: 400;">having a syntax that closely resembles English. Having access to this versatile query language significantly enhances the power and quality of text mining operations.</span></p>
<p><span style="font-weight: 400;">In conclusion, PDL is a powerful and versatile query language that enables users to extract meaningful information from text with greater efficiency and accuracy than competing methods like regex or string search. Its ability to understand and capture morphological forms, synonyms, and other complex patterns makes it an indispensable tool for solving text mining tasks that require semantic understanding. By leveraging the capabilities of PDL, users can enhance their information extraction processes and gain valuable insights from their data, making it a true Swiss army knife of information extraction.</span></p>
<p>The post <a rel="nofollow" href="https://www.megaputer.com/query-languages-the-swiss-army-knife-of-information-extraction/">Query languages—the Swiss army knife of information extraction</a> appeared first on <a rel="nofollow" href="https://www.megaputer.com">Megaputer Intelligence</a>.</p>
]]></content:encoded>
			</item>
		<item>
		<title>Megaputer nominated for DM Review&#8217;s 2006 World Class Solution Award in the category of business intelligence</title>
		<link>https://www.megaputer.com/dm-review-nomination-business-intelligence-2006/</link>
		<pubDate>Wed, 01 Feb 2006 22:24:39 +0000</pubDate>
		<dc:creator><![CDATA[Sergei Ananyan]]></dc:creator>
				<category><![CDATA[Awards & Recognition]]></category>
		<category><![CDATA[Business Intelligence]]></category>

		<guid isPermaLink="false">https://www.megaputer.com/?p=27602</guid>
		<description><![CDATA[<p>Megaputer has been nominated for DM Review's 2006 World Class Solution Award based on it's business intelligence solution for EDS.</p>
<p>The post <a rel="nofollow" href="https://www.megaputer.com/dm-review-nomination-business-intelligence-2006/">Megaputer nominated for DM Review&#8217;s 2006 World Class Solution Award in the category of business intelligence</a> appeared first on <a rel="nofollow" href="https://www.megaputer.com">Megaputer Intelligence</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p>EDS is an IT services company with 120,000 employees operating in 60 countries. Under new leadership since 2003, the company is pursuing aggressive goals in four areas: account operations, product mix, financial performance, and organizational effectiveness.</p>
<p>As a services company, EDS people are the key element of the company. Whereas software, hardware, and systems companies offer value mainly through products, services companies like EDS serve clients through people. The knowledge, skill, and experience of employees are the lifeblood of the company. As such, all corporate goals have important workforce components, such as change management and continuous re-skilling. In addition, maintaining the values of the company is an ongoing workforce priority.</p>
<p>Central to maintaining an effective workforce, is employee feedback. EDS leaders use a variety of forums to engage in constant dialog with employees. One mechanism for feedback is the EDS employee survey program.</p>
<p>The EDS survey program includes an annual survey, the Voice of the Employee (VOE) survey, in which all employees world-wide are invited to participate. Additional surveys are implemented three to four times a year. Called Pulse Surveys, these additional surveys are sent to stratified samples of employees across the company.</p>
<p>Historically, employees were providing responses to scaled questions on surveys, which delivered quantitative results along several variables such as leadership, communication, and the work environment. This data was analyzed and interpreted using traditional tools.</p>
<p>Since 2003, open-ended questions have been added to employee surveys. Text generated from open-ended questions offers a broader, richer source of information than do scaled questions alone.</p>
<p>Unstructured text data, of course, presents new challenges as well. Initially, text comments from open ended questions were processed manually. Teams of employees reviewed the text then coded each record into one or more topics. Inter-rater reliability was maintained by having several employees code a portion of the text, then comparing the results. After the coding process, summaries and examples were created and reported to EDS senior leaders for follow-up action.</p>
<p>While the value of text comments for improving organizational effectiveness was unquestioned, two problems persisted. First, the process was slow. Whereas survey reports of quantitative survey data were available a few weeks after a survey closed, text comment reports lagged by months. Moreover, the process at times involved more than a dozen employees. From a productivity standpoint, the slow, manual process could not be sustained.</p>
<p>Second, the manual coding process was difficult to adjust to varying situations. For example, if a topic became too broad to be meaningful (e.g. &#8220;unit-level communication&#8221;), a principled subdivision was difficult to achieve. At the other extreme, a single, widely-distributed comment (e.g. &#8220;many emails contain redundant information&#8221;) was difficult to link to higher-level topics.</p>
<p>By 2005, EDS survey analysts set three high-level goals for analyzing text comments. First, cycle time must be reduced to allow text and quantitative data to be reported at the same time. Second, the resource requirement must reduce from dozens of employees to one person. Third, the new approach must be scalable to a wide variety of projects.</p>
<p>To view the full document, please download the PDF: <a href="https://www.megaputer.com/wp-content/uploads/polyanalyst-employee-survey-analysis-eds.pdf" target="_blank" rel="noopener">Full Nomination (PDF)</a></p>
<p>The post <a rel="nofollow" href="https://www.megaputer.com/dm-review-nomination-business-intelligence-2006/">Megaputer nominated for DM Review&#8217;s 2006 World Class Solution Award in the category of business intelligence</a> appeared first on <a rel="nofollow" href="https://www.megaputer.com">Megaputer Intelligence</a>.</p>
]]></content:encoded>
			</item>
	</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.w3-edge.com/products/

Page Caching using disk: enhanced 
Minified using disk

Served from: www.megaputer.com @ 2026-06-07 05:31:32 by W3 Total Cache
-->