BASIS TECHNOLOGY AND NUIX TRIAGE MULTILINGUAL DATA AT BLAZING SPEED

In the movies, investigations are clear-cut and fast. Look for a body with bullet wounds and expended shell casings nearby. Look for the gun; there’s no need to look for a knife (no stab wounds) or a hammer (no evidence of blunt force trauma). The reality of digital investigations is more like looking for a body buried somewhere in a 5,000-acre junkyard with a mountain of debris on every acre. Forget the ‘needle in the haystack’ (that’s too easy); you’re looking for a specifc needle in a stack of needles.

Nuix specializes in tackling this kind of problem, expanding beyond investigations to include eDiscovery and data governance. It enables users to swiftly reduce the scope of a case from hundreds of systems to just the relevant ones. How? The Nuix engine is blazingly fast. It eats terabytes of data for lunch, thoroughly unpacking, processing and enriching the most complex data types — including unstructured and semi-structured text, mobile phone images, videos, files nested in PST or NSF files, social media data and forensic images. Other tools may silently fail on difficult files, but not Nuix.

Nuix then enriches data with normalization, concept grouping, deduplication and other programmatic analytics that empower analysts to ask questions (Where’s the body?) in order to ask better, targeted questions (Where’s the gun, what type of round was used, where else have similar rounds been found, is there pattern?). Nuix boasts of a 90% reduction in turnaround time for various types of investigations quickly reducing data to only what’s relevant and necessary to answer the questions being asked.

ROSETTE MEETS THE MULTILINGUAL CHALLENGE

We sought a partner to meet the surge of data that was becoming increasingly multilingual. Without proper language support, relevant data could be missed or erroneously excluded from a case. For Nuix, the multilingual text processing also had to be fast, thorough and accurate because:

  • In eDiscovery, multilingual documents need to be searchable such that a paragraph-long, English email footer doesn’t obscure the crucial one-sentence Japanese email body where the critical evidence is located.
  • In investigations, all bad actors do not communicate in English. Investigators without multilingual capabilities need a tool that overcomes the language barrier.
  • In data governance, the data containing names and personally identifiable information needs to be identified and securely stored, regardless of the language it is written in.

Nuix chose to partner with Basis Technology for its sophisticated, AI-powered text analytics platform, Rosette®. Operating at the same blazing speed as the Nuix Engine, Rosette identifies the language of unstructured text and then enriches it with language-specific processing in 30+ languages and their native scripts. Rosette is consistently accurate across European languages, ArabicChineseJapanese, Korean, Persian, Russian, and Urdu, ensuring that Nuix searches are accurate and comprehensive.

For example, languages without spaces between words — e.g., ChineseJapanese, and Korean — need the words to be segmented to be accurately searched. Complex languages like Arabic add affixes before, in the middle and at the end of words. Thus the stems and roots of words must be identified to enable a comprehensive search. An exact match search in Arabic for “book” (kitaab) will not match the plural “books” (kutub), unless you know that the root of both words is k-t-b.

Rosette-enriched text also enables Nuix to apply its own analytics.

In data governance or eDiscovery, you don’t want to give out personally identifiable information (PII) when you have to show data. Being able to understand PII in multiple languages quickly, accurately and at scale are essential.

Rosette also stood out to Nuix for its track record powering mission-critical systems for government intelligence, border security, financial compliance and eComms surveillance, as well as customer feedback analysis.

THE PROOF IS IN THE RESULTS

By integrating Rosette, Nuix strengthened its offerings in three key areas:

For eDiscovery, Rosette detects different language regions in a single document, so that text in each language section is properly processed to be searchable. One pass with Rosette produces a report on what proportion of a corpus of evidence is in which languages before early case assessment even begins. Every full-text search will be thorough and comprehensive, uncovering the most relevant information quickly.

In an investigation, the language used in communications can provide valuable clues. If Rosette reveals that one actor only speaks his native tongue with his mother, but then starts using it in another conversation with another person, that could be an anomaly that warrants further examination. This is particularly important in cases of human trafficking and crimes against children, where speed is essential to save lives.

Finally, with governance, understanding where your company stores sensitive data — such as unencrypted credit card numbers, electronic personal healthcare information (ePHI) or PII, is of critical importance. If a data breach occurs, you need to quickly know what the hackers found. Accurate search across languages is an indispensable tool.

AN ECOSYSTEM OF CAPABILITY TO MEET FUTURE NEEDS

Nuix has already encountered cases on the scale of hundreds of terabytes. Data volumes are increasing at an unbelievable rate, especially if you add in social media and chat messages. To think that any individual is going to go through all that data is unrealistic. There needs to be a programmatic way to cull it down.

The need to cope with astronomical data volumes is already appearing outside of traditional knowledge-based tasks. The COVID-19 pandemic has only accelerated the massive move to digital data.

“Basis Technology and Nuix are empowering legal technologists, intelligence analysts and law enforcement to cope with the information avalanche they face every day,” said Carl Hoffman, CEO of Basis Technology. “We support Nuix’s vision of building a capabilities ecosystem that combines solutions from multiple partners to meet these challenges.”

We need to be prepared for what is going to happen, and working with Basis Technology helps us do just that for our customers. We don’t yet know the shape of the data, but it definitely isn’t all going to be in English, which is why Rosette is such an essential piece. The ability to meet the future needs of our customers will enable and empower them to continue to do their jobs; uncovering waste fraud and abuse, prosecuting the guilty and exonerating the innocent.  This requires constant vigilance, and a collaborative pushing of the envelope of what’s possible.

Source: https://www.nuix.com/blog/basis-technology-and-nuix-triage-multilingual-data-blazing-speed

Nuix Partners with EDMS Consultants to Target Mining, Energy, and Utilities

Perth, Australia – May 11, 2021, Global software company Nuix (www.nuix.com, ASX:NXL) and leading solution provider EDMS Consultants, have announced a new partnership to offer Nuix solutions to the natural resources sector in Western Australia and ASEAN region.

Both companies aim to provide litigation and investigations technology to support the booming natural resources sector which faces increasing regulations, class actions, cybersecurity and privacy issues, internal investigations, and intellectual property disputes.

“Throughout the years we have been in the business, the energy, resources, and utilities sectors are among the most highly regulated industries,” said Peter Buck, Business Development Director of EDMS Consultants. “Now more than ever, operators need full access to their unstructured data or data silos to ensure regulatory compliance.”

He added, “We have worked with PETRONAS, BP, Exxon, PTTEP, and KPOC (PETRONAS/ Shell / ConocoPhillips) on various services throughout the years, and we believe based on experience Nuix has the ideal solution for big organisations with unstructured data”.

The explosion of unstructured data places an increasing burden on large enterprises – especially those in the mining and energy sector that manage very complex projects – to sort through the massive volumes of content they gather, generate and exchange every day. Added to this challenge, the often remote and distributed business model with operations and assets spread over a wide geographical area means that information governance and data access are crucial.

‘’Nuix has a proven history of partnering with large enterprises to solve their messy data challenge,’’ said Jonathan Rees, Nuix Executive Vice President, International. “We have the world’s leading technology for extracting intelligence from high volumes of structured and unstructured data, forged from our experience with regulatory inquiries. Opening new markets and customer segments will continue our growth path and I am excited to partner with EDMS, to drive our combined solution and services, into the wide footprint EDMS has in the natural resources industry.”

About Nuix

Nuix (www.nuix.com, ASX:NXL) creates innovative software that empowers organisations to simply and quickly find the truth from any data in a digital world. We are a passionate and talented team, delighting our customers with software that transforms data into actionable intelligence and helps them overcome the challenges of litigation, investigation, governance, risk, and compliance.

About EDMS

EDMS is a leading solution provider in the Asia Pacific Region, providing enterprise data solutions to the Energy, Resource & Utility industry. We continuously explore and find the best solution to offer our clients. We have a multi-disciplined team of specialists, based in Kuala Lumpur, Malaysia, and Perth, Australia to support our clients. EDMS has implemented projects to the leading Energy, Resource & Utility throughout the region.

An Ending to “End-to-end?”

Data Warehouse

Written By: Jason Purcell

You’ve likely heard the common catchphrase ‘end-to-end’ many times in our little eDiscovery world. It’s a buzzword that has helped to serve many of us in the investigations, eDiscovery, and compliance communities. Even Gartner uses it, stating “By 2023, more than 70% of enterprise IT leaders will upgrade to end-to-end e-discovery software to reduce time and legal spend, up from 10% in 2019.”

In recent years, there has been an undeniable uptick in enterprise customers leveraging a combination of software and eDiscovery consultants helping to build their own end-to-end in-house eDiscovery and information governance program. In helping to architect many of these, it occurred to me that the very phrase itself can be quite misleading.

FROM LEFT TO RIGHT

The left ‘end’ is rather straightforward. The duty to preserve electronically stored information (ESI) gets triggered when litigation is reasonably anticipated. From there, we know the rest—preservation, collection, processing, and review of discoverable ESI ensues. Makes sense.

The right ‘end’ is where it gets a little foggy and some logical questions begin to surface:

  • Does it truly end with a production / presentation?
  • If so, is it safe to presume that each end-to-end process is an isolated, matter-by-matter task that has a defined beginning and a defined end?
  • What about all the time and money we just spent on the last case? Are we going to let all those expensive coding and case strategy decisions go to waste?
  • Wait, are you telling me that we are also going to have to re-collect, re-process, and re-review everything again the next time a new matter pops up, even if the same custodian’s data is required again?

At Nuix, we embrace these questions and begin asking questions of our own, for your eDiscovery and information governance program’s benefit:

  • Why collect the same data over and over?
  • Why process data more than once?
  • Why not create a principal data inventory of your frequent flier custodians’ ESI?
  • Why can’t we leverage modernized scalable architecture to be able to search, analyze, and cull even the largest and most voluminous data sets?  
  • If data makes it to review, do you want to be clever with those coding decisions and bolt them back onto your ESI warehouse, ensuring that these costly coding decisions get reused to help guide attorneys for future matters?
  • If redactions are being made for PII, PHI, trade secrets, etc., would it be helpful to carry those coding decisions and redactions forward for each new matter containing that identical record?
  • For data that has been processed and is no longer responsive to legal hold, wouldn’t it help to be able to safely and easily release the data from your ESI warehouse and where it lives in the wild?

AN ANSWER TO YOUR QUESTIONS

These questions have led us to create a 360° approach to the litigation lifecycle that saves millions of dollars and thousands of hours of time previously spent in collection, processing, and review. Perhaps even more important, it delivers consistency across future reviews and productions. As the intelligence layer grows over time, your ESI warehouse becomes smarter, more agile, and exponentially more valuable.

Combining enterprise-grade collection, processing, and review technology with knowledgeable experts can help you build a defensible, repeatable, and future-proof eDiscovery and information governance program. In short, putting an end to ‘end-to-end.’

Source: https://www.nuix.com/blog/ending-end-end