A participant at a hackathon event. A participant at a hackathon event.

DG Developing Open Source AI Tool to Help with the Fight against Corruption

April 23, 2024
Gabriel Inchauspe, Kelley Sams, DG Comms
Program, Thought Leadership

Development Gateway: An IREX Venture (DG) is pleased to announce that—with support from Accountability Lab (AL)—we’re developing an open-source AI tool that we hope to register as a Digital Public Good and can be used by those fighting corruption as they develop innovative digital tools. The need for the tool was identified in DG’s work with AL’s HackCorruption initiative. 

Once the tool is developed, users will be able to extract data from various types of documents. Having an open source tool for this type of work is essential to anti-corruption efforts because a lack of machine-readable data that cannot be easily aggregated adds to the opacity that can hide corruption. This tool will increase accessibility of and transparency around hard-to-access information, which will advance accountability. 

What is HackCorruption?

HackCorruption is a series of events and policy work led by AL and aimed at supporting  innovative solutions to identify and combat corruption. HackCorruption is supported by the Bureau of International Narcotics and Law Enforcement in the U.S. Department of State and the USAID Countering Transnational Corruption Grand Challenge for Development, in partnership with the Center for International Private Enterprise and DG. 

Why develop this tool?

The need for this tool was identified as part of DG’s work with HackCorruption regional hackathon teams. Several teams that participated in the South America hackathon in April 2023 discovered that they were limited in the work they could do by not having access to free tools to scrape information from documents, including government contracts. Having access to this sort of information allows anti-corruption tools to aggregate,  monitor, and identify potential corruption. 

Acting as a mentor to several of these teams, DGer Gabriel Inchauspe knew such AI tools exist, but none of them are open source and therefore, anti-corruption workers are limited in accessing them. Inchauspe and DGer Kelley Sams then collaborated with AL to obtain funding and define the scope of a project to create such a tool, which will support and ease the administrative burden faced by anti-corruption workers.

“Anti-corruption tool developers and advocates will be able to use this new tool to quickly extract significant amounts of data. They can then use anti-corruption formulas to identify red flags within the extracted data. For example, if data indicates that consistently low competition exists on certain government contracts, corruption may be present or that institution may be more vulnerable to corruption. Because this tool will allow anti-corruption work to be done more efficiently and on a larger scale, it will help strengthen accountability and ultimately, strengthen governing and civil society institutions.” 

Gabriel Inchauspe

What will the tool do?

The AI tool—a beta version of which DG plans to launch by the end of 2024—will be based on publicly available source code and will be trained on ethically sourced data, with multiple rounds of testing done to ensure that the tool is statistically unlikely to produce non-existent or inaccurate information (i.e., hallucinate). 

In the first phase of developing the tool, it will be able to capture information from documents in English that are in a standardized format (e.g., pdf, JSON, Excel, OpenContracting formatting, etc.). A planned second phase is to expand the tool to be able to extract information from documents or photo files that are handwritten and/or contain languages other than English, including those using non-Roman alphabets. 

“Having access to an open-source AI tool to easily gather inaccessible information will give teeth to anti-corruption laws. While many anti-corruption laws require governments to share information on contracting, finance, etc., this information is often shared in inaccessible formats that prevent or reduce the ability to identify corruption. Creating this tool will make these anti-corruption laws and efforts more meaningful by strengthening mechanisms to hold those in power accountable.”

Kelley Sams

In addition to our long history of creating and implementing technical solutions to advance transparency, DG is uniquely positioned to develop this tool given our previous success in ensuring digital solutions are used and supported for years after development. After all, many tools in the anti-corruption space exist or are being created, but fewer are used and scaled. As we develop this tool, DG is committed to ensuring the tool’s design and usability are responsive to anti-corruption workers’ needs and ultimately, strengthen anti-corruption and accountability efforts.

Stay tuned for more as the tool is developed! 

Share

Recent Posts

Three “Hacks” in Advancing Anti-Corruption Work To Strengthen Accountability

Since partnering with Accountability Lab (AL) on HackCorruption in 2023, staff from Development Gateway: An IREX Venture (DG) and AL have mentored teams from the hackathons hosted by the project, which is aimed at leveraging innovative digital tools to identify and fight corruption. After serving as the mentorship team for hackathon winners, DGers Gabriel Inchauspe and Kelley Sams, along with AL’s David Sada have identified three hacks in mentoring anti-corruption change-makers and ultimately, furthering transparency and accountability through the creation and use of digital tools.

April 3, 2024  
aLIVE Program Reaches Milestone: Livestock Data Standards Endorsed by Ethiopia’s Ministry of Agriculture

In early 2024, the "a Livestock Information Vision Ethiopia" (aLIVE) governing committee endorsed a comprehensive set of standards to guide the collection, storage, and maintenance of livestock data in Ethiopia (i.e., a data standard). The data standard specifically focuses on standardizing data on cattle, sheep, goats, and camels in the country. The National Livestock Data Standard document contains standardized data sets for national animal data recording, animal disease, diagnosis, treatment, vaccination recording, animal events recording, location, and other additional attributes.  

March 28, 2024 Agriculture
Custom Assessment Landscape Methodology 2.0 – Reflections After Five Years

Our new white paper re-introduces our flagship methodology, Custom Assessment Landscape Methodology (CALM), to help our partners, collaborators, and teammates better engage with a flexible and adaptive assessment methodology.

March 19, 2024 Process & Tools