Convert a PNG screenshot of a neural network architecture diagram from a research paper into an editable .vsdx Visio file with native shapes you can modify.
Rebuild a model architecture figure from an arXiv paper as a Visio diagram to customize colors, reorder blocks, and update labels before reusing in your own publication.
Use the included PowerShell scaffold helpers to have Codex draw rectangles, arrows, and text at specific coordinates in a new Visio page.
Requires Windows with Microsoft Visio installed, the COM automation layer will not work without a licensed Visio installation.
This repository is a skill for OpenAI Codex, the AI coding assistant, that helps researchers convert paper figures into editable Microsoft Visio diagrams. The core use case is a researcher who has a PNG screenshot, a reference image, or an AI-generated model architecture diagram and needs it rebuilt as a .vsdx file made of native Visio shapes rather than a flat image paste. The distinction matters in academic work: a .vsdx file with native rectangles, arrows, text boxes, and groups lets an author change colors, reorder blocks, update labels, and adjust spacing before submitting or publishing. Embedding the original PNG instead looks the same at first glance but cannot be edited at the element level. The skill defines a structured workflow for Codex to follow. It starts with analyzing the reference figure to identify panels, modules, arrows, dashed feedback lines, repeated elements like heatmap grids or node sequences, and any text that must be reproduced exactly. It then maps pixel coordinates from the reference image to Visio page coordinates and calls Visio's COM automation API to draw each element as a native shape. A separate validation step checks that the final .vsdx contains no large embedded image files, that all text remains editable, and that the layout matches the reference. The repository includes two PowerShell scripts. One is a utility for backing up .vsdx files, exporting page previews as PNG, and inspecting the internal package structure of a .vsdx file. The other is a scaffold that provides helper functions for drawing rectangles, ovals, lines, arrows, and text at specified coordinates, giving Codex a starting point for writing a rebuild script. The default style settings follow academic paper conventions: Times New Roman font, 0.9 to 1.2 point borders, low-saturation background colors, and consistent arrow styles. The tool runs on Windows with Microsoft Visio installed, since the COM automation layer requires Visio to be present.
← pengjunchi0 on gitmyhub — every repo by this author, as a profile.
Verify against the repo before relying on details.