Members-Only
Recent Talks & Demos are for members only
You must be an AI Tinkerers active member to view these talks and demos.
WordPress AI Vulnerability Hunting
Discover an AI-assisted workflow for finding and validating WordPress plugin vulnerabilities, from code scanning to proof-of-concept generation and reporting.
I built a repeatable AI-assisted workflow for finding and validating vulnerabilities in WordPress plugins. In the demo, I’ll show the process end-to-end: selecting a plugin, launching a controlled test environment with a custom harness, then using Codex and a tailored AGENTS.md to scan the source, trace risky execution paths, validate the finding with a proof of concept, and generate a disclosure-ready report.
- CodexCodex is OpenAI's autonomous AI software engineering agent: it executes full development tasks in a sandboxed cloud environment.Codex is the advanced, cloud-based software engineering agent from OpenAI, built on a specialized model like `codex-1` (a fine-tuned version of `o3`). It operates on an asynchronous delegation model, allowing developers to assign complete tasks—not just receive suggestions—via the ChatGPT interface. The agent works independently in a secure, isolated cloud container provisioned with the user's GitHub repository and environment. It reads code, writes new features, fixes bugs, runs tests, and drafts Pull Requests (PRs) for review, significantly accelerating the development lifecycle. Access is provided through ChatGPT Plus, Pro, and Enterprise plans.
- AGENTSAutonomous software entities using large language models to reason, select tools, and execute complex workflows independently.Agents shift the focus from conversation to execution: they use frameworks like LangGraph or CrewAI to break down complex objectives into actionable tasks. These systems leverage external tools (Tavily for search, GitHub for code, or Salesforce for CRM) to operate across digital environments. Current benchmarks show agents can automate up to 80% of routine knowledge work by managing their own reasoning loops. These entities deliver finished outputs (validated data, resolved tickets, or deployed software) with minimal human intervention.
- WordPressThe world's leading open-source Content Management System (CMS): it powers over 43% of all websites globally, dominating the CMS market with a 60%+ share.WordPress is the definitive open-source CMS, built on PHP and MySQL, and known for its unparalleled flexibility and massive ecosystem. It powers over 43% of all websites on the internet, including major sites like CNN and NBC, securing its 60%+ dominance in the CMS market. The platform's core strength lies in its extensibility: users access over 60,000 free plugins through the official repository (total plugins exceed 90,000), along with tens of thousands of themes for instant design changes. For e-commerce, the WooCommerce plugin is a powerhouse, running approximately one-third of all online stores. This robust, community-driven architecture allows WordPress to scale from a simple blog to a complex corporate website or a full-scale online shop.
- PHPPHP is the open-source, server-side scripting language (Hypertext Preprocessor) driving dynamic web content: it is the core engine for platforms like WordPress and Drupal.PHP (Hypertext Preprocessor) is a powerful, open-source, server-side scripting language built specifically for web development: it handles everything from database integration (MySQL, PostgreSQL) to session management. Its ubiquity is undeniable, serving as the foundational technology for major Content Management Systems (CMS) like WordPress and Drupal. The language runs on the Zend Engine, and recent releases, specifically the PHP 8.x series, introduced significant performance gains, including Just-In-Time (JIT) compilation, cementing its status as a fast, reliable, and continuously evolving platform for building modern, high-traffic web applications.
- Proof of ConceptA Proof of Concept (POC) is a targeted, low-fidelity exercise designed to verify that a product, feature, or technology is technically feasible before committing budget and engineering hours.In product development, a Proof of Concept isolates the riskiest technical assumption of a project and tests it in a controlled environment. Unlike a prototype (which tests usability) or an MVP (which tests market demand), a POC answers one specific question: can we actually build this? For example, a fintech team might build a raw, unpolished script to verify if a legacy API can process transactions in under 200 milliseconds. By focusing strictly on feasibility rather than design, a POC gives stakeholders the empirical evidence needed to greenlight a project or pivot early, saving hundreds of development hours.