As we explored in Part 1, improving the AI-readiness of your developer portal does not mean rebuilding everything from scratch. A focused, strategic approach can make your content more usable, discoverable, and valuable, both to developers and AI-powered tools. Here's how to start:
1. Apply the Pareto Principle: Prioritize high-impact content
Improving your entire content library can be daunting. Start with the 20% of your content that drives 80% of user interactions and AI value. Focus first on the Minimum Viable Documentation (MVD):
- API Reference: Use structured formats like OpenAPI for REST APIs to expose endpoints and schema in a machine-readable way.
- Onboarding guides: Break down workflows into clear, step-by-step instructions. Use visual elements with descriptive alt text and consistent internal links.
- API overviews: Provide plain-language summaries of API products, focusing on capabilities and business value for non-technical users.
For these critical content types, aim for machine-readable formats such as Markdown and YAML. If your current writing tools create unstructured outputs (e.g., Google Docs, Word), ensure the final content is converted to formats that AI systems can parse reliably.
2. Improve documentation quality with a targeted content audit
Before automating anything, audit your existing content. Assess:
- Accuracy and relevance
- Usage patterns (via site analytics or support data)
- Gaps in structure or clarity
Use this to identify what to improve or rewrite. This supports AI-readiness and elevates the overall developer experience.

You can fill out our self-assessment to gain valuable insights into elevating your user experience and documentation quality.
Also, review your editorial process. Establish writing standards that ensure consistency in tone, terminology, structure, and metadata. These help AI models deliver accurate answers and recommendations.
3. Strengthen content hierarchy and metadata
A strong information architecture is essential for AI to understand your content ecosystem. Focus on:
- Consistent and straightforward URL structures to reflect hierarchy and topic groupings.
- Bad examples:
-
www.zero-gravity.org/node/32
-
www.zero-gravity.org/bpapi-tutorial
-
- Good examples:
-
www.zero-gravity.org/apis-for-the-universe
-
www.zero-gravity.org/api-catalog/best-product-api/tutorial
-
- Bad examples:
- Parent-child relationships between content types (e.g., concept → tutorial → reference).
- Bad examples:
-
www.zero-gravity.org/best-api-product-reference
-
www.zero-gravity.org/best-api-product-reference-tutorial
-
- Good examples:
-
www.zero-gravity.org/api-catalog/best-product-api/api-reference
-
www.zero-gravity.org/api-catalog/best-product-api/tutorial
-
- Bad examples:
- Rich metadata including tags, categories, versions, and supported features.


This foundation allows AI tools to grasp how topics relate, enabling smarter search, personalization, and recommendation systems.
We discussed earlier how a healthy content hierarchy and consistent metadata also improve human usability. These principles equally benefit machines: a clear win-win.
4. Make content discoverable - Internally and for AI
Well-structured internal- and cross-linking boosts usability and AI comprehension. Use:
- Manual linking to connect related topics, API references, and tutorials.
- Automated linking tools like Drupal’s SmartLinker AI to scale internal linking consistently.
Good linking not only guides users, it helps AI models understand topical clusters, which improves their ability to retrieve, summarize, and contextualize content correctly.
5. Convert legacy content to structured formats
Legacy documentation often lives in PDFs, Word files, or unstructured HTML. These formats can be expensive for AI to process and difficult to extract meaning from. Here is how to handle them:
HTML to Markdown
As we discussed, AI tools often charge based on the number of tokens processed, and HTML (the traditional structured content format of web pages) adds significant overhead. Markdown is cleaner, cheaper to process, and retains essential structure.
Use tools like Markdownify and llms.txt to convert web content from HTML to Markdown and offer an entry point to AI tools.



To put the difference into perspective, we calculated the number of tokens in different formats needed to transmit the same information using the GPT-4o & GPT-4o mini Tokenizer:
- The original HTML version contains 40 988 characters, which breaks down into 11 184 tokens due to its heavy use of tags and styling elements.
- After converting the same content into Markdown (preserving only the essential structure and meaning), the size drops to 6 577 characters and just 1 580 tokens.
Markdown can be 5 to 10 times more token-efficient than HTML, meaning it is significantly cheaper and faster for AI systems to process.
PDF to Markdown
PDFs are notoriously inconsistent, especially with mixed content types. If you have tagged PDFs, prioritize converting those first, as they contain some structural hints.
Tools like Marker (an open-source extractor) can convert PDFs into Markdown. This makes content easier to reuse in AI training, indexing, and content generation.
Let’s take a look at an example text-based but semantically untagged PDF.



After converting the PDF content to Markdown (preserving only the essential structure and meaning), the output contains 5 741 characters and 1 148 tokens. The conversion took 7.5 seconds.
Can an AI-based search engine access external PDF content?
AI-based search engines cannot access external PDF content by default.
AI search engines on your site can only access internal content and uploaded files. If your portal only links to external PDFs, the AI won't know what is inside them, leading to poor search results. Similarly, if your site links to PDFs hosted elsewhere, the AI cannot access them unless it has permission, the PDFs are public and indexable, or your search system crawls and caches them.

So if you rely solely on links to external PDFs, your search engine becomes more like a directory, not a semantic search tool.
What would an AI need to return valuable results from PDFs?
To deliver AI-powered search results for PDFs, the PDFs must be accessible to the search system, well-structured, and associated with relevant metadata.
Best practice
If the content must live externally (e.g., for compliance), then:
- Periodically import or sync the key external PDFs into your portal.
- Extract and index their content in a structured way.
- Present users with summarized or previewed content with the option to open the full PDF.
In summary, conversion is not just about format, it also highlights underlying quality issues. Fixing those will enhance both human and machine experiences.
Bonus: Use AI to enhance content quality
AI can also support content creation and enrichment. In Drupal, for example, AI Automators can perform specific enhancements, like generating alt text for images, improving both accessibility and AI comprehension.
Unlike AI Agents that interpret and respond to user inputs dynamically, Automators apply predictable transformations that ensure your content meets quality and format standards consistently.
Final thoughts on AI-readiness
AI-readiness is not a one-time fix, it is an ongoing content maturity process. Based on our experience, effective content management supports collaboration by accommodating different contributor roles. A robust content management platform, such as Zero Gravity provides non-technical writers with an intuitive environment for creating documentation, while also supporting developers through CI/CD pipeline integration for code-centric workflows. Enabling contributions across roles strengthens collaboration and improves content quality.
We recommend starting with high-value assets, improving their structure and clarity, and converting your legacy content to more machine-friendly formats. As your portal grows, well-governed content will not only support developers more effectively but also unlock the full potential of AI-powered assistance, search, and generation.
The effectiveness of AI features like smart search, contextual help, and automated troubleshooting is affected by the quality of the content on your developer portal. The amount of effort needed to prepare your content for AI depends on how much importance your company has placed on content and documentation.
Organizations that have already invested in creating well-structured, semantically rich, and up-to-date content will find themselves better positioned for AI adoption.
For companies where documentation and content were not a primary focus, there will be a considerable amount of work needed to assess, convert, structure, and enrich their existing legacy content before they can fully leverage the benefits of AI.
Strategies for improving AI-readiness include conducting content audits, improving content hierarchy, and using tools to convert legacy content to more structured formats.
While content strategy may not be the only ingredient in a successful AI journey, with it, you create the conditions for AI to truly enhance the developer experience. From there, data infrastructure, tooling, and cross-functional collaboration can build on this foundation to drive sustainable AI-driven transformation.

Fill out our technology-agnostic assessment, receive actionable insights to enhance user experience and documentation quality.
Resources
- Elements of a well-structured OpenAPI Specification
- AI success begins with a strong developer portal content strategy
- Onboarding As a Key Aspect of Business Success
- Preparing for a World Mediated by AI
- 3 ideas on AI readiness, the role of APIs and developer portals in generative AI systems
- Fabrizio Ferri-Benedetti’s Should you write documentation differently for LLMs?
- Meet your future co-worker: Understanding the rise of AI Agents
- Chris Despopoulos’:
- Dries Buytaert’s How AI could reshape CMS platforms
- AI-Ready Content Accelerator
- Unlocking the Value of AI-Ready Content: Navigating Regulatory Compliance
- Before you invest in AI, assess your AI-readiness + Reduce image hide-and-seek with artificial intelligence
- How to Prepare Content for AI
- Technical Writing Guidelines to Create AI Friendly Content
- How “AI-Ready” Is Your Content?
- AI in CMS: What Can You Really Do with Your Website?
- Preparing Product Content for AI: Unified Knowledge and Governance in Technical Documentation
- 5 Key Benefits of Integrating AI into Your Business
- AI-ready data: Roadblocks, best practices, benefits, applications, tools and technologies, and future trends
All Pronovix publications are the fruit of a team effort, enabled by the research and collective knowledge of the entire Pronovix team. Our ideas and experiences are greatly shaped by our clients and the communities we participate in.