The Impact of JSON-LD Metadata on ChatGPT Visibility
12 Pages Posted: 19 Nov 2025
Date Written: October 01, 2025
Abstract
Purpose: This study examines whether implementing structured metadata (JSON-LD Schema.org markup) on websites can improve a business’s visibility in ChatGPT responses. We focus on real estate agencies as a case study, analyzing which site attributes correlate with an agency being “known” or referenced by ChatGPT.
Methods: We gathered public data on 1,508 real estate agents in Germany and identified which of these agents ChatGPT could provide information about (indicating ChatGPT visibility). For each agent’s website, we recorded the presence of Schema.org metadata (FAQPage, Organization, Product schemas), as well as SEO/technical factors like mobile optimization, robots.txt, sitemap, headings usage, image alt-text, internal links, and page load speed. A logistic regression was used to determine which factors significantly predict ChatGPT visibility, controlling for other variables.
Results: Agents whose websites included FAQPage schema markup were far more likely to be visible on ChatGPT (6.2% of visible agents had FAQ schema vs. only 0.8% of non-visible; p = 0.002). Presence of Product schema (e.g. schema for listings or services) also strongly correlated with visibility (17.2% vs 1.8%; p < 0.001). Key technical SEO features were likewise more common among ChatGPT-visible agents: nearly all had mobile-friendly sites (99.0% vs 88.8% of non-visible; p < 0.001) and a robots.txt file (92.3% vs 80.6%; p < 0.001). Visible agents’ sites were more likely to use structured headings (94.7% had an <h2> vs 74.4% of non-visible; 86.6% had an <h3> vs 61.4% of non-visible; p < 0.001), and their pages loaded faster on average (418 ms vs 623 ms; p = 0.014). A multivariate logistic regression confirmed FAQPage schema as the strongest positive predictor of ChatGPT visibility (odds ratio ≈ 13, p < 0.001), followed by Product schema (OR ≈ 4, p < 0.001). Other significant factors included mobile optimization (OR ≈ 5.2, p = 0.026), presence of a robots.txt (OR ≈ 3.4, p < 0.001), and the use of multiple heading levels (having <h2>: OR ≈ 3.3; <h3>: OR ≈ 2.3; both p < 0.001).
Conclusions: Our findings provide empirical support that adding Schema.org metadata – particularly FAQ schemas that supply question-answer pairs – is associated with significantly higher chances of being recognized or cited by AI chat systems like ChatGPT. In practice, this suggests that businesses can improve their AI-driven visibility by adopting structured data and following strong SEO practices. We discuss the implications for “Generative Engine Optimization” (GEO) and estimate the potential monetary benefits of increased AI visibility, which can be substantial given the high client value in real estate.
Keywords: ChatGPT, GEO, Generative Engine Optimization, JSON-LD, Schema.org, FAQPage schema markup, AI Visibility, Online Marketing
Suggested Citation: Suggested Citation