Supported Japanese sites

Each site is listed with its support status, supported domains, available operations, allowed outputs, and known limitations.
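As a rough illustration of these fields, one row could be modeled as a small record. This is only a sketch: the `SiteEntry` class, its field names, and the `allows_output` helper are hypothetical and do not describe crawl-hub's actual API. The example values are taken from the Suumo row.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class SiteEntry:
    """Hypothetical shape of one registry row; field names are assumptions."""
    site: str
    status: str                      # e.g. "beta" or "disabled"
    domains: Tuple[str, ...]
    operations: Tuple[str, ...]
    outputs: Tuple[str, ...]
    known_limitations: str = ""

    def allows_output(self, fmt: str) -> bool:
        # Case-insensitive check against the row's allowed outputs.
        return fmt.lower() in self.outputs


suumo = SiteEntry(
    site="Suumo",
    status="beta",
    domains=("suumo.jp",),
    operations=("auto", "sold"),
    outputs=("html", "json"),
    known_limitations="P0/P1 managed candidate.",
)

print(suumo.allows_output("json"))
print(suumo.allows_output("csv"))
```

A frozen dataclass is used here so a row cannot be mutated after it is loaded; that is a design preference of the sketch, not a documented requirement.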

No affiliation: crawl-hub is not affiliated with, endorsed by, or an official API of any listed site. Support status describes crawl-hub behavior for public URL patterns only.

| Site | Status | Domains | Operations | Outputs | Known limitations |
| --- | --- | --- | --- | --- | --- |
| 2Nd Street | beta | 2ndstreet.jp, www.2ndstreet.jp | auto, sold | html, json | P0 managed reuse EC candidate. |
| Amazon Jp | beta | amazon.co.jp, www.amazon.co.jp | auto, sold | html, json | P0 managed candidate, but anti-bot volatility means registry should emphasize unsupported/failure observation over guaranteed success. |
| At Cosme | beta | cosme.net, www.cosme.net | auto, sold | html, json | P0 managed parse-html candidate; direct fetch fixture and parser golden output required. |
| At Cosme L L M | beta | cosme.net, www.cosme.net | auto, sold | html, json | Datastructor parser variant promoted to beta for local validation; registry aliasing and expected schema still need normalization. |
| Axes | beta | axes-net.com, www.axes-net.com | auto, sold | html, json | P0 managed fashion/luxury EC candidate. |
| Beauty Hotpepper | beta | beauty.hotpepper.jp | auto, sold | html, json | P0 managed parse-html candidate for beauty/salon data. |
| Beauty Hotpepper Jp | beta | beauty.hotpepper.jp | auto, sold | html, json | Parser variant from managed; keep separate until normalized into one site/work_type mapping. |
| Brandoff | beta | brandoff-store.com, www.brandoff-store.com | auto, sold | html, json | P0 reuse/luxury EC overlap candidate. |
| Daikokuya | beta | daikokuya78.com, www.daikokuya78.com | auto, sold | html, json | P0 managed reuse/luxury EC candidate. |
| Digimart | beta | digimart.net, www.digimart.net | auto, sold | html, json | P0 managed musical-instrument marketplace candidate. |
| Gallery Rare | beta | galleryrare.jp, www.galleryrare.jp | auto, sold | html, json | P0 reuse/luxury EC overlap candidate. |
| Ginzo | beta | ginzo.jp, www.ginzo.jp | auto, sold | html, json | P0 reuse/luxury EC overlap candidate. |
| Goo Net Car | beta | goo-net.com, www.goo-net.com | auto, sold | html, json | P0 managed parse-html candidate for used-car data. |
| Hedy | beta | hedy.jp, www.hedy.jp | auto, sold | html, json | P0 managed reuse/luxury EC candidate. |
| Hello Work | beta | hellowork.mhlw.go.jp | auto, sold | html, json | P0 managed public job-data candidate. Confirm acceptable request pattern before live tests. |
| Hojyokin Portal | beta | hojokin-portal.jp, www.hojyokin-portal.jp, hojyokin-portal.jp | auto, sold | html, json | P0 managed subsidy-data candidate. Domain spelling should be verified during registry import. |
| Houjin Goo | beta | houjin.goo.to | auto, sold | html, json | P0 managed company-data candidate; verify domain and representative URL before live tests. |
| Irbank Net | beta | irbank.net | auto, sold | html, json | P0 managed financial/company-data candidate. |
| Itownpage | beta | itp.ne.jp, www.itp.ne.jp | auto, sold | html, json | P0 managed local-business data candidate. Confirm privacy/compliance fit before broad enablement. |
| Kitamura Camera | beta | shop.kitamura.jp | auto, sold | html, json | P0 managed candidate. Internal helper dependencies must stay hidden behind managed fetch. |
| Komehyo | beta | komehyo.jp, www.komehyo.jp | auto, sold | html, json | P0 reuse/luxury EC overlap candidate. |
| Mandarake | beta | mandarake.co.jp, order.mandarake.co.jp | auto, sold | html, json | P0 managed collector/reuse EC candidate. |
| Maonline | beta | maonline.jp, www.maonline.jp | auto, sold | html, json | P0 managed business/news parser candidate. Avoid storing article body artifacts by default. |
| Mercari | beta | jp.mercari.com, mercari.com | auto, sold | html, json | P0 managed candidate promoted to beta for local live-route testing; signature/DPoP dependencies can still affect fetch reliability. |
| Monotaro | beta | monotaro.com, www.monotaro.com | auto, sold | html, json | P0 overlap candidate with industrial EC parser coverage. |
| Netmall | beta | netmall.hardoff.co.jp | auto, sold | html, json | P0 overlap candidate for reuse EC. Promote only after fixture tests cover sold/active item differences. |
| Okura | beta | okura-online.jp, www.okura-online.jp | auto, sold | html, json | P0 managed reuse/luxury EC candidate. |
| Rakuma | beta | fril.jp, item.fril.jp | auto, sold | html, json | P0 managed marketplace candidate. |
| Rakuten | beta | rakuten.co.jp, item.rakuten.co.jp | auto, sold | html, json | P0 managed EC candidate; fixture URL and golden JSON required before stable. |
| Reclo | beta | reclo.jp, www.reclo.jp | auto, sold | html, json | P0 managed reuse/luxury EC candidate. |
| Recruit Agent | beta | r-agent.com, www.r-agent.com | auto, sold | html, json | P0 managed job-data candidate. Confirm allowed modes and rate limits before live tests. |
| Retro | beta | retro.jp, www.retro.jp | auto, sold | html, json | P0 managed reuse/luxury EC candidate. |
| Shareview | beta | shareview.jp, www.shareview.jp | auto, sold | html, json | P0 managed review/product-data candidate. |
| Shimamura | beta | shop-shimamura.com, www.shop-shimamura.com | auto, sold | html, json | P0 managed retail parser candidate. |
| Snkrdunk | beta | snkrdunk.com, www.snkrdunk.com | auto, sold | html, json | P0 overlap candidate; route behavior should be observed before stable promotion. |
| Subsidy | beta | jgrants.go.jp, www.jgrants.go.jp | auto, sold | html, json | P0 managed subsidy-data candidate; verify canonical domain and schema. |
| Surugaya | beta | suruga-ya.jp, www.suruga-ya.jp | auto, sold | html, json | P0 overlap candidate with managed scrape and managed parse-html coverage. |
| Suumo | beta | suumo.jp | auto, sold | html, json | P0/P1 managed candidate. TLS may be enabled later by feature flag if direct is unreliable. |
| Suumo Base | beta | suumo.jp | auto, sold | html, json | Parser base/alias row from managed promoted to beta for local validation; work_type mapping still needs normalization. |
| Suumo Buy New Apartment | beta | suumo.jp | auto, sold | html, json | Datastructor SUUMO new-apartment parser candidate. |
| Suumo Buy New Apartment Child | beta | suumo.jp | auto, sold | html, json | Child/detail parser variant promoted to beta for local validation; schema and parent mapping still need finalization. |
| Suumo Rental | beta | suumo.jp | auto, sold | html, json | Datastructor SUUMO rental parser candidate. |
| Tamtam | beta | hs-tamtam.co.jp, www.hs-tamtam.co.jp | auto, sold | html, json | P0 managed retail/hobby parser candidate. |
| Techno Aid | beta | techno-aids.or.jp, www.techno-aids.or.jp | auto, sold | html, json | P0 managed public/assistive-device data candidate. |
| Townwork | beta | townwork.net, www.townwork.net | auto, sold | html, json | P0 managed job-data candidate. Confirm live-test policy and rate limits before enabling release gating. |
| Trefac | beta | trefac.jp, ec.treasure-f.com | auto, sold | html, json | P0 overlap candidate. Datastructor parser name may be trefac_online; normalize aliases in registry import. |
| Ueda | beta | ueda78.com, www.ueda78.com | auto, sold | html, json | P0 managed reuse/luxury EC candidate. |
| Unsupported Playwright Required | disabled | example-playwright-required.invalid | auto, sold | html | Sentinel row for unsupported tests. Phase 1 must return explicit unsupported/disabled behavior for sites requiring browser execution. |
| Unsupported Scraperapi Required | disabled | example-scraperapi-required.invalid | auto, sold | html | Sentinel row for unsupported tests. Phase 1 must not silently route to ScraperAPI/proxy providers. |
| Vector | beta | vector-park.jp, www.vector-park.jp | auto, sold | html, json | P0 candidate, but technical architecture calls out possible internal Lambda/helper dependency. Keep external route normalized. |
| Yahoo Auction | beta | auctions.yahoo.co.jp, page.auctions.yahoo.co.jp | auto, sold | html, json | P0 candidate. Any Lambda/proxy helper dependency must remain internal to managed fetch and must not be exposed as a route. |
| Yahoo Fleamarket | beta | paypayfleamarket.yahoo.co.jp | auto, sold | html, json | P0 managed marketplace candidate; fixture URL and parser golden output still required. |
| Yahoo Shopping | beta | shopping.yahoo.co.jp, store.shopping.yahoo.co.jp | auto, sold | html, json | P0 managed EC candidate; use registry observation before promising broad Yahoo Shopping coverage. |
| Yodobashi | beta | yodobashi.com, www.yodobashi.com | auto, sold | html, json | P0 overlap candidate with managed parser coverage. |
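
The sentinel rows call for explicit unsupported/disabled behavior rather than silent fallback. A minimal sketch of how a caller might resolve a URL against rows like these is shown below. It assumes exact hostname matching against the listed domains; the `REGISTRY` layout, the `resolve` function, and the result keys are hypothetical and are not crawl-hub's real interface.

```python
from urllib.parse import urlsplit

# Three rows copied from the table; the dict layout is an assumption.
REGISTRY = [
    {"site": "Mercari", "status": "beta",
     "domains": {"jp.mercari.com", "mercari.com"}},
    {"site": "Suumo", "status": "beta",
     "domains": {"suumo.jp"}},
    {"site": "Unsupported Playwright Required", "status": "disabled",
     "domains": {"example-playwright-required.invalid"}},
]


def resolve(url: str) -> dict:
    """Return an explicit supported/unsupported result for a URL (hypothetical)."""
    host = (urlsplit(url).hostname or "").lower()
    for row in REGISTRY:
        if host in row["domains"]:
            if row["status"] == "disabled":
                # Disabled rows are reported explicitly, never rerouted.
                return {"supported": False, "site": row["site"], "reason": "disabled"}
            return {"supported": True, "site": row["site"], "status": row["status"]}
    # Hosts that match no row are also reported explicitly.
    return {"supported": False, "site": None, "reason": "unknown_host"}


print(resolve("https://jp.mercari.com/item/m123"))
print(resolve("https://example-playwright-required.invalid/page"))
```

Returning a structured result for disabled and unknown hosts, instead of raising or falling through to another provider, keeps the unsupported case observable to the caller.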