Announcing OpenAI-compatible API support for Amazon SageMaker AI endpoints

Today, Amazon SageMaker AI is launching OpenAI-compatible API support for real-time endpoints. If you use the OpenAI SDK, LangChain, or Strands Agents, you can now invoke models on SageMaker AI by changing only your endpoint URL. You don't need a custom client, a SigV4 wrapper, or rewritten code.
Overview
With this launch, SageMaker AI endpoints expose an /openai/v1 path that accepts chat completion requests and returns responses exactly as they come from the container, including streaming. The OpenAI-compatible path is enabled on all endpoints and inference components that use the standard SageMaker AI APIs and SDK.
SageMaker AI routes requests based on the endpoint name in the URL, so any OpenAI-compatible client works out of the box. You can now create time-limited bearer tokens for your endpoints and use them with your OpenAI clients.
For a working example that covers deployment and invocation, see the accompanying notebook on GitHub.
“We run AI coding agents that use multiple LLM providers through an LLM gateway (Bifrost) that speaks the OpenAI chat completions protocol. The bearer token feature lets us add SageMaker as an OpenAI-compatible endpoint, with no custom SigV4 signing, so it works natively with our gateway, the standard Vercel AI SDK, and the OpenAI SDK,” said Giorgio Piatti (AI/ML Engineer, Caffeine.AI).
Use cases
Agentic workflows on managed infrastructure
If you build multi-step AI agents with frameworks such as Strands Agents or LangChain, you can now run those workflows entirely on your SageMaker AI endpoints. Your agents call models through the OpenAI-compatible interface they were built on, while inference runs on dedicated GPU instances in your account.
Multi-model hosting with a single interface
If you run multiple models (for example, Llama for general tasks, a fine-tuned Mistral for a domain-specific task, and a small classification model), you can host them all on a single SageMaker AI endpoint using inference components. Each model gets its own resource allocation, and each can be invoked with the same OpenAI SDK. You don't need separate API clients or routing logic in your application code.
Serving fine-tuned models without code changes
If you fine-tune open source models for your specific use case, you can deploy them on SageMaker AI and call them through the OpenAI-compatible interface your applications already use. The only change is the endpoint URL. The rest of the application (SDK calls, streaming logic, prompt formatting) stays the same.
Solution overview
In this post, we walk through the following:
- How bearer token authentication works with SageMaker AI endpoints.
- Deploying and invoking a single-model endpoint.
- Deploying and invoking inference components to serve multiple models.
- Integrating with the Strands Agents framework.
Prerequisites
To follow along with this walkthrough, you must have the following:
- An AWS account with permissions to create SageMaker AI endpoints.
- The SageMaker Python SDK (pip install sagemaker).
- The OpenAI Python SDK (pip install openai).
- A model stored in Amazon Simple Storage Service (Amazon S3). For example, Qwen3-4B downloaded from Hugging Face.
- An AWS Identity and Access Management (IAM) execution role for creating endpoints, with the AmazonSageMakerFullAccess policy.
- An IAM execution role with the sagemaker:CallWithBearerToken and sagemaker:InvokeEndpoint permissions for invoking the endpoint.
Authenticating with bearer tokens
SageMaker AI OpenAI-compatible endpoints use bearer token authentication. The SageMaker Python SDK includes a token generator that creates time-limited tokens (valid for up to 12 hours) from your existing AWS credentials. No additional secrets or API keys are required.
The token carries your role or user identity, and it requires the sagemaker:CallWithBearerToken and sagemaker:InvokeEndpoint action permissions.
Generate a token
Use the following Python script to generate a token.
The token generator uses whatever AWS credentials are available in your environment: IAM user credentials, an instance profile on Amazon Elastic Compute Cloud (Amazon EC2), or an AWS IAM Identity Center (SSO) session.
The generate_token function produces a time-limited bearer token for authenticating with SageMaker APIs. By default, tokens are valid for 12 hours, although you can override this with the expiry parameter by passing a timedelta value anywhere between 1 second and 12 hours. The function accepts a Region, an optional aws_credentials_provider, and an expiry duration. If no AWS Region is provided, it falls back to the AWS_REGION environment variable. If no credentials provider is supplied, it resolves credentials using the default AWS credential chain, which searches multiple sources, including environment variables, ~/.aws/credentials, ~/.aws/config, container credentials, and instance profiles. For the full resolution order, see the Boto3 credentials documentation.
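The defaults described above can be pictured with a small stand-alone helper. This is a minimal sketch, not the SDK's implementation: resolve_token_settings and its two constants are hypothetical names introduced here only to illustrate the 1-second-to-12-hour bound and the AWS_REGION fallback. The errors it raises are also an assumption, because the text does not say how generate_token reports invalid input.

```python
import os
from datetime import timedelta

# Bounds stated in the text; the constant names are hypothetical.
MIN_EXPIRY = timedelta(seconds=1)
MAX_EXPIRY = timedelta(hours=12)


def resolve_token_settings(region=None, expiry=None, env=None):
    """Return (region, expiry) after applying the documented defaults.

    Hypothetical helper: it mirrors the behavior described for
    generate_token but does not call the SageMaker Python SDK.
    """
    env = os.environ if env is None else env
    # Default to the 12-hour maximum when no expiry is given.
    expiry = MAX_EXPIRY if expiry is None else expiry
    if not MIN_EXPIRY <= expiry <= MAX_EXPIRY:
        raise ValueError("expiry must be between 1 second and 12 hours")
    # Fall back to the AWS_REGION environment variable when no Region is given.
    region = region or env.get("AWS_REGION")
    if not region:
        raise ValueError("no Region given and AWS_REGION is not set")
    return region, expiry
```

For short-lived jobs, passing something like timedelta(minutes=15) keeps the token's lifetime close to the lifetime of the work it authorizes.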
Auto-refreshing tokens for long-running applications
For applications that run continuously, you can use an auto-refresh pattern with httpx that generates a fresh token for each request:
IAM permissions
The IAM role or user that invokes the endpoint needs the following permissions:
As a best practice, always scope Resource to the specific endpoint ARNs for InvokeEndpoint instead of using a wildcard. A bearer token generated from this role has the same level of access, so a narrowly scoped policy limits the blast radius if a token is unintentionally exposed. Note that CallWithBearerToken requires a wildcard ("*") for the Resource field. It doesn't support resource-level restrictions.
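The scoping rules above can be expressed as a policy document. The following is a sketch that builds one as a Python dictionary, so it can be printed as JSON or passed to an IAM API. The helper name build_invoke_policy, the account ID, and the endpoint name are placeholders for illustration. The two actions and the wildcard requirement come from the text.

```python
import json


def build_invoke_policy(endpoint_arns):
    """Return an IAM policy document as a dict (hypothetical helper)."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                # Scope invocation to the specific endpoints the caller needs.
                "Effect": "Allow",
                "Action": "sagemaker:InvokeEndpoint",
                "Resource": list(endpoint_arns),
            },
            {
                # CallWithBearerToken does not support resource-level scoping.
                "Effect": "Allow",
                "Action": "sagemaker:CallWithBearerToken",
                "Resource": "*",
            },
        ],
    }


if __name__ == "__main__":
    # Placeholder account ID and endpoint name, for illustration only.
    arn = "arn:aws:sagemaker:us-west-2:111122223333:endpoint/my-endpoint"
    print(json.dumps(build_invoke_policy([arn]), indent=2))
```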
How the token works
The bearer token is a base64-encoded SigV4 presigned URL. When you call generate_token, the SageMaker AI SDK builds a request to the SageMaker AI service's CallWithBearerToken action, signs it locally using your AWS credentials, and encodes the resulting presigned URL as a portable token string. No network call is made during token generation. Signing happens entirely on the client side. When you present this token to a SageMaker AI endpoint, the service decodes it, validates the SigV4 signature, confirms that the token hasn't expired, and verifies that the originating IAM identity has the required permissions. The effective lifetime of the token is the lesser of the expiry value and the remaining validity of the AWS credentials used to sign it.
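To make the mechanics concrete, the sketch below wraps a made-up presigned URL in base64, unwraps it again, and computes the effective lifetime as the shorter of two durations. It is illustrative only: the real token layout (base64 variant, any prefix, exact query parameters) is not specified in the text, and the function names are hypothetical. X-Amz-Expires is a standard SigV4 presigned-URL query parameter, but its presence in the real token is an assumption.

```python
import base64
from datetime import timedelta
from urllib.parse import parse_qs, urlsplit


def encode_token(presigned_url):
    """Wrap a presigned URL as URL-safe base64 text (illustrative format only)."""
    return base64.urlsafe_b64encode(presigned_url.encode("utf-8")).decode("ascii")


def decode_token(token):
    """Recover the presigned URL and its query parameters from a token."""
    url = base64.urlsafe_b64decode(token.encode("ascii")).decode("utf-8")
    params = {key: values[0] for key, values in parse_qs(urlsplit(url).query).items()}
    return url, params


def effective_lifetime(expiry, credential_remaining):
    """The token is usable for the shorter of the two durations."""
    return min(expiry, credential_remaining)
```

For example, a token requested with a 12-hour expiry but signed with role credentials that expire in one hour is usable for one hour, which is why short-lived session credentials effectively cap token lifetime.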
Security best practice: A bearer token carries the same authorization as the underlying AWS credentials used to generate it. Treat tokens with the same care as credentials. Scope the IAM role used for token generation to the minimum required permissions, specifically sagemaker:CallWithBearerToken and sagemaker:InvokeEndpoint, with InvokeEndpoint limited to only the endpoint ARNs that the caller needs to access. Don't generate tokens from roles with broad permissions, such as those granted by the AdministratorAccess or AmazonSageMakerFullAccess managed policies.
Don't store tokens on disk, in environment variables, in configuration files, in databases, or in distributed caches. Don't log tokens, and transmit them only over encrypted protocols such as HTTPS. Token generation is a local operation with no network call, so the recommended practice is to generate a fresh token at the point of use or to use the auto-refreshing httpx.Auth pattern described earlier. This reduces the risk of token leakage and helps you use a token with the maximum remaining validity. As a best practice, set the token expiry to the shortest duration your workload requires.
Deploy a single-model endpoint
A single-model endpoint hosts one model and serves requests directly. The following example deploys Qwen3-4B using the SageMaker AI vLLM Deep Learning Container on an ml.g6.2xlarge instance.
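As a rough sketch of what such a deployment involves, the helper below assembles the three request payloads for the low-level SageMaker create_model, create_endpoint_config, and create_endpoint APIs. This may differ from the approach in the accompanying notebook. The helper name is hypothetical. The container image URI, the S3 location, and the role ARN are parameters because the text does not give them. Any container-specific environment variables are omitted for the same reason.

```python
def build_deploy_requests(name, image_uri, model_s3_uri, role_arn,
                          instance_type="ml.g6.2xlarge"):
    """Return kwargs for create_model, create_endpoint_config, create_endpoint."""
    model = {
        "ModelName": name,
        "ExecutionRoleArn": role_arn,
        "PrimaryContainer": {
            "Image": image_uri,
            # Uncompressed model artifacts stored under an S3 prefix.
            "ModelDataSource": {
                "S3DataSource": {
                    "S3Uri": model_s3_uri,
                    "S3DataType": "S3Prefix",
                    "CompressionType": "None",
                }
            },
        },
    }
    config = {
        "EndpointConfigName": name,
        "ProductionVariants": [
            {
                "VariantName": "AllTraffic",
                "ModelName": name,
                "InstanceType": instance_type,
                "InitialInstanceCount": 1,
            }
        ],
    }
    # The endpoint reuses the same name as its configuration for simplicity.
    endpoint = {"EndpointName": name, "EndpointConfigName": name}
    return model, config, endpoint
```

With boto3 installed, each dictionary can be unpacked into the matching call on boto3.client("sagemaker"), for example create_model(**model).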
Note: SageMaker AI endpoints incur charges while in service, regardless of traffic. For more information, see the Amazon SageMaker AI pricing page.
The endpoint transitions to the InService status within a few minutes. Once it's ready, it serves both the standard SageMaker AI /invocations path and the OpenAI-compatible path at /openai/v1/chat/completions.
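Waiting for the InService status can be sketched as a simple polling loop. The function below is a hypothetical helper that takes any zero-argument callable returning the status string, so it can be exercised without AWS access. The polling interval and timeout are illustrative defaults, not recommendations from the original post.

```python
import time


def wait_until_in_service(get_status, poll_seconds=30, timeout_seconds=1800,
                          sleep=time.sleep):
    """Poll get_status() until it returns "InService".

    get_status is any zero-argument callable returning the endpoint status
    string. Raises RuntimeError on "Failed" and TimeoutError on timeout.
    """
    waited = 0
    while True:
        status = get_status()
        if status == "InService":
            return status
        if status == "Failed":
            raise RuntimeError("endpoint creation failed")
        if waited >= timeout_seconds:
            raise TimeoutError(f"endpoint still {status} after {waited}s")
        sleep(poll_seconds)
        waited += poll_seconds
```

With boto3, get_status could be a lambda that returns describe_endpoint(EndpointName=name)["EndpointStatus"] from a SageMaker client. The client's built-in endpoint_in_service waiter is an alternative to hand-written polling.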
Invoke the single-model endpoint
With the endpoint in service, invoke it using the OpenAI Python SDK. The base URL follows this format:
The model field is passed through to the container. Because SageMaker AI routes requests based on the endpoint name in the URL, you can leave this field empty or set it to match the model name your container expects.
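To show what an OpenAI-compatible client sends and receives, here is a standard-library sketch that builds a chat completion request (without sending it) and parses OpenAI-style streaming lines. The helper names are hypothetical, and base_url is whatever endpoint URL ending in /openai/v1 applies to your deployment. The tests use a placeholder host because the exact URL format is not reproduced here. The model field defaults to an empty string, as the text allows.

```python
import json
import urllib.request


def build_chat_request(base_url, token, messages, model="", stream=False):
    """Build (but do not send) a POST to <base_url>/chat/completions."""
    body = {"model": model, "messages": messages, "stream": stream}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def iter_stream_text(lines):
    """Yield text fragments from OpenAI-style server-sent event lines."""
    for line in lines:
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip blank keep-alive lines and comments
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break  # end-of-stream sentinel
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            text = (choice.get("delta") or {}).get("content")
            if text:
                yield text
```

In application code the OpenAI SDK handles both steps for you. The sketch only makes visible that the request is an ordinary HTTPS POST carrying the bearer token.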
Deploy an inference component endpoint
With inference components, you can host multiple models on a single endpoint, each with its own dedicated compute resource allocation. The model is associated with the component rather than with the endpoint configuration:
You can create additional inference components on the same endpoint to host multiple models with independent scaling and resource allocations.
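The association between a model and a component can be sketched as the request payload for the SageMaker create_inference_component API. The helper name and the default resource values (one accelerator, 1024 MB of memory, one copy) are illustrative assumptions, not figures from the original post. Size them for your own model.

```python
def build_inference_component_request(component_name, endpoint_name, model_name,
                                      accelerators=1, min_memory_mb=1024,
                                      copy_count=1, variant_name="AllTraffic"):
    """Return kwargs for the SageMaker create_inference_component API."""
    return {
        "InferenceComponentName": component_name,
        "EndpointName": endpoint_name,
        "VariantName": variant_name,
        "Specification": {
            # The model is attached here, not in the endpoint configuration.
            "ModelName": model_name,
            "ComputeResourceRequirements": {
                "NumberOfAcceleratorDevicesRequired": accelerators,
                "MinMemoryRequiredInMb": min_memory_mb,
            },
        },
        # Number of copies of the model to run on the endpoint.
        "RuntimeConfig": {"CopyCount": copy_count},
    }
```

Calling the helper twice with the same endpoint_name and different component and model names yields two payloads that share an endpoint while keeping separate resource allocations.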
Invoke inference components
To invoke a specific inference component, include its name in the URL path:
The following example shows two inference components on a shared endpoint, each targeted by a separate OpenAI client that shares a connection pool:
The shared httpx.Client lets both OpenAI client instances reuse the same TLS sessions and connection pool.
Integrate with Strands Agents
Strands Agents is an open source SDK for building AI agents. Because Strands Agents supports OpenAI-compatible model providers, you can now run multi-agent workflows entirely on your SageMaker AI infrastructure. This gives you the flexibility of agentic applications with the control of dedicated endpoints. Your data never leaves your account, and you choose exactly which model your agents use.
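A minimal sketch of the wiring, assuming that the Strands OpenAI provider (strands.models.openai.OpenAIModel) accepts a client_args dictionary and a model_id, as its public documentation describes. The helper name is hypothetical, and the base URL and token are supplied by you. Check the Strands documentation for the current signature before relying on this.

```python
def strands_openai_model_kwargs(base_url, token, model_id):
    """Return keyword arguments intended for strands.models.openai.OpenAIModel.

    Assumption: OpenAIModel forwards client_args to the underlying OpenAI
    client, so base_url and api_key select the SageMaker AI endpoint.
    """
    return {
        "client_args": {"base_url": base_url, "api_key": token},
        "model_id": model_id,
    }


# Intended use, assuming the strands-agents package is installed:
#   from strands import Agent
#   from strands.models.openai import OpenAIModel
#   agent = Agent(model=OpenAIModel(**strands_openai_model_kwargs(url, token, "my-model")))
```

Because bearer tokens are time-limited, a long-running agent would need a fresh token before the current one expires.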
Clean up
To avoid ongoing charges, delete your endpoints and associated resources when you're finished. SageMaker AI endpoints incur charges while in service, whether or not they receive traffic.
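A cleanup routine might look like the following sketch. It calls the standard SageMaker delete operations on whatever client object you pass in (for example, boto3.client("sagemaker")). The function name and argument layout are hypothetical, and the resource names are yours to supply.

```python
def clean_up(sagemaker_client, endpoint_name, endpoint_config_name,
             model_names=(), inference_component_names=()):
    """Delete inference components, the endpoint, its config, and models."""
    # Components are removed before the endpoint that hosts them.
    for name in inference_component_names:
        sagemaker_client.delete_inference_component(InferenceComponentName=name)
    sagemaker_client.delete_endpoint(EndpointName=endpoint_name)
    sagemaker_client.delete_endpoint_config(EndpointConfigName=endpoint_config_name)
    for name in model_names:
        sagemaker_client.delete_model(ModelName=name)
```

Component deletion is asynchronous, so a production script may need to wait for components to disappear before deleting the endpoint. That wait is left out of this sketch.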
Conclusion
With OpenAI-compatible API support, Amazon SageMaker AI removes the integration barrier between where most AI applications live today and the infrastructure they need to scale. You can keep your existing code, use any OpenAI-compatible framework, and run inference on dedicated endpoints with the GPU, scaling, and data residency controls you need. To get started, deploy a model to a SageMaker AI real-time endpoint using a supported container, install the SageMaker Python SDK, and point your OpenAI client at the endpoint URL. To learn more, see Use SageMaker AI with OpenAI-compatible APIs in the Amazon SageMaker AI Developer Guide, or open the Amazon SageMaker AI console to create your first endpoint.