
Intel Pentium 4 "Prescott" - A Hot Stumbling Block
15.02.2004.
Page 2 of 11 :: By Ozren Žagmešter
- Firefighters -

Firefighters - Improved Branch Predictor, ALU, SSE3, etc.

The first firefighter for the overlong pipeline that we will look at is the Improved Branch Predictor. Most stalls in the processor are caused by the need to flush and then refill the pipeline after a mispredicted branch. To reduce this penalty as much as possible, Intel did a great deal of theoretical and practical research into the behavior of the prediction algorithm and ultimately managed to improve it. We won't go into how the whole thing works, since that would interest only a theorist. What matters is the table below, which shows how much progress was made. For the demonstration Intel used SPECint_base2000 (the figures are the number of mispredicted branches per 100 instructions):

SPECint_base2000 subtest   Northwood   Prescott   Improvement (%)
164.gzip                   1.03        1.01         1.9
175.vpr                    1.32        1.21         8.3
176.gcc                    0.85        0.70        17.7
181.mcf                    1.35        1.22         9.6
186.crafty                 0.72        0.69         4.2
197.parser                 1.06        0.87        17.9
252.eon                    0.44        0.39        11.4
253.perlbmk                0.62        0.28        54.8
254.gap                    0.33        0.24        27.3
255.vortex                 0.08        0.09       -12.5
256.bzip2                  1.19        1.12         5.9
300.twolf                  1.32        1.23         6.8

TOTAL                                              12.8
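The improvement column and the total can be recomputed from the Northwood and Prescott counts. The sketch below assumes each improvement is the relative drop in mispredictions and that the total is the plain arithmetic mean of the twelve subtests; the function and variable names are ours, not Intel's. Because the published counts are rounded to two decimals, a recomputed value may differ from the published column in the last digit.

```python
# Mispredicted branches per 100 instructions, copied from the table:
# subtest -> (Northwood, Prescott)
MISPREDICTIONS = {
    "164.gzip": (1.03, 1.01),
    "175.vpr": (1.32, 1.21),
    "176.gcc": (0.85, 0.70),
    "181.mcf": (1.35, 1.22),
    "186.crafty": (0.72, 0.69),
    "197.parser": (1.06, 0.87),
    "252.eon": (0.44, 0.39),
    "253.perlbmk": (0.62, 0.28),
    "254.gap": (0.33, 0.24),
    "255.vortex": (0.08, 0.09),
    "256.bzip2": (1.19, 1.12),
    "300.twolf": (1.32, 1.23),
}


def improvement_percent(northwood, prescott):
    """Relative reduction in mispredictions, in percent (negative = regression)."""
    return (northwood - prescott) / northwood * 100.0


def average_improvement(table):
    """Unweighted arithmetic mean of the per-subtest improvements."""
    values = [improvement_percent(n, p) for n, p in table.values()]
    return sum(values) / len(values)


if __name__ == "__main__":
    for name, (n, p) in MISPREDICTIONS.items():
        print(f"{name:12s} {improvement_percent(n, p):6.1f}%")
    print(f"{'TOTAL':12s} {average_improvement(MISPREDICTIONS):6.1f}%")
```

Note that the mean is unweighted: 253.perlbmk alone contributes a large share of the total, while 255.vortex actually gets slightly worse.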

As we can see, the new Branch Predictor should on average perform 12.8% better. That is a nice improvement, but not enough on its own: 12.8% better prediction cannot make up for a pipeline that is 50+% longer. That is why Intel also tweaked the ALUs. Their number is unchanged: three, of which two handle simple operations (and run at twice the processor's clock frequency) and one handles complex operations. One of the simple ALUs has gained a unit for computing rotates and similar operations, so it now does all of those calculations instead of the slowest ALU, the one for complex operations. Integer multiplication has been moved from the FPU to the ALU, which speeds that operation up as well.
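Returning to the branch-prediction trade-off, a rough back-of-the-envelope check shows why 12.8% is not enough. The sketch assumes that the total flush cost scales as (misprediction rate) x (pipeline length), which is our simplification for illustration and not a measured model; the 12.8% and 50% inputs are the figures quoted in the text.

```python
def relative_flush_cost(misprediction_reduction, pipeline_growth):
    """Flush cost of the new design relative to the old one (1.0 = unchanged).

    Assumes cost = (misprediction rate) x (pipeline length). This is a
    simplification made for this example, not a measured model.
    """
    return (1.0 - misprediction_reduction) * (1.0 + pipeline_growth)


def break_even_reduction(pipeline_growth):
    """Misprediction reduction needed to keep the flush cost unchanged."""
    return 1.0 - 1.0 / (1.0 + pipeline_growth)


if __name__ == "__main__":
    cost = relative_flush_cost(0.128, 0.50)  # 12.8% fewer misses, 50% longer pipe
    print(f"relative flush cost:  {cost:.2f}")
    print(f"break-even reduction: {break_even_reduction(0.50):.1%}")
```

Under this assumption the predictor would have to remove roughly a third of all mispredictions just to break even, which is why the remaining "firefighters" are needed.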

Intel has also improved data prefetching, both hardware and software. There are more Write Combining buffers as well, which allows more store and load instructions to execute. Intel has also noticeably increased the amount of cache: Prescott comes with 16kB of L1 and 1024kB of L2 cache, while the older Northwood comes with 8kB (L1) + 512kB (L2). For P4 processors the amount of cache is extremely important, as the P4 Extreme Edition processors showed nicely. Unfortunately, cache latency on Prescott is higher (data pulled from CPU-Z):

Athlon 64 cache latency:
Level 1 size = 64KB latency = 3 cycles
Level 2 size = 1024KB latency = 14 cycles (11 cycles for this only level)

Northwood:
Level 1 size = 8KB latency = 2 cycles
Level 2 size = 512KB latency = 19 cycles (17 cycles for this only level)

Prescott:
Level 1 size = 16KB latency = 4 cycles
Level 2 size = 1024KB latency = 30 cycles (26 cycles for this only level)

Different programs will report different numbers, but they all agree on one thing: L1 and L2 cache latency on Prescott is higher than on Northwood.
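To see how larger caches and higher latencies pull in opposite directions, one can plug the CPU-Z figures above into the textbook average-access-time formula. This is only a sketch: the latencies are the CPU-Z values quoted above (L1, and L2 "for this only level"), while the miss rates and the main-memory latency are hypothetical placeholders chosen for illustration, not measurements of either processor.

```python
def average_access_cycles(l1_latency, l2_only_latency, memory_latency,
                          l1_miss_rate, l2_miss_rate):
    """Average cycles per memory access for a two-level cache hierarchy."""
    return l1_latency + l1_miss_rate * (
        l2_only_latency + l2_miss_rate * memory_latency
    )


# (L1 latency, L2-only latency) in cycles, from the CPU-Z output above.
NORTHWOOD = (2, 17)
PRESCOTT = (4, 26)

MEMORY_LATENCY = 200  # hypothetical main-memory latency in cycles

if __name__ == "__main__":
    # Hypothetical miss rates; identical for both chips in the first comparison.
    same = dict(l1_miss_rate=0.05, l2_miss_rate=0.20)
    print("Northwood:", average_access_cycles(*NORTHWOOD, MEMORY_LATENCY, **same))
    print("Prescott: ", average_access_cycles(*PRESCOTT, MEMORY_LATENCY, **same))

    # Hypothetical lower miss rates standing in for Prescott's doubled caches.
    lower = dict(l1_miss_rate=0.03, l2_miss_rate=0.10)
    print("Prescott (fewer misses):",
          average_access_cycles(*PRESCOTT, MEMORY_LATENCY, **lower))
```

With equal miss rates the higher latencies always cost Prescott cycles; whether the doubled caches win that back depends entirely on how much they reduce misses in a given program.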

The last new feature we will cover is the SSE3 instruction set. Judging by the name, you might conclude that it is an extensive instruction set like SSE and SSE2, but in reality it is not. It consists of only 13 new instructions that were originally to be called PNI (Prescott New Instructions), until Intel decided that SSE3 was a much better choice for marketing purposes. Most of the instructions are there to improve complex arithmetic (addsubps, addsubpd, movsldup, movshdup and movddup). There are four more instructions for graphics (haddps, hsubps, haddpd and hsubpd), one for x87-to-integer conversion (fisttp), one for video encoding (lddqu) and two for thread synchronization (monitor, mwait). The last two are there to make Hyper-Threading more efficient and, in our view, don't really belong in the same group as the rest. HT has been improved in general, but we don't yet know exactly how. It seems to us that most of the effort went into reducing the "performance hit" in some cases when HT is enabled, and less into further speeding up the things that already run well under HT.
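To make the "complex arithmetic" group more concrete, the sketch below emulates the lane behavior of a few of these instructions on plain Python lists of four floats and combines them into a packed complex multiplication. It is an emulation for illustration only, not real SIMD code; `swap_pairs` and `mul` are our stand-ins for an ordinary SSE shuffle and packed multiply, and `complex_multiply` is one common way to use these instructions, not necessarily the sequence Intel had in mind.

```python
def movsldup(v):
    """Duplicate the even-indexed elements: [v0, v0, v2, v2]."""
    return [v[0], v[0], v[2], v[2]]


def movshdup(v):
    """Duplicate the odd-indexed elements: [v1, v1, v3, v3]."""
    return [v[1], v[1], v[3], v[3]]


def addsubps(a, b):
    """Subtract in even lanes, add in odd lanes."""
    return [a[0] - b[0], a[1] + b[1], a[2] - b[2], a[3] + b[3]]


def haddps(a, b):
    """Horizontal add: sums of adjacent pairs of a, then of b."""
    return [a[0] + a[1], a[2] + a[3], b[0] + b[1], b[2] + b[3]]


def swap_pairs(v):
    """Stand-in for a shuffle that swaps the two elements of each pair."""
    return [v[1], v[0], v[3], v[2]]


def mul(a, b):
    """Stand-in for a packed lane-by-lane multiply."""
    return [x * y for x, y in zip(a, b)]


def complex_multiply(a, b):
    """Multiply two pairs of complex numbers packed as [re0, im0, re1, im1]."""
    from_real = mul(movsldup(a), b)              # [ar*br, ar*bi, ...]
    from_imag = mul(movshdup(a), swap_pairs(b))  # [ai*bi, ai*br, ...]
    return addsubps(from_real, from_imag)        # [ar*br - ai*bi, ar*bi + ai*br, ...]


if __name__ == "__main__":
    # (1 + 2i) * (3 + 4i) and (0.5 - 1i) * (2 + 2i), two products at once.
    print(complex_multiply([1.0, 2.0, 0.5, -1.0], [3.0, 4.0, 2.0, 2.0]))
    print(haddps([1.0, 2.0, 3.0, 4.0], [5.0, 6.0, 7.0, 8.0]))
```

The point of addsubps is visible in the last step: the real part needs a subtraction and the imaginary part an addition, and the instruction does both in one operation.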



(c) 1999-2012 :: PC Ekspert Team :: All rights reserved. ISSN 1334-2940