- ×îºóµÇ¼
- 2015-3-17
- ÔÚÏßʱ¼ä
- 1396 Сʱ
- ¼ÄÍбÒ
- 22475
- ÉùÍû
- 266
- ×¢²áʱ¼ä
- 2003-7-14
- ÔĶÁȨÏÞ
- 255
- Ìû×Ó
- 188
- ¾«»ª
- 88
- »ý·Ö
- 4353
- UID
- 140258
   
- ÉùÍû
- 266
- ¼ÄÍбÒ
- 22475
- ×¢²áʱ¼ä
- 2003-7-14
- ¾«»ª
- 88
- Ìû×Ó
- 188
|
ÎÒ´Ó½ñÌìÔçÉÏ8µã¿ªÊ¼¿´ÎÄÏ×È»ºóдÎÄÕÂдµ½Á˽ñÌìÔçÉÏÏÖÔÚÊÇ0µãÁË~~
ÕâÆªÎÄÕÂÔçÔÚÁ½¸öÀñ°Ý֮ǰ¾ÍÒѾÔڳﻮÁË£¬¿ÉÊÇ¿¼Âǵ½¹ÛµãÒÔ¼°¿ÉÄܵÄÓ°Ï죬ÎÒÏȰÑËü·¢ÔÚÄÚ²¿£¬ÏÈÔÚÄÚ²¿½»Á÷Ò»ÏÂÒâ¼û£¬È»ºóÔٷųöÈ¥£¡
ImongµÄE-raterµ÷²é±¨¸æfor gmat gre and toefl
ǰÑÔ£º
2003Äê7ÔµÄʱºòÎÒ¿ªÊ¼×¼±¸gre×÷ÎÄ£¬È»ºó¾ÍÌýµ½Á˵±Ê±¹ØÓÚe-raterµÄһЩ´«ÑÔ£¬²¢ÇÒ¿´µ½ÁËһЩÁ÷´«Éõ¹ãµÄÌû×Ó£¬ÐÄÀïÃæ×ÔÈ»Êܵ½Á˲»Ð¡µÄÓ°Ïì¡£µÈµ½¿¼Íê×÷ÎÄ£¬ÔÚ×÷Îİæ·ÀÏÌû×ÓµÄʱºò£¬»¹¿´µ½Á˵±Ê±Ôø¾Óдγ³µÃºÜÀ÷º¦£¬Ò²ÊǸúe-raterÓйأ¬Ò²ÓÐÈËÌá³öÁË·´²µµÄÒâ¼û£¬´Ó6Ôµ½8ÔÂÒ»Ö±¶¼ÓÐÕâ·½ÃæµÄ»°Ìâ¡£ÔÙÍùºó£¬¾ÍÊǵ½12ÔµÄʱºòÓÐÒ»Ìì°æÃæÉÏÓÐÈËÌáÆðe-raterµÄÊÂÇ飬ÎÒ¾ÍÈ¥googleËÑÁËһϣ¬ÕâÒ»ËѲ»Òª½ô£¬ÏÅÁËÎÒÒ»´óÌø£ºÎÒÒ»Ö±ÒÔΪûÓÐe-raterÕâôһ¸ö¶«Î÷£¬½á¹ûÇé¿ö±íÃ÷Õâ¶«Î÷µÄÈ·´æÔÚ¡£
Àä¾²ÏÂÀ´×Ô¼ººÃºÃÏëÁËÒ»Ï룬¾õµÃÓбØÒª°ÑÊÂÇé¸ãÇå³þ¡£µ±³õ×Ô¼º×¼±¸×÷ÎĵÄʱºòÔø¾±»Ò»Ð©³³µÃ·Ð·ÐÑïÑïµÄÏûÏ¢Ó°ÏìµÄ·Ç³£À÷º¦£¬»¹ÓôÃÆ¹ýÒ»Õó×Ó¡£ÍƼº¼°ÈË£¬ÒªÊÇÄܰÑÊÂÇéµÄǰǰºóºóµ÷²éÇå³þ£¬Èôó¼Ò¶¼·ÅÐÄ£¬¶¼Ã÷°×£¬ÄǾͺÃÁË¡£ÓÚÊÇÎÒ¾ÍÊÔ×ÅÈ¥ÕÒÏà¹ØµÄ×ÊÁÏ¡ª¡ª»¹ËãÐÒÔË£¬Í¨¹ýÍøÂ磬²¢²»ºÜÀ§ÄѾÍÕÒµ½ÁËÊ®¼¸ÆªºÜ¹Ø¼üµÄÎÄÏ×£¬×ÐϸÑжÁÍêÁËÖ®ºó£¬×ÔÓÐÒ»·¬ÌرðµÄÌå»á£¬¶øÇÒ¾õµÃÊ®·ÖÓбØÒªºÍ´ó¼Ò½»Á÷Ò»ÏÂÕâЩ¸ÐÏë¡£
·Ï»°ÉÙ˵£¬½øÈëÕýÌâ¡£
µÚÒ»²¿·Ö£ºE-raterµÄ´æÔÚÓëÏÖ×´
Ê×ÒªµÄÎÊÌâ±ØÈ»ÊÇÕâ¸ö£ºµ½µ×ÓÐûÓÐe-raterÕâ¸ö¶«Î÷£¿´ð°¸ÊÇ£ºÓУ¡¸ù¾ÝÎÄÏ×µ÷ÑеĽá¹û£¬1999ÄêµÄʱºòETS¾ÍÒѾÔÚ×ÅÊÖÑо¿ÕâÖÖ½Ð×öAutomated Essay GradingµÄ¶«Î÷£¬È»ºó2001ÄêÔÚgmat¿¼ÊÔÖоÍÒѾ¿ªÊ¼Ó¦ÓÃÁË¡£ÁíÍ⣬´ÓÍøÂçÉÏËѼ¯µÄÐÅÏ¢±íÃ÷£¬Ä¿Ç°¹úÄÚºÍETSºÏ×÷µÄÌ©µÂ¼¯ÍÅÒѾÒý½øÁËÕâ¸öµç×ÓÆÀ¸Äϵͳ×÷Ϊ¡°ÏßÉÏÍи£×÷ÎIJâÆÀϵͳ¡±×÷ΪÔÚÍøÂçÉϸø¿¼Éú×¼±¸Íи£×÷ÎĵÄÒ»ÖÖÓªÀûÐÔ·þÎñƽ̨¡£¹ØÓÚÕâÒ»µã¿ÉÒÔ·ÃÎÊÕâ¸öµØÖ·µÃµ½Ö¤Êµ£ºhttp://www.englishtide.com/ets.asp £¨µ«ÊǹØÓÚÍи£Õýʽ¿¼ÊÔµÄ×÷ÎÄÆÀ·Ö£¬ÓÉÓÚÎÒûÓпÌÒâÈ¥ÊÕ¼¯Ïà¹ØÎÄÏ××ÊÁÏ£¬Òò¶øÇé¿ö²»ÄÜÈ·¶¨¡££©
ÄÇô˵µ½GRE£¬Ä¿Ç°µÄ×´¿öÊÇÊ²Ã´ÄØ£¿¸ù¾Ý½ñÌìÔÚETS¹Ù·½ÍøÕ¾ÉÏ·¢²¼µÄÐÅÏ¢£¬¿ÉÒԵóö½áÂÛ£ºÄ¿Ç°GRE×÷ÎĵIJâÆÀÈÔÈ»ÊÇÈ˹¤Íê³É¡£Õâ¸ö½áÂÛÊÇͨ¹ý±È½Ïgre.orgºÍmba.comÉÏÃæµÄ¹ØÓÚessay scoring¹Ù·½ËµÃ÷µÄ´ë´ÇµÃµ½µÄ¡£GMAT·½Ã棬mba.comµÄ´ë´ÇÊÇÕâÑùµÄ£ºEach of your essays in the AWA section will be given two, independent ratings, one of which may be performed by E-rater®. ¶øGRE·½Ã棬gre.orgµÄ´ë´ÇÈçÏ£ºEach essay receives a score from two trained readers, using a 6-point holistic scale.ÔÚGREÕâ±ß£¬Ñ¹¸ù¾ÍûÓÐÌáµ½¹ýe-raterÕâ¸ö×Ö¡£Òò´Ë£¬ÎҵóöµÄ½áÂÛ£¬¾ÍÊÇĿǰµÄGRE×÷ÎÄÈÔÈ»ÊÇÈ˹¤ÆÀ·Ö¡£
£¨ÄÇô£¬Èç¹û¶ÔGRE¿¼Éú£¬µÃµ½Õâ¸ö½áÂÛÒѾ×ã¹»»òÕßÖ»¶ÔÕâ¸ö½áÂÛ¸ÐÐËȤµÄ»°£¬ºóÃæµÄÄÚÈÝÒ²¿ÉÒÔ²»¿´ÁË£©
ÉÏÃæÁ½¶Î½éÉܵÄÓ¦¸ÃÊÇ´ó¼Ò×î¹Ø×¢µÄÊÂÇé¡£¹ØÓÚe-rater±¾Éí£¬ÆäʵÓÐÁ½µã»¹ÊÇÖµµÃÒ»ÌáµÄ¡£µÚÒ»£¬ETSÕâ±ßµÄE-rater´ÓÎÊÊÀµ½ÏÖÔÚ£¬Ò²¾Àú×Ų»¶Ï×ÔÎÒÍêÉÆµÄ¹ý³Ì£¬ÏÖÔÚÓ¦ÓõÄE-raterϵͳµÄÉè¼ÆÒѾÊǶÔ×î³õµÄϵͳÉè¼Æ½øÐÐÁ˸Ľø¡£ºÜÃ÷ÏÔµÄÒ»¸öÀý×Ó£¬¾ÍÊÇÏÖÔÚETSÌṩµÄAutomated Essay Gradingϵͳ£¬³ýÁË×î³õµÄE-rater£¬ÓÖÓÐÁËÒ»¸öCriterionµÄϵͳ£¬ºóÕßÊÇ¿ÉÒÔÌá³öһЩ·ÖÎöÒâ¼û¶ø²»½öÊÇÆÀ·Ö£¬Ä¿Ç°ETSÌṩµÄGRE×÷ÎÄScore it now£¬¾ÍÊÇ»ùÓÚCriterionµÄ¿ª·¢£¬¶øÌ©µÂÔòÔÚÍøÂçÉÏÌṩÁËE-raterºÍCriterionÁ½Õß¡£µÚ¶þ£¬³ýÁËETSµÄE-raterÒÔÍ⣬ÈÔÈ»ÓÐÆäËüµÄ¼¯ÍÅÔÚ×öAutomated Essay GradingÕâÑùµÄÏîÄ¿¡£´Ó2001Äê·¢²¼µÄ¼¸ÆªÎÄÏ××ÛÊöÉÏÄܹ»¿´µ½µÄÓÐ4¸ö°æ±¾µÄAutomated Essay Gradingϵͳ£¬ETS¿ª·¢µÄE-raterÖ»ÊÇÆäÖÐÖ®Ò»¡£
µÚ¶þ²¿·Ö£ºE-rater¹¤×÷ÔÀí
¹ØÓÚE-rater£¬´ó¼ÒÐÄÀïÃæ×î·¸àÖ¹¾µÄ£¬Äª¹ýÓÚ¶ÔÕâ¸öϵͳµÄÓÐЧÐԵϳÒÉ¡£×¶ËµÄÀý×Ó£¬¿ÖÅÂÊÇÔø¾Óп¼ÉúÔÚGMATµÄ¿¼ÊÔÖУ¬ÔÚ×Ô¼ºµÄessayÀïÃæÐ´¹ý¡°I don¡¯t want to be graded by a robot¡±ÕâÑùµÄ¾ä×Ó¡£
1999ÄêBusiness Week¶ÔFred McHale£¬µ±Ê±µÄGMAC Vice-President for Assessment & Research£¬½øÐÐÁ˲ɷã¬ÆäÖоÍÌáµ½ÁËÕâ¸öÎÊÌ⣺(http://businessweek.com/bschools/originals/bs90329.htm)
Q: What has been the biggest challenge surrounding the E-Rater since you've implemented it? Have you encountered a lot of skepticism? Are folks scratching their heads wondering how this electronic assessment software actually works, wondering if the results have any validity?
A: There has been a lot of skepticism, and it was expected. People tend to think that E-Rater is just your average grammar-checker on your word processor. But that's just not the case. All we can do is show the results.
µÄÈ·£¬all they can do is to show the results. ÔÚETS¹Ù·½ÍøÕ¾ÉÏÕÒµ½µÄÎÄÏ×±íÃ÷£¬E-raterÅзֺÍhuman readerÅзÖÖ®¼äµÄÒ»ÖÂÐÔÒ»Ö±ÊÇËûÃǵÄÑз¢×鹨עµÄÖØµã¡£ÔÚÖÁÉÙ3ƪÎÄÏ×Öж¼Ìáµ½ÁËÈçÏÂËùÊöµÄʵÑ飺¶Ôijһ¸öÌâÄ¿ÕÒ³öhuman reader¸ø³öÁË1,2,3,4,5,6µÄÎÄÕ¸÷nƪ£¬È»ºóÄøøE-raterÅУ¬È»ºóÑо¿¸ø³ö·ÖÊýµÄÒ»ÖÂÐÔ¡£ÊµÑé½á¹û±íÃ÷£¬E-raterµÄÅзÖÓëhuman readerÅзֵÄExact AgreementÓëAdjacent AgreementµÄÇé¿öÊǾø¶Ô¶àÊý£¬¶ø³öÏÖDisagreementµÄÇé¿öÔòÊǾø¶ÔÉÙÊý¡£¸ù¾Ý¹«²¼µÄʵÑé½á¹û£¬¸÷¸ö·ÖÊý¶ÎµÄE-raterÅзÖÓëHuman readerÒ»ÖÂÐÔ×ÜÊÇ´óÓÚ80%£¬Æ½¾ùagreementµÄ±ÈÂÊÔÚ90%×óÓÒ¡£´ËÍ⣬¿¼Âǵ½Á½¸öHuman readerÖ®¼äÅзÖÒ»ÖÂÐÔÒ²´æÔÚ²îÒ죨ÕâÒ»µãÒ²ÓÐÏà¹ØµÄʵÑé¼Ç¼£©£¬ÔÙ½«ÕâÖÖ²îÒìºÍE-raterÓëHuman readerÖ®¼ä²îÒìµÄÇé¿ö½øÐбȽϣ¬ËùµÃµ½µÄ½áÂÛÊÇ£¬E-raterÅзֵÄÓÐЧÐÔ£¨Ò²¾ÍÊÇÎÄÏ×Öз´¸´³öÏÖµÄAutomated Essay Score Validity£©ÊÇÍêÈ«¿ÉÒԵõ½±£Ö¤µÄ¡£
ÄÇE-raterµ½µ×ÊÇÔõô×öµ½ÕâÒ»µãµÄ£¿
Ò»¸ö¼ÆËã»ú³ÌÐòÄܹ»×ö³ö¶ÔһƪÎÄÕµÄÁ¿»¯ÆÀ¼Û£¿
Ò²ÐíÎÒÃǵÚÒ»´ÎÌý˵E-raterµÄʱºò£¬ÐÄÀïÃæ²úÉúµÄÒÉÎʾÍÊÇ£º¡°Ò»¸ö¼ÆËã»ú³ÌÐòÄÜ×öʲô£¿Í³¼ÆÎÄÕÂ×ÖÊý£¿¼ÆËãÆ½¾ù¾ä×Ó³¤¶È£¿Ä³¸ö×ÖÑÛ³öÏֵįµÂÊ£¿È»ºó¾ÍÆÀ¸ö·Ö³öÀ´£¿²»Êǰɣ¿¡±
ÕâÑùµÄ¿´·¨£¬¿ÖÅÂС¿´ÁËE-raterµÄ¹¦ÄÜÁË¡£
ËäÈ»E-raterµÄ¾ßÌåÆÀ¼Ûʶ±ðµÄÉè¼ÆÎÒÃÇÎÞ´ÓµÃÖª£¨Õâ¸ö×ÔÈ»£¬»ù±¾ÉÏÊÇÉÌÒµÃØÃÜ£©£¬²»¹ý´ÓĿǰ¿ÉÒÔÄõ½µÄÎÄÏ×ÖÐÒ²¿ÉÒÔ¿´³öÒ»µã¶ËÄßÀ´£¬ÀýÈ磺I also assume that shrinking high school enrollment¡ Õâ¾ä»°£¬ÖÁÉÙ¿ÉÒÔ·ÖÎö³öÀ´£ºalso±í´ïÁËparallel argument£¬that±í´ïÁËclaim£¬¾ä×ÓÉæ¼°µ½µÄcontentÔòÓÐassume shrink high school enrollment¡ Ò²¾ÍÊÇ˵£¬E-rater¹¤×÷ÔÀí£¬Ô¶Ô¶²»ÊǼòµ¥µÄͳ¼Æµã×ÖÊý£¬Í³¼ÆµãÓÃ´ÊÆµÂÊ¡£
ÔÙ¾ÙÒ»¸öÀý×Ó£º
Q: Differentiate between triggers and stored procedures.
A: Triggers are programs embedded within a table that are automatically invoked by updates to another table. Stored procedures are programs embedded within a table that can be called from an application program.
´ÓÕâÒ»¶ÎÖпÉÒÔʶ±ð³öʲô¶«Î÷ÄØ£¿ÎÄÏ×Öиø³öÁËÖÁÉÙÕ⼸µã£º
Syntactic Variety: ¡can be called from a program
¡that a program can call
Synonymy: ¡can be invoked from a program¡
Negation: ¡are NOT invoked by updates¡
Anaphoric Reference: TRIGGERS are programs. THEY are embedded¡
Òò´Ë¿ÉÒÔ¿´³ö£ºE-raterËùʶ±ðµÄÔªËØÒ²ÐíÔ¶Ô¶³¬¹ýÎÒÃÇÒ»°ãÄܹ»¹¹Ïë³öµÄ·¶Î§£¬¶ø¿ÖÅÂÎÒÃDz»µÃ²»³ÐÈÏÕâÖÖʶ±ðÊǺÏÀíÉè¼Æ²¢Ó¦Óõġ£²Î¿¼ÏÂÃæÕâ¶ÎÎÄÏ×£º
E-rater focuses on three general classes of essay features: discourse, indicated by various rhetorical features that are expected to occur throughout an essay; syntactic, indicated by the structure of sentences; and content, indicated by prompt-specific vocabulary expected to be present in the essay. A total of 59 features are ¡°extractable,¡± but in practice usually only the most predictive features, as measured by their regression weights, are retained and used for further scoring.
ÉÏÎÄÌá¼°µÄ59¸öfeatureÊÇÏ൱¹ã·ºµÄ¡£ÀýÈ磬¾Ísyntactic variety¶øÑÔ£¬ÎÄÏ×Öиø³öÁËÈçϼ¸µã£¨µ±È»£¬Õâ¸ölistÊDz»ÍêÈ«µÄ£©£ºnumber of complement clauses, subordinate clauses, infinitive clauses and relative clauses, occurrences of subjunctive modal auxiliary verbs such as would, could, should, might and may. ¶ÔÓÚArgument structure£¬E-rater×ÅÖØÊ¶±ðparallelism, contrast, evidence, argument developmentÒÔ¼°ÆäËüһЩcoherence relations. ÖÁÓÚDiscourseµÄ·½Ã棬ÏÂÃæÒ»¶ÎÎÄÏ׷dz£ÓÐÆô·¢ÐÔ£º
Literature in the field of discourse analysis points out that rhetorical relations can often be identified by the occurrence of cue words and specific syntactic structures (Cohen 1984, Mann and Thompson 1988, Hovy, et al. 1992, Hirschberg and Litman 1993, Van der Linden and Martin 1995, Knott 1996). E-rater follows this approach by identifying and quantifying an essay¡¯s use of cue words and other rhetorical structure features. For example, we adapted the conceptual framework of conjunctive relations from Quirk, et al. (1985) in which phrases such as ¡°In summary¡± and ¡°In conclusion,¡± are classified as conjuncts used for summarizing. E-rater identifies these phrases and others as cues for a Summary relation. Words such as ¡°perhaps¡± and ¡°possibly¡± are considered to be cues for a Belief relation, one used by the writer to express a belief while developing an argument in the essay. Words like ¡°this¡± and ¡°these¡± are often used within certain syntactic structures to indicate that the writer has not changed topics (Sidner 1986). In certain discourse contexts, structures such as infinitive clauses mark the beginning of a new argument.
ÓÉÉÏÎÄ¿ÉÒÔ¿´³ö£¬Í¨¹ý¶ÔÎÄÕµÄfeatureµÄʶ±ð£¬E-raterÍêÈ«¿ÉÒÔ×ö³ö¶ÔÎÄÕµÄÏà¹ØÅжϡ£¶øÏÂÃæ¾ÍÊÇÒ»¸öʵ¼ÊµÄÀý×Ó¡£¾ÍcoherenceÕâ¸ö·½Ãæ¶øÑÔ£¬ÏÂÃæµÄpassageµÃµ½ÁË6·Ö£¬ÆÀÓïÊÇ¡°The following paragraph demonstrates an example of a maximally coherent text, centering the company ¡¯Famous name¡¯s Baby Food¡¯ and continuing with the same center through the entire paragraph.¡±
Yet another company that strives for the ¡±big bucks¡± through conventional thinking is Famous name¡¯s Baby Food. This company does not go beyond the norm in their product line, product packaging or advertising. If they opted for an extreme market place, they would be ousted. Just look who their market is. As new parents, the Famous name customer wants tradition, quality and trust in their product of choice. Famous name knows this and gives it to them by focusing on ¡±all natural¡± ingredients, packaging that shows the happiest baby in the world and feel good commercials the exude great family values. Famous name has really stuck to the typical ways of doing things and in return has been awarded with a healthy bottom line.
¶øÏÂÃæÕâ¶ÎÆÀÓïºÍÏàÓ¦µÄÀýÎĽøÒ»²½ËµÃ÷ÁËE-rater¶ÔcoherenceµÄʶ±ð£º
Following the same mark-up conventions, we demonstrate text incoherence with an excerpt (a paragraph again) of a student essay scored 4. In this case, repeated Rough-Shift transitions are identified. Several entities are centered, opinion, success and conventional practices, none of which is linked to the previous or following discourse. This discontinuity created by the very short lived Cbs makes it hard to identify the topic of this paragraph and at the same time it is capturing the fact that the introduced centers are poorly developed.
ÏÂÃæ¾ÍÊÇËù˵µÄÕâ¶Î¶«ÀÎ÷³¶µÄ¶øÔÚcoherenceÉϱ»ÅÐΪ4·ÖµÄpassage:
I disagree with the opinion stated above. In order to achieve real and lasting success a person does not have to be a billionaire. And also because conventional practices and ways of thinking can help a person to become rich.
×ÛÉÏËùÊö£¬E-raterÓÐÄÜÁ¦×öµ½¶ÔÎÄÕ½øÐÐʶ±ðºÍÅж¨£¬ÎÄÏ×ÖÐÕª³öµÄÏÂÃæÕâÈý¶Î»°Êǵڶþ²¿·ÖÄÚÈݵÄ×îºÃ½áÊøÓï¡£
Overall, while it is largely the case that the raters were not actually counting occurrences of indicator cues representing e-rater features, they were tracing qualities that incorporate such features.
Specifically, when an essay writer would make a certain type of assertion in the essay, the raters would expect to see the associated use of certain types of syntactic structures. The absence of such syntax in such an instance would render the assertion superficial. While essays with and without such syntactic variety were both seen, clearly the essays containing the syntactic variety associated with that type of discourse were viewed by the raters as superior.
Obviously, e-rater does not read an essay, so it cannot ¡°look for¡± or ¡°evaluate¡± writing qualities. However, e-rater can, and does in some instances, detect evidentiary traces, the proverbial ¡°breadcrumbs in the path,¡± that signal these qualities, using its own version of the characteristics.
µÚÈý²¿·Ö£ºE-raterÔÚ¿¼ÊÔÆÀ·ÖÖеĹý³Ì
ÄÇô˵ÁËÕâô°ëÌì£¬ËÆºõE-rater±¾ÉíÊÇÒ»¸öÏ൱ǿ´óµÄ¹¤¾ß¡£Ò²Ðí´ó¼Ò»á²úÉúÕâÑùµÄÎÊÌ⣺E-raterÈç¹ûÕâÑùÇ¿Á¦£¬×÷ÎĵÄGradingÊDz»ÊÇ¿ÉÒÔÍÑÀëÈ˹¤µÄÅÐÔÄÁË£¿
ÏÖÔÚÎÒ¾ÍÒª½â¾öÕâ¸öÎÊÌâ¡£
¸Õ²Å˵Á˺ܶàE-rater¶ÔÎÄÕµÄʶ±ð¾ÍÏñ¡°breadcrumbs in the path¡±£¬ÄÇE-raterÈçºÎ¾ÍÄܹ»ÈϳöÕâЩBreadcrumbÄØ£¿ºÜÏÔÈ»£¬Èç¹ûûÓÐÔ¤ÏȵÄÈ˹¤µ÷ÊÔºÍËØ²ÄÊäÈ룬E-raterʲôҲʶ±ð²»³öÀ´¡£Ò²¾ÍÊÇ˵£¬E-rater²»¿ÉÄܶÀÁ¢µÄ¶ÔһƪÎÄÕ½øÐÐÆÀÅУ¬E-raterÅзÖ֮ǰ£¬ÆäʵÊÇÁíÓÐÒ»¸öÔ¤ÏȵĹý³Ì¡£
´ó¼Ò¿ÉÄܶ¼ÒѾ֪µÀ£¬¶ÔÓÚÿһ¸ö¶ÀÁ¢ÌâÄ¿£¬E-raterÄÇÀïÒѾ´æºÃÁ˼¸°ÙƪԤÏÈÈ˹¤ÆÀºÃ·Ö£¨¸÷¼¶·ÖÊý¶¼Óм¸Ê®Æª£©µÄÎÄÕ£¬ÕâЩÎÄÕµÄÓô¦ÊÇÊ²Ã´ÄØ£¿Ñ§Àí¿ÆµÄͬѧ£¨esp.ѧ¹ý»¯Ñ§µÄ£©¿´µ½Õâ¸ö´Ê¹À¼ÆÁ¢¿Ì¾ÍÃ÷°×ÁË£º±ê¶¨¡£ÉÏÎÄÒѾÌáµ½£¬E-rater¿ÉÒÔʶ±ð³ö50¶à¸ölinguistic feature£¬µ«ÊÇÿ¸öfeatureÖ®¼ä²¢·Ç¼òµ¥¼ÓºÍ£¬¶øÊÇÓÐregression wightµÄÒòËØ£¬ÄÇÕâ¸öregression weight´ÓÄÄÀïÀ´£¿µ±È»ÊÇÒªÓá°±ê×¼ÎÀ´¡°±ê¶¨¡±£¬Ò²¾ÍÊÇÓá°±ê×¼essay¡±À´È·Á¢£¡
ÕâÒ²¾ÍÒý³öÁËÕâÆªÎÄÕÂÀïÎÒҪǿµ÷µÄÒ»¸öºËÐÄÊÂʵ£ºÈç¹ûE-raterµÄÆÀ·Ö²»ÄܺÍHuman readerµÄÆÀ·ÖÓÐЧÎǺϣ¬E-rater¾Í¸ù±¾Ã»Óã¡Ò²¾ÍÊÇ˵£¬E-raterµÄÆÀ·Ö¹æÔò£¬È«²¿¶¼ÊÇÀ´Ô´ÓÚÔÏÈÉèÁ¢µÄHuman readerµÄÆÀ·Ö±ê×¼£¬¶¼ÊÇÒÔHuman readerµÄ±ê׼Ϊ»ù´¡¡£È«²¿µÄ±ê×¼£¬¶¼ÊÇÈËÈ·Á¢µÄ¡£
Ò²¾ÍÊÇ˵£¬E-rater±ØÐë·þ´ÓÈË£¬±ØÐëͨ¹ýµ÷ÊÔ´ïµ½Æä×ÔÉí×î´ó³Ì¶ÈµÄÓëHuman readerµÄÒ»Ö¡£
µÚËIJ¿·Ö£ºËùνµÄ¡°¶Ô²ß¡±
³öÏÖÁËÒ»¸öÐµĶ«Î÷£¬Ò»¿ªÊ¼µÄʱºò¿Ï¶¨»áÔì³ÉһЩÒÉÂǺͽôÕÅ£¬ÓÚÊÇ¡°E-raterµÄÓ¦¶Ô²ßÂÔ¡±¾Í³ÉΪÁËÈÈÃÅ»°Ìâ¡£ÄÇô£¬ÕæµÄÓÐÕâÑùµÄÓ¦¶Ô²ßÂÔÂð£¿
Ôø¾ÓÐһƪÁ÷´«ºÜ¹ãµÄÌû×ÓÀïд¹ý£º¡°µçÄÔÆÀ·ÖÆ÷DZÔڵظøÈ˹¤ÆÀ·ÖÕßÊ©¼ÓѹÁ¦¡£µçÄÔÆÀ·ÖÆ÷ºÍÈ˹¤ÆÀ·ÖÕ߸÷×Ô¸øÄãµÄ×÷ÎÄÆÀ·Ö£¬Èç¹û½á¹û´æÔÚ׎ϴóµÄ²îÒ죬ÄãµÄ×÷ÎĽ«»á±»´«µ½µÚÈý¸öÈ˹¤ÆÀ·ÖÕßµÄÊÖÖУ¨µ±È»ÕâÌá¸ßÁËETSµÄ·ÑÓã©¡£ÕâÄËÊô³£Àí£¬µ«ETS¾Ü¾øÕâô×ö£¬ËùÒÔΨһµÄ½á¹ûÊÇÈ˹¤ÆÀ·ÖÕß½«¾¡Á¦×ñѵçÄÔÆÀ·ÖÆ÷µÄ±ê×¼ºÍ¹æÔò¡£Ò²¾ÍÊÇ˵£¬ÒÔµçÄÔÆÀ·ÖÆ÷µÄÅзÖΪ׼£¬ÒòΪÏñGREÕâÑùµÄ±ê×¼»¯¿¼ÊÔÊDz»ÈÝÖ÷¹ÛºÍÆ«¼ûµÄ¡£ ËùÒÔ£¬²»ÒªÊÔͼȡÔÃÈ˹¤ÆÀ·ÖÕßÆÚ´ýËûÄÜ·ñ¾öµçÄÔÆÀ·ÖÆ÷¸øÄãµÄµÍ·Ö£¬¶øÓ¦¸Ã¾¡Á¿·ûºÏµçÄÔÆÀ·ÖÆ÷µÄ¹æÔò¡£¡±
¶Ô´Ë£¬ÎÒÔÚ´ËÔٻعËһϸղŵĽáÂÛ£ºE-raterµÄÆÀ·Ö¹æÔòÈ«²¿À´Ô´ÓÚHuman readerµÄÔ¤É裬Ҳ¾ÍÊÇ˵£¬Èç¹ûÓÐËùνµÄ¡°E-raterÓ¦¶Ô²ßÂÔ¡±£¬ÄÇÕâ¸ö²ßÂÔÒ²¾ÍÓ¦¸ÃÊÇÄܹ»Ó¦¶ÔHuman readerµÄ²ßÂÔ£¬ÔÙ½øÒ»²½Ëµ£¬Ò²²»¹ý¾ÍÊÇÒ»°ãµÄд×÷²ßÂÔ£¡
Ò²¾ÍÊÇ˵£¬ÈκÎËùνµÄ¡°ÓºÏE-raterµÄд×÷²ßÂÔ¡±£¬Èç¹ûÕæµÄ¿ÉÒÔ¡°ÓºÏ¡±E-raterµÄ»°£¬ÄÇÒ²¾ÍÊÇÔÚ¡°ÓºÏ¡±Human reader£¬ÄÇÒ²¾Í¸ù±¾Ã»Ê²Ã´ÐÂÏʵ쬽ö½öÊÇ×î»ù´¡×î¸ù±¾µÄд×÷·½·¨¡£¡°ÓºÏE-raterµÄд×÷²ßÂÔ¡±Õâ¸ö¶ÌÓïµÄÈ·Äܹ»ÎüÒýÑÛÇò£¬µ«ËüËù¸ø³öµÄ¡°²ßÂÔ¡±²¢·ÇËüÓÕµ¼È˹¹ÏëµÄ¡°µ¥¶ÀÓºÏE-rater¶øÊ¹»úÆ÷ÇãÏò¸ø¸ß·Ö¡±µÄ¡°²ßÂÔ¡±£¬ÒòΪ¸ù±¾¾ÍûÓÐÕâÖÖ²ßÂÔ£¡
ÕâÒ»µã¿ÖÅÂÓëÒ»²¿·ÖÈ˵ÄÉèÏ벻̫һÑù¡£¿ÉÄÜÓÐÏ൱һ²¿·ÖÈ˶¼ÐÄ´æ»ÃÏ룬ÈÏΪÓÐÄܹ»µ¥¶ÀÆÛƵ½E-raterµÄ²ßÂÔ£¬ÉõÖÁÒ»¸ö¹úÍâµÄÍøÕ¾É϶¼´ò³öÁË¡°how to fool the E-rater¡±µÄ×ÖÑù¡£ÄǺã¬Ëµµ½ÕâÒ»µã£¬ÕâÀïÓÐÒ»¸öÀ´×ÔETS¹Ù·½µÄÀý×Ó£ºETS×éÖ¯ÁËÒ»°ïר¼Ò¿ÌÒâȥдһЩessayÀ´³¢ÊÔtrick the E-rater£¬½á¹ûÄØ£¿×î³É¹¦µÄһλ½ÌÊÚÆµ½ÁË5·ÖµÄÕý²îÒ죨E-raterµÄ¼õÈ¥Human readerµÄ£©£¬È»¶ø¡°His principal strategy was simply to write several paragraphs and to repeat them (37 times, in fact!).¡±£¡¶øºóÃæ¼¸Î»³É¹¦ÕߵIJßÂÔÒ²¶¼ÊÇÀàËÆµÄ£¬ÀýÈç attempted, alternatively, to write essays that is rambled, missed the point, used faulty logic, or were haphazard in their progression, but used relevant content words, complex sentence structure, or other features valued by e-rater. ÇÒ²»ËµÕâЩÈË·ÑÁ˶àÉÙÐÄ»úд³öÕâÑùÄܹ»Ôì³ÉE-rater³öÏÖbugµÄÎÄÕ£¬ÕâÑùµÄÎÄÕ£¬ÄѵÀÔÚHuman readerÄÇÀïÄܹ»¹ý¹Ø£¿ÕâÑùд³öÀ´µÄ¶«Î÷£¬Äܲ»Äܱ»½Ð×ö¡°ÎÄÕ¡±¿ÖÅÂÒªÊ×ÏÈ´òÎʺš£
Èç¹ûËµÕæµÄÓÐfool the E-raterµÄtipµÄ»°£¬ÎÒÏëÉÏÃæÕâ¸ötip¿ÖÅÂÒªÈôó¼ÒʧÍûÁË¡£¶øÄܹ»Ìá³öÀ´µÄËùν¡°Õë¶ÔE-raterµÄtip¡±£¬Ò²²»¹ý¾Í³¬²»³öÄÇЩ³£¹æµÄд×÷¼¼ÇÉ¡£Ã»Ê²Ã´ÐÂÏʵġ£
ÔÙÈç¹û£¬ÕæµÄҪ˵ÓÐʲôtipµÄ»°£¬¿ÖÅÂÒ²Ö»ÓÐÕâÒ»µãÁË£º¾¡Á¿¿Ë·þÓï·¨ºÍƴдµÄ´íÎó£¬ÒòΪÕâÑùµÄ´íÎó»á¸ÉÈÅE-rater¶Ô¸ü¸ß¼¶µÄfeatureµÄʶ±ð¡£²»¹ý˵»ØÀ´£¬Õâ¸öËùνµÄtip£¬Ôõô¿´Ò²¿´²»³öÀ´ÊÇʲôºÜ¸ßÉîµÄ¡°fool the E-rater¡±µÄ¼¼ÇÉ£¬¶øÖ»²»¹ýÊÇ×î×î»ù±¾µÄ¶«Î÷¡£Ò²Ðí£¬ÊÇÒòΪ±¾À´¾Í²»´æÔÚÕâÖÖÕë¶Ô»úÆ÷µÄͶ»ú¼¼ÇÉÄØ£¿ÊDz»ÊÇ£¬E-rater²»ÖµµÃÎÒÃǹý·ÖµÄ¹Ø×¢£¬¶ø¾«Á¦»¹ÊÇÓ¦¸Ã·ÅÔÚ»ù±¾µÄд×÷ˮƽµÄÁ·Ï°ÉÏ£¿
Òò´Ë£¬ÔÚ±¾²¿·ÖµÄ¿ªÍ·ÒýÓõÄÄǶλ°£¬³ýÁËǰ°ë²¿·ÖÊÇÔÚÐðÊöÊÂʵ£¨¶øÇÒ£¬ËùνµÄ¡°Ê©¼ÓѹÁ¦¡±Ò²ÊÇȱ·¦Â۾ݵģ©£¬ºó°ë²¿·ÖµÄÍÆÂÛ£¬Ê²Ã´Ôö¼Ó·ÑÓã¬ETS¾Ü¾øÈçºÎ×ö£¬¸üÓÐÉõÕßʲô¡°È˹¤ÆÀ·ÖÕßŬÁ¦×ñÑE-raterµÄÔÔò¡±£¬µÈµÈÖîÈç´ËÀ࣬ÍêÈ«¶¼ÊÇÔÚȱ·¦Â۾ݵÄÇé¿öÏÂ×÷³öµÄ±¾Ä©µ¹ÖõÄÎóµ¼ÐÔÃýÂÛ¡£
ºó¼Ç£º
Ôø¾ÓÐÈËÌá³ö¹ý¡°E-rater¾ÍÊÇËÑË÷ÒýÇæµÄÔ¶·¿Ç×ÆÝ¡±µÄÂ۵㣬ȻºóÒý³öÁËÈô¸É½áÂÛ¡£²»¹ý¼òµ¥µÄÌá³öÉÏÃæÕâ¸öÂ۵㣬δÃâ»áÈÃÈËÎó½â¡£¾ÍĿǰµÄÎÄÏ×ÐÅÏ¢¶øÑÔ£¬E-raterµÄ¹¦ÄܺÍÉè¼ÆÔÔò¿ÖÅÂÔ¶±ÈÎÒÃÇÄܹ»ÉèÏëµÄ³Ì¶ÈÒª¸´ÔÓ£¬¶øÏëÒªÒ»¾ä»°¾«±ÙµÄ¸ÅÀ¨¿ÖŲ»ÊÇÄÇô¼òµ¥¡£ÖÁÉÙ£¬²ÎÕÕµÚ¶þ²¿·ÖÒýÓõÄ×ÊÁÏ£¬ÎÒÃÇ¿ÉÒÔÉÔ΢׼ȷһµãÀ´ÈÏʶE-raterÕâ¸ö¸ÅÄî¡£
´ÓETSµÄר¼Òµ·ÂÒʵÑéÒ²¿ÉÒÔ¿´µÃ³öÀ´£¬ÎªÊ²Ã´GMAT×î¶àÊÇ¡°one of them MAY be an e-rater¡±£¬ÎªÊ²Ã´¾ø¶Ô²»¿ÉÄÜÊÇÁ½¸öE-rater. »úÆ÷Ö»ÄÜ´ÓÈ˵ÄÉè¼Æ³ö·¢À´ÎªÈË·þÎñ£¬È˵ÄÅжϱê×¼×ÜÊÇ×î¸ù±¾µÄ¡£
ÏÂÃæÁгö±¾ÎĵÄÖ÷Òª²Î¿¼×ÊÁÏ¡£È«²¿×ÊÁ϶¼ÊÇ´ÓÍøÂçËѼ¯µÃÀ´£¬ÔÚgoogleÖÐÊäÈëe-rater×÷ΪËÑË÷¹Ø¼ü×Ö£¬»òÕ߸½¼ÓÉÏdoc»òpdf×÷ΪµÚ¶þ¹Ø¼ü×Ö£¬¿ÉÒÔËÑË÷µÃµ½¾ø´ó²¿·Ö±¾ÎIJο¼µÄÎÄÏס£
http://businessweek.com/bschools/originals/bs90329.htm
http://www.scoreitnow.org/e_rater.html
http://www.ets.org/research/erater.html £¨ÓÐÒ»´ó¶Ñ¹Ù·½·¢²¼µÄ¶«Î÷£©
http://www.englishtide.com/ets_erater.asp
Further Evaluation of Automated Essay Score Validity, P. Adam Kelly, Houston VA Medical Center and Baylor College of Medicine*, April 4, 2002
Automated Scoring of Essays: Evaluating Score Validity, P. Adam Kelly, Houston VA Medical Center and Baylor College of Medicine*,March 1, 2002
Computer Analysis of Essays
Jill Burstein, Karen Kukich, Susanne Wolff,
Chi Lu† and Martin Chodorow‡
Educational Testing Service, Princeton NJ
‡ Hunter College, New York City
Text Understanding Techniques for Automated Assessment, Claudia Leacock, Educational Testing Service, March 2001 GRE Board Professional Report No. 98-08bP ETS Research Report 01-03
Stumping E-Rater: Challenging the Validity of Automated Essay Scoring, Donald E. Powers, Jill C. Burstein, Martin Chodorow, Mary E. Fowles, Karen Kukich
Automated Evaluation of Coherence in Student Essays
Eleni Miltsakaki_, Karen Kukichy
_University of Pennsylvania
619Williams Hall, 36th & Spruce St., Philadelphia, PA 19104-6305, U.S.A.
elenimi@unagi.cis.upenn.edu
yEducational Testing Service
Rosedale Road, Princeton, NJ 08541, U.S.A.
kkukich@ets.org |
|