寄托天下
查看: 3866|回复: 13

[主题活动] 【CASK EFFECT】0910F阅读全方位锻炼--越障【SCI】 1-3 [复制链接]

Rank: 16Rank: 16Rank: 16Rank: 16

声望
3963
寄托币
23288
注册时间
2008-1-2
精华
50
帖子
2141

Sagittarius射手座 AW活动特殊奖 AW作文修改奖 IBT Elegance 挑战ETS奖章 US Advisor US Assistant 荣誉版主

发表于 2009-7-12 14:09:16 |显示全部楼层
本帖最后由 草木也知愁 于 2009-7-13 13:53 编辑


【CASK EFFECT】0910G阅读能力基础自测(速度、难度、深度、越障、真题、RAM)
https://bbs.gter.net/forum.php?mod=viewthread&tid=910464&highlight

【CASK EFFECT】0910G阅读全方位锻炼--难度【LSAT】汇总贴
https://bbs.gter.net/thread-982016-1-1.html


【CASK EFFECT】0910G阅读全方位锻炼--速度【CET】汇总贴
https://bbs.gter.net/thread-982018-1-1.html

【CASK EFFECT】0910F阅读全方位锻炼--越障【SCI】汇总贴
https://bbs.gter.net/thread-982020-1-1.html

【CASK EFFECT】0910G阅读全方位锻炼--真题【GRE】(后期推出)

【CASK EFFECT】0910G阅读全方位锻炼--深度【FICTION】(后期推出)

【CASK EFFECT】0910F阅读全方位锻炼--RAM 汇总贴(后期推出)


规则:


我每天贴出1000字左右的一篇文字(从我平时看的书或者paper里摘的)

没有别的要求,只要大家坚持读完就可以

如果你能坚持一个月,你会发现自己的阅读进化了~

[注]
1、直接在电脑屏幕面前做,虽然GRE阅读是在纸上考,但是这个过程会遏制你做笔记,同时给你的阅读造成视觉障碍,也就是把难度训练和抗干扰训练同步结合,增加效率(初期会很累,但是既然大家想要成为高手,那么就别对自己太温柔)
2、不用苛求速度,看完即可


Gapped BLAST and PSI-BLAST: a new generation of protein database search programs

ABSTRACT
The BLAST programs are widely used tools for
searching protein and DNA databases for sequence
similarities. For protein comparisons, a variety of
definitional, algorithmic and statistical refinements
described here permits the execution time of the
BLAST programs to be decreased substantially while
enhancing their sensitivity to weak similarities. A new
criterion for triggering the extension of word hits,
combined with a new heuristic for generating gapped
alignments, yields a gapped BLAST program that runs
at approximately three times the speed of the original.
In addition, a method is introduced for automatically
combining statistically significant alignments produced
by BLAST into a position-specific score matrix,
and searching the database using this matrix. The
resulting Position-Specific Iterated BLAST (PSIBLAST)
program runs at approximately the same
speed per iteration as gapped BLAST, but in many
cases is much more sensitive to weak but biologically
relevant sequence similarities. PSI-BLAST is used to
uncover several new and interesting members of the
BRCT superfamily.
INTRODUCTION
Variations of the BLAST algorithm (1) have been incorporated
into several popular programs for searching protein and DNA
databases for sequence similarities. BLAST programs have been
written to compare protein or DNA queries with protein or DNA
databases in any combination, with DNA sequences often
undergoing conceptual translation before any comparison is
performed. We will use the blastp program, which compares
protein queries to protein databases, as a prototype for BLAST,
although the ideas presented extend immediately to other
versions that involve the translation of a DNA query or database.
Some of the refinements described are applicable as well to
DNA–DNA comparison, but have yet to be implemented.
BLAST is a heuristic that attempts to optimize a specific
similarity measure. It permits a tradeoff between speed and
sensitivity, with the setting of a ‘threshold’ parameter, T. A higher
value of T yields greater speed, but also an increased probability
of missing weak similarities. The BLAST program requires time
proportional to the product of the lengths of the query sequence
and the database searched. Since the rate of change in database
sizes currently exceeds that of processor speeds, computers
running BLAST are subjected to increasing load. However, the
conjunction of several new algorithmic ideas allow a new version
of BLAST to achieve improved sensitivity at substantially
augmented speed. This paper describes three major refinements
to BLAST.
(i) For increased speed, the criterion for extending word pairs
has been modified. The original BLAST program seeks short
word pairs whose aligned score is at least T. Each such ‘hit’ is then
extended, to test whether it is contained within a high-scoring
alignment. For the default T value, this extension step consumes
most of the processing time. The new ‘two-hit’ method requires
the existence of two non-overlapping word pairs on the same
diagonal, and within a distance A of one another, before an
extension is invoked. To achieve comparable sensitivity, the
threshold parameter T must be lowered, yielding more hits than
previously. However, because only a small fraction of these hits
are extended, the average amount of computation required
decreases.
(ii) The ability to generate gapped alignments has been added.
The original BLAST program often finds several alignments
involving a single database sequence which, when considered
together, are statistically significant. Overlooking any one of
these alignments can compromise the combined result. By
introducing an algorithm for generating gapped alignments, it
becomes necessary to find only one rather than all the ungapped
alignments subsumed in a significant result. This allows the T
parameter to be raised, increasing the speed of the initial database
scan. The new gapped alignment algorithm uses dynamic
programming to extend a central pair of aligned residues in both
directions. For speed, earlier heuristic methods (2,3) confined the
alignments produced to a predefined strip of the dynamic
alignments that drop in score no more than Xg below the best
score yet seen. The algorithm is able thereby to adapt the region
of the path graph it explores to the data.
(iii) BLAST searches may be iterated, with a position-specific
score matrix generated from significant alignments found in
round i used for round i + 1. Motif or profile search methods
frequently are much more sensitive than pairwise comparison
methods at detecting distant relationships. However, creating a
set of motifs or a profile that describes a protein family, and
searching a database with them, typically has involved running
several different programs, with substantial user intervention at
various stages. The BLAST algorithm is easily generalized to use
an arbitrary position-specific score matrix in place of a query
sequence and associated substitution matrix. Accordingly, we
have automated the procedure of generating such a matrix from
the output produced by a BLAST search, and adapted the BLAST
algorithm to take this matrix as input. The resulting Position-
Specific Iterated BLAST, or PSI-BLAST, program may not be as
sensitive as the best available motif search programs, but its speed
and ease of operation can bring the power of these methods into
more common use.
After describing these refinements to BLAST in greater detail,
we consider several biological examples for which the sensitivity
and speed of the program are greatly enhanced.
已有 3 人评分声望 收起 理由
henry117 + 1 草木MM是学生物的啊
dairyman + 1 ee
家家☆yoonjae + 2 嘿嘿~ 草草辛苦~

总评分: 声望 + 4   查看全部投币

使用道具 举报

Rank: 8Rank: 8

声望
925
寄托币
16929
注册时间
2009-5-31
精华
1
帖子
700

荣誉版主 AW活动特殊奖 AW小组活动奖 Cancer巨蟹座 GRE梦想之帆 GRE斩浪之魂 GRE守护之星

发表于 2009-7-12 14:19:18 |显示全部楼层
SF ~\(≧▽≦)/~啦啦啦 马上去看。。。sigh一下草草的排版 =。=
Believe your believes, that's it.

使用道具 举报

Rank: 16Rank: 16Rank: 16Rank: 16

声望
3963
寄托币
23288
注册时间
2008-1-2
精华
50
帖子
2141

Sagittarius射手座 AW活动特殊奖 AW作文修改奖 IBT Elegance 挑战ETS奖章 US Advisor US Assistant 荣誉版主

发表于 2009-7-12 14:20:46 |显示全部楼层
那个是从pdf上粘下来的 分栏格式 没办法。。。

使用道具 举报

Rank: 8Rank: 8

声望
925
寄托币
16929
注册时间
2009-5-31
精华
1
帖子
700

荣誉版主 AW活动特殊奖 AW小组活动奖 Cancer巨蟹座 GRE梦想之帆 GRE斩浪之魂 GRE守护之星

发表于 2009-7-12 14:29:33 |显示全部楼层
【摸下巴ing】 DNA比对的program么?

学到2个新生词~ heuristic method n.探试法
                augmented adj.扩张的
Believe your believes, that's it.

使用道具 举报

Rank: 5Rank: 5

声望
78
寄托币
2811
注册时间
2007-2-10
精华
0
帖子
5

GRE斩浪之魂 GRE梦想之帆

发表于 2009-8-17 11:49:54 |显示全部楼层
我晕了······
看下来真的只懂大概意思而已,细节东西完全不理解:P
heuristic  试探的,启发式的
活出生命的浓度!

使用道具 举报

Rank: 4

声望
68
寄托币
1236
注册时间
2008-10-9
精华
0
帖子
3
发表于 2009-8-21 23:16:39 |显示全部楼层
话说没看懂

使用道具 举报

Rank: 6Rank: 6

声望
166
寄托币
3397
注册时间
2009-1-16
精华
1
帖子
53

GRE梦想之帆 AW小组活动奖

发表于 2009-9-2 00:53:17 |显示全部楼层
alignments生物学上的解释是什么?这个单词给我造成很大的干扰。。
这文章晦涩的我只知道它在讲BLAST
See U in pittsburgh!

使用道具 举报

Rank: 2

声望
9
寄托币
182
注册时间
2009-2-10
精华
0
帖子
2
发表于 2009-9-3 16:24:34 |显示全部楼层
AAA...AAA...AAA 总算读完了

使用道具 举报

Rank: 3Rank: 3

声望
7
寄托币
277
注册时间
2008-3-1
精华
0
帖子
8
发表于 2009-9-8 16:18:26 |显示全部楼层
Alignment应该是一种对齐,或者说是配准吧。。不知道啦。
为什么都是生物的呢呢

使用道具 举报

Rank: 16Rank: 16Rank: 16Rank: 16

声望
3963
寄托币
23288
注册时间
2008-1-2
精华
50
帖子
2141

Sagittarius射手座 AW活动特殊奖 AW作文修改奖 IBT Elegance 挑战ETS奖章 US Advisor US Assistant 荣誉版主

发表于 2009-9-8 17:39:24 |显示全部楼层
Alignment应该是一种对齐,或者说是配准吧。。不知道啦。
为什么都是生物的呢呢
kelp0308 发表于 2009-9-8 16:18


后面就有其他的了

选取【越障】文章就是要选长词难词、人名、专有名词多的资料

使用道具 举报

Rank: 9Rank: 9Rank: 9

声望
430
寄托币
4498
注册时间
2008-1-16
精华
5
帖子
71

荣誉版主 AW小组活动奖 IBT Smart Scorpio天蝎座 GRE守护之星

发表于 2009-9-9 00:32:45 |显示全部楼层
我也只能看懂大概 专业术语好难啊……
新世界!

使用道具 举报

Rank: 8Rank: 8

声望
350
寄托币
6118
注册时间
2009-8-16
精华
2
帖子
198

GRE斩浪之魂

发表于 2009-9-20 01:34:15 |显示全部楼层
10# 草木也知愁
草木mm这篇有几个关键词……
refinement,说的是blask方法的引入和改进,其中引入对比和算法?
读着感觉有重复的地方啊,似乎有地方重复但是草木没删掉,开头一堆和第二大部分的开头似乎一样的说……

使用道具 举报

Rank: 7Rank: 7Rank: 7

声望
401
寄托币
5013
注册时间
2008-9-29
精华
3
帖子
298

GRE斩浪之魂

发表于 2009-12-19 09:28:58 |显示全部楼层
本帖最后由 lghscu 于 2009-12-19 09:35 编辑

这篇ABSTRACT采用algorithm常用路线
背景+旧算法+特点+新算法+改进特点
tradoff between speed and sensitivity是重点
开头一堆和第二大部分的开头似乎一样的说

摘要与引言在内容上确实可以有重复的地方
但也不是完全重复
引言会将摘要的背景描述得更为详尽

-------------
新算法的refinements是
1 低阈值使计算代价低
2 高阈值使库搜索快速
3 自适应机制的引入
-------------

使用道具 举报

Rank: 5Rank: 5

声望
38
寄托币
943
注册时间
2010-8-26
精华
0
帖子
6
发表于 2011-5-10 00:32:28 |显示全部楼层
看是看完了,感觉看的时候脑子直接罢工了。。囧。。
做一个学术的 可爱的 有深度的人

使用道具 举报

RE: 【CASK EFFECT】0910F阅读全方位锻炼--越障【SCI】 1-3 [修改]

问答
Offer
投票
面经
最新
精华
转发
转发该帖子
【CASK EFFECT】0910F阅读全方位锻炼--越障【SCI】 1-3
https://bbs.gter.net/thread-982765-1-1.html
复制链接
发送
回顶部